2022‑2023
National Surveys on Drug Use and Health:
Guide to State Tables and Summary of Small
Area Estimation Methodology

Section A: Overview of NSDUH and Model-Based State Estimates

A.1 Introduction

This document provides information on the model-based small area estimates of substance use and mental health disorders in states based on data from the combined 2022‑2023 National Surveys on Drug Use and Health (NSDUHs). These estimates are available online along with other related information.1

NSDUH is an annual survey of the civilian, noninstitutionalized population aged 12 or older residing within the United States, conducted from January through December, and is sponsored by the Substance Abuse and Mental Health Services Administration (SAMHSA). The survey covers residents of households (e.g., individuals living in houses or townhouses, apartments, and condominiums; civilians living in housing on military bases) and individuals in noninstitutional group quarters (e.g., shelters, rooming or boarding houses, college dormitories, migratory workers’ camps, halfway houses). The 2022 and 2023 NSDUHs used multimode data collection, in which respondents completed the survey via the web or in person in eligible locations. Across 2022 and 2023 combined, NSDUH collected data from about 139,050 respondents aged 12 or older from the 50 states and the District of Columbia.

NSDUH is planned and managed by SAMHSA’s Center for Behavioral Health Statistics and Quality (CBHSQ). Data collection and analysis are conducted under contract with RTI International.2 A summary of NSDUH’s methodology is given in Section A.2. Section A.3 lists all the tables and files associated with the 2022‑2023 state estimates. Information is given in Section A.4 on the Bayesian confidence intervals and how to make interpretations with respect to the small area estimates. Section A.5 discusses related substance use measures and warns users about not drawing conclusions by subtracting small area estimates from two different measures. Section A.6 briefly discusses methodological changes for the 2022 and 2023 NSDUHs.

The survey-weighted hierarchical Bayes (SWHB) small area estimation (SAE) methodology used in the production of state estimates from the 1999 to 20223 surveys also was used in the production of the 2022‑2023 state estimates. The SWHB methodology is described in Appendix E of the 2001 state report (Wright, 2003b) and in Folsom and colleagues (1999). A general model description is given in Section B.1 of this document. A list of measures (outcomes) for which small area estimates are produced is given in Section B.2. Predictors used in the 2022‑2023 SAE modeling are listed and described in Section B.3. Selection of predictors for SAE modeling is described in Section B.4.

Small area estimates obtained using the SWHB methodology are design consistent (i.e., the small area estimates for states with large sample sizes are close to the robust design-based estimates). Additionally, the national small area estimates4 are very close to the national design-based estimates. However, to ensure internal consistency, it is desirable to have the national small area estimates exactly match the national design-based estimates. This process is called “benchmarking.” The benchmarked state-level estimates are also potentially less biased than the unbenchmarked state-level estimates. Beginning in 2002, exact benchmarking was introduced, as described in Section B.5. The census region–level estimates in the tables are population-weighted aggregates of the benchmarked state-level estimates. Tables of the estimated numbers of people associated with each measure are available online,5 and an explanation of how these counts and their respective Bayesian confidence intervals are calculated can be found in Section B.6. Section B.7 discusses the method to compute aggregated estimates by combining two age groups. The definition and explanation of the formula used in estimating the marijuana initiation rate are given in Section B.8.

State estimates for the age groups 12 to 17, 18 to 25, 26 or older, 18 or older, and 12 or older6 are provided for all measures except for any mental illness (AMI), co‑occurring substance use disorder (SUD) and AMI, serious mental illness (SMI), co‑occurring SUD and SMI, receipt of mental health treatment, major depressive episode (MDE), serious thoughts of suicide, suicide plans, and suicide attempts. Additionally, estimates for adolescents aged 12 to 17 are not available for past year heroin use because this outcome was extremely rare among adolescents aged 12 to 17 in the 2022‑2023 NSDUHs. As a result, estimates of past year heroin use for people aged 12 or older are also not produced.

Estimates of underage (aged 12 to 20) alcohol use, binge alcohol use, perceptions of great risk from having five or more drinks of an alcoholic beverage once or twice a week, alcohol use disorder, cigarette use, and tobacco product use were also produced.7 Alcohol consumption is expected to differ significantly across the 18 to 25 age group because of the legalization of alcohol at age 21. Additionally, legislation in December 2019 raised the federal minimum age for sale of tobacco products (along with e‑cigarettes) from 18 to 21 years (U.S. Food and Drug Administration, 2021). All 50 states and the District of Columbia prohibit the sale of tobacco products to people younger than 21. Therefore, it was decided that it would be useful to produce small area estimates for people aged 12 to 20. A short description of the methodology used to produce estimates of underage outcomes is provided in Section B.9.

The remainder of Section B covers four additional topics:

In Section C, the 2022‑2023 combined survey sample sizes, response rates, and population estimates are included in Tables C.1 to C.4.

A.2 Summary of NSDUH Methodology

NSDUH is the primary source of statistical information on the use of tobacco, alcohol, prescription pain relievers, and other substances (e.g., marijuana, cocaine) by the U.S. civilian, noninstitutionalized population aged 12 or older. The survey also includes several series of questions that focus on mental health issues. NSDUH has been ongoing since 1971 and is conducted by the federal government. The survey collects information from residents of households (e.g., individuals living in houses or townhouses, apartments, and condominiums; civilians living in housing on military bases) and individuals in noninstitutional group quarters (e.g., shelters, rooming or boarding houses, college dormitories, migratory workers’ camps, halfway houses). Not included are individuals with no fixed household address (e.g., homeless and/or transient people not in shelters), military personnel on active duty, and residents of institutional group quarters, such as jails and hospitals. From 1999 to 2019, the data were collected via face-to-face (in‑person) interviews at a respondent’s place of residence using a combination of computer-assisted personal interviewing conducted by an interviewer and audio computer-assisted self-interviewing. Because of the coronavirus disease 2019 (COVID‑19) pandemic, an additional web data collection mode was introduced to the 2020 NSDUH and continued to be used in the 2021 through 2023 surveys.

The 2023 sample was selected using the coordinated sample design developed for the 2014 through 2023 NSDUHs. The coordinated sample design is state based, with an independent, multistage area probability sample within each state and the District of Columbia. This design designates 12 states as large sample states. These 12 states have the following target sample sizes per year: 4,560 interviews in California; 3,300 interviews in Florida, New York, and Texas; 2,400 interviews in Illinois, Michigan, Ohio, and Pennsylvania; and 1,500 interviews in Georgia, New Jersey, North Carolina, and Virginia. Making the sample sizes more proportional to the state population sizes improves the precision of national estimates. This change also allows for a more cost-efficient sample allocation to the largest states while slightly increasing the sample sizes in smaller states to improve the precision of state estimates. The target sample size per year in the small states is 960 interviews, except for Hawaii, where the target sample size is 967 interviews. The combined 2022‑2023 fielded sample sizes for each state are provided in Table C.1.

Nationally in 2022‑2023, a total of approximately 415,700 dwelling units (DUs) were screened, and approximately 139,050 people responded within the screened DUs (see Table C.1). The weighted screening response rate (SRR) was 24.91 percent, the weighted interview response rate (IRR) was 48.94 percent, and the overall weighted response rate (ORR) was 12.19 percent (Table C.1). The ORRs ranged from 9.28 percent in South Carolina to 18.54 percent in Virginia. Estimates reflect the probability of selection, unit nonresponse, poststratification to known census population estimates, item imputation, and other aspects of the estimation process. These procedures are described in detail in 2023 National Survey on Drug Use and Health: Methodological Resource Book (CBHSQ, 2024a).

All sampled DUs8 are screened to confirm eligibility and to select zero, one, or two members to participate in the survey. The weighted SRR is calculated as the weighted number of successfully screened DUs9 divided by the weighted number of eligible DUs, or

Equation 1. Click 'D' link to access long description.,     D

where w sub D U is the inverse of the unconditional probability of selection for the DU and excludes all adjustments for nonresponse and poststratification.

In successfully screened DUs, eligible DU members who were selected were asked to complete the interview. The weighted IRR for NSDUH is calculated as the weighted number of respondents divided by the weighted number of selected people, or

Equation 2. Click 'D' link to access long description.,     D

where w sub i is the design-based weight or the inverse of the probability of selection for the ith person and includes DU-level nonresponse and poststratification adjustments. In an effort to maximize the IRR, all respondents were offered a $30 incentive to encourage them to complete the 2023 NSDUH interview, similar to 2021 and 2022. Some Quarter 4 respondents in 2022 were given a $5 prepaid screening incentive and/or a $50 interview incentive as part of an incentives experiment to test whether these changes increased the screening and interview response rates (see Section 2.1.2 in 2022 National Survey on Drug Use and Health: Methodological Summary and Definitions [CBHSQ, 2023]). To be considered a completed interview, a respondent must provide enough data to pass the usable case rule.10

The weighted ORR is defined as the product of the weighted SRR and the weighted IRR or

Equation 3. Click 'D' link to access long description..     D

For more details on the screening and response rates, see Section 3.3.1 in 2023 National Survey on Drug Use and Health: Methodological Summary and Definitions (CBHSQ, 2024b).

A.3 Presentation of Data

This section lists all products associated with the 2022‑2023 state estimates.

The following products exclude age groups 12 to 17 and 12 or older for past year heroin use because in 2022‑2023, heroin use among adolescents aged 12 to 17 was very rare. In addition to this methodology document for the 2022‑2023 state estimates, the following products are available at https://www.samhsa.gov/data/nsduh/state-reports-NSDUH-2023:

A.4 Bayesian Confidence Intervals

The Total U.S. estimates given in each of the 41 tables showing state-level, model-based estimates12 is the design-based national estimate along with a 95 percent design-based confidence interval, all of which are based on the survey design, the survey weights, and the reported data. The state estimates are model-based statistics (using SAE methodology) that have been adjusted (benchmarked) such that the population-weighted mean of the estimates across the 50 states and the District of Columbia equals the design-based national estimate. For more details on this benchmarking, see Section B.5. The census region–level estimates are also benchmarked and are obtained by taking the population-weighted mean of the associated state-level benchmarked estimates. Associated with each state and census region-level estimate is a 95 percent Bayesian confidence interval. These intervals indicate the uncertainty in the estimate due to both sampling variability and model fit. For example, the state with the highest estimate of past month use of marijuana for young adults aged 18 to 25 in 2022‑2023 was Vermont, with an estimate of 42.1 percent and a 95 percent Bayesian confidence interval that ranged from 36.2 to 48.2 percent (see Table 3 of 2022‑2023 National Surveys on Drug Use and Health: Model-Based Prevalence Estimates (50 States and the District of Columbia) [CBHSQ, 2024c]). Assuming that sampling and modeling conditions held, the Bayes posterior probability was 0.95 that the true percentage of past month marijuana use in Vermont for young adults aged 18 to 25 in 2022‑2023 was between 36.2 and 48.2 percent.

The confidence intervals shown in NSDUH state reports are asymmetric (for details, see Section B.5), meaning that the distance between the estimate and the lower confidence limit will not be the same as the distance between the upper confidence limit and the estimate. For example, Utah’s 2022‑2023 past month marijuana use estimate is 16.2 percent for young adults aged 18 to 25, with a 95 percent Bayesian confidence interval equal to 12.9 to 20.1 percent (see Table 3 of the 2022‑2023 Model-Based Prevalence Estimates [CBHSQ, 2024c]). Therefore, Utah’s estimate is 3.3 (i.e., 16.2 − 12.9) percentage points from the lower 95 percent confidence limit and 3.9 (i.e., 20.1 − 16.2) percentage points from the upper 95 percent confidence limit. These asymmetric confidence intervals work well for small percentages often found in NSDUH state estimate tables and reports while still being appropriate for larger percentages.

When it is indicated that a state has the highest or lowest estimate, it does not imply that the state’s estimate is significantly higher or lower than the next highest or lowest state’s estimate. Additionally, two significantly different state estimates (at the 5 percent level of significance) may have overlapping 95 percent confidence intervals. For details on a more accurate test to compare state estimates, see 2022‑2023 National Surveys on Drug Use and Health: Comparison of Population Percentages from the United States, Census Regions, States, and the District of Columbia (CBHSQ, forthcoming a).

A.5 Related Measures

State estimates are produced for a number of related measures, such as marijuana use in the past month and illicit drug use in the past month, or SMI and AMI. It might appear that one could draw conclusions by subtracting one from the other (e.g., subtracting the percentage who misused pain relievers in the past year from the percentage who misused opioids [misuse of pain relievers or use of heroin] in the past year to find the percentage who used only heroin in the past year but did not misuse pain relievers). Because related measures have been estimated separately with different models, subtracting the percentage of one measure from the percentage of another related measure at the state or census region level can give misleading results, perhaps even a “negative” estimate, and should be avoided. Users are advised to view the estimates along with their respective confidence intervals to get a better idea of the range in which the “true” value of the population percentage might fall (see Section A.4 for more details).

However, at the national level (Total U.S.), because these estimates are design-based estimates, such comparisons can be made. For example, at the national level, subtracting estimates for cigarette use in the past month from the estimates of tobacco use in the past month will give the estimate of people who did not use cigarettes in the past month but only used other forms of tobacco, such as cigars, pipes, or smokeless tobacco, in the past month.

A.6 2022 and 2023 NSDUH Methodological Changes and Implication for Estimates

The 2022 and 2023 NSDUHs continued the use of multimode data collection procedures that were first implemented in October 2020 for the 2020 NSDUH. Multimode data collection was used for the entire 2022 and 2023 NSDUH samples. In 2022, more than half of the interviews in Quarter 1 were completed via the web, but for the remaining quarters in 2022 and all quarters in 2023, the majority of interviews were completed in person. The multimode nature of the 2022 and 2023 NSDUHs, however, marks an important methodological change from prior years. This section discusses special methodological issues specific to 2022 and 2023 NSDUHs. More detailed information can be found in Chapter 6 of 2021 National Survey on Drug Use and Health: Methodological Summary and Definitions (CBHSQ, 2022).

A.6.1 Special Adjustment for the 2022 and 2023 NSDUH Weights

Analyses conducted for the 2021 NSDUH indicated that key substance use and mental health estimates differed between data collection modes (i.e., web or in person), also known as “mode effects.” See Chapter 6 in the 2021 Methodological Summary and Definitions report (CBHSQ, 2022). The proportion of interviews completed via the web or in person differed between 2021, 2022, and 2023 (i.e., 45.4% via in‑person in 2021, 57.6% in 2022, and 63.9% in 2023). Consequently, mode effects could distort differences in estimates from 2021 to 2023, unless the analysis weights are adjusted to take into account these different in‑person data collection proportions.

When multimode data collection for NSDUH stabilizes, the targeted proportions are expected to be 30 percent of interviews completed via the web and 70 percent completed in person. The unweighted proportions of interviews in 2023 that were completed via the web or in person were closer to these targeted proportions than in 2021 or 2022, but they had not reached the 30/70 target. Therefore, for the NSDUH weights since 2022, mode was included as a main effect in the person-level poststratification adjustment, with a 30 percent target for the web mode and a 70 percent target for the in‑person mode to standardize the weighted proportions for each mode. This adjustment continued to be included as part of the 2023 weighting procedures to facilitate comparisons of estimates over time. This mode adjustment also was applied to the weights for 2021 data to produce revised weights. Making a similar adjustment to the 2021 weights to assume the respective 30/70 proportions for web and in‑person interviews allows estimates for 2021 to be compared with those in future survey years without differences in estimates being confounded by changes in proportions of interviews in each mode. The mode adjusted weights for 2021 were used to produce the 2021‑2022 state small area estimates.

A.6.2 Comparisons with Prior Years

The 2021, 2022, and 2023 NSDUHs used multimode data collection, in which respondents completed the survey in person or via the web. Methodological investigations led to the conclusion that estimates based on multimode data collection since 2021 are not comparable with estimates from 2020 or prior years. Chapter 6 in the 2021 Methodological Summary and Definitions report (CBHSQ, 2022) discusses these methodological investigations in greater detail. Thus, the 2021‑2022 and 2022‑2023 small area estimates should not be compared to state estimates from prior years. The 2022‑2023 state estimates are comparable with the 2021‑2022 estimates.

Section B: State Model-Based Estimation Methodology

B.1 General Model Description

The state small area estimation (SAE) model is a complex mixed13 (including both fixed and random effects) logistic regression model of the following form:

Equation 4. Click 'D' link to access long description.,     D

where pi sub a, i, j, k is the probability of engaging in the behavior of interest (e.g., using marijuana in the past month) for person‑k belonging to age group‑a in grouped state sampling region (SSR)‑j of state‑i.14 Let x sub a, i, j, k denote a p sub a times 1 vector of predictor variables (independent variables or fixed effects) associated with age group‑a (12 to 17, 18 to 25, 26 to 34, and 35 or older) and beta sub a denote the associated vector of the regression parameters. The age group–specific vectors of the auxiliary variables are defined for every block group in the nation and include person-level demographic variables, such as race/ethnicity and sex. The vectors of state-level random effects An eta sub i is a transposed vector of values eta sub 1, i and so on until eta sub capital A, i. and grouped SSR-level random effects A nu sub i, j is a vector of transposed values nu sub 1, i, j and so on until nu sub capital A, i, j. are assumed to be mutually independent and normally distributed with mean vector 0 and variance-covariance matrices matrix capital D sub eta and matrix capital D sub nu, respectively—that is, An eta sub i is normally distributed with mean 0 and variance denoted by matrix capital D sub eta. and A nu sub i, j is normally distributed with mean 0 and variance denoted by matrix capital D sub nu., where capital A is the total number of individual age groups modeled (generally, Capital A= 4). For hierarchical Bayes (HB) estimation purposes, an improper uniform prior distribution is assumed for beta sub a, and proper Wishart prior distributions are assumed for inverse of capital D sub eta and inverse of capital D sub nu. The HB solution for pi sub a, i, j, k involves a series of complex Markov Chain Monte Carlo (MCMC) steps to generate values of the desired fixed and random effects from the underlying joint posterior distribution. The basic process is described in Folsom and colleagues (1999), Shah and colleagues (2000), and Wright (2003a, 2003b).

Once the required number of MCMC samples (1,250 in all) for the parameters of interest are generated and tested for convergence properties (see Raftery & Lewis, 1992), the small area estimates for each race/ethnicity × sex cell within a block group can be obtained for each age group as described in Wright (2003b). These block group-level small area estimates then can be aggregated using the appropriate population count projections for the desired age group(s) to form state-level small area estimates. These state-level small area estimates are benchmarked to the national design-based estimates as described in Section B.5.

B.2 Measures (Outcomes) Modeled

The 2023 National Survey on Drug Use and Health (NSDUH) data were combined with the 2022 NSDUH data, and age group–specific state estimates for 38 binary (0, 1) outcomes listed below were produced. The 2021‑2022 and the 2022‑2023 state estimates were also compared for all measures with these exceptions:15 nicotine vaping in the past month, underage cigarette use, underage tobacco use, all substance use treatment measures (#27‑29 below), co‑occurring substance use disorder (SUD) and any mental illness (AMI), co‑occurring SUD and serious mental illness (SMI), and mental health treatment.

  1. illicit drug use in the past month,
  2. marijuana use in the past year,
  3. marijuana use in the past month,
  4. perceptions of great risk from smoking marijuana once a month,
  5. first use of marijuana in the past year among people at risk for initiation of marijuana use,16
  6. illicit drug use other than marijuana in the past month,
  7. cocaine use in the past year,
  8. perceptions of great risk from using cocaine once a month,
  9. heroin use in the past year,
  10. perceptions of great risk from trying heroin once or twice,
  11. hallucinogen use in the past year,
  12. methamphetamine use in the past year,
  13. prescription pain reliever misuse in the past year,
  14. opioid misuse in the past year,
  15. alcohol use in the past month,17
  16. binge alcohol use in the past month,18
  17. perceptions of great risk from having five or more drinks of an alcoholic beverage once or twice a week,19
  18. tobacco product use in the past month,20
  19. cigarette use in the past month,21
  20. nicotine vaping in the past month,
  21. perceptions of great risk from smoking one or more packs of cigarettes per day,
  22. SUD in the past year,
  23. alcohol use disorder in the past year,22
  24. drug use disorder in the past year,
  25. pain reliever use disorder in the past year,
  26. opioid use disorder in the past year,
  27. received substance use treatment in the past year,
  28. classified as needing substance use treatment in the past year,
  29. did not receive substance use treatment in the past year among people classified as needing treatment,
  30. AMI in the past year,
  31. SMI in the past year,
  32. co‑occurring SUD and AMI in the past year,
  33. co‑occurring SUD and SMI in the past year,
  34. received mental health treatment in the past year,
  35. major depressive episode (MDE) in the past year,
  36. had serious thoughts of suicide in the past year,
  37. made any suicide plans in the past year, and
  38. attempted suicide in the past year.

B.3 Predictors Used in Mixed Logistic Regression Models

Local area data used as potential predictor variables in the mixed logistic regression models were obtained from the following sources:

Data sources, along with the description of potential predictor variables obtained from each source, are provided in the following lists.

Claritas Data (Description) Claritas Data (Level)
% Population Aged 0 to 19 in Block Group Block Group
% Population Aged 20 to 24 in Block Group Block Group
% Population Aged 25 to 34 in Block Group Block Group
% Population Aged 35 to 44 in Block Group Block Group
% Population Aged 45 to 54 in Block Group Block Group
% Population Aged 55 to 64 in Block Group Block Group
% Population Aged 65 or Older in Block Group Block Group
% Non-Hispanic Blacks in Block Group Block Group
% Hispanics in Block Group Block Group
% Non-Hispanic Other Races in Block Group Block Group
% Non-Hispanic Whites in Block Group Block Group
% Males in Block Group Block Group
% American Indians, Eskimos, Aleuts in Tract Tract
% Asians, Pacific Islanders in Tract Tract
% Population Aged 0 to 19 in Tract Tract
% Population Aged 20 to 24 in Tract Tract
% Population Aged 25 to 34 in Tract Tract
% Population Aged 35 to 44 in Tract Tract
% Population Aged 45 to 54 in Tract Tract
% Population Aged 55 to 64 in Tract Tract
% Population Aged 65 or Older in Tract Tract
% Non-Hispanic Blacks in Tract Tract
% Hispanics in Tract Tract
% Non-Hispanic Other Races in Tract Tract
% Non-Hispanic Whites in Tract Tract
% Males in Tract Tract
% Population Aged 0 to 19 in County County
% Population Aged 20 to 24 in County County
% Population Aged 25 to 34 in County County
% Population Aged 35 to 44 in County County
% Population Aged 45 to 54 in County County
% Population Aged 55 to 64 in County County
% Population Aged 65 or Older in County County
% Non-Hispanic Blacks in County County
% Hispanics in County County
% Non-Hispanic Other Races in County County
% Non-Hispanic Whites in County County
% Males in County County
American Community Survey (ACS) (Description) ACS Data (Level)
% Population Who Dropped Out of High School Tract
% Housing Units Built in 1940 to 1949 Tract
% Females 16 Years or Older in Labor Force Tract
% Females Never Married Tract
% Females Separated, Divorced, Widowed, or Other Tract
% One-Person Households Tract
% Males 16 Years or Older in Labor Force Tract
% Males Never Married Tract
% Males Separated, Divorced, Widowed, or Other Tract
% Housing Units Built in 1939 or Earlier Tract
Average Number of People per Room Tract
% Families below Poverty Level Tract
% Households with Public Assistance Income Tract
% Housing Units Rented Tract
% Population with 9 to 12 Years of School, No High School Diploma Tract
% Population with 0 to 8 Years of School Tract
% Population with Associate’s Degree Tract
% Population with Some College and No Degree Tract
% Population with Bachelor’s, Graduate, Professional Degree Tract
% Housing Units with No Telephone Service Available Tract
% Households with No Vehicle Available Tract
% Population with No Health Insurance Tract
Median Rents for Rental Units Tract
Median Value of Owner-Occupied Housing Units Tract
Median Household Income Tract
% Families below the Poverty Level County
Uniform Crime Report (UCR) Data (Description) UCR Data (Level)
Drug Possession Arrest Rate County
Drug Sale or Manufacture Arrest Rate County
Drug Violations’ Arrest Rate County
Marijuana Possession Arrest Rate County
Marijuana Sale or Manufacture Arrest Rate County
Opium or Cocaine Possession Arrest Rate County
Opium or Cocaine Sale or Manufacture Arrest Rate County
Other Drug Possession Arrest Rate County
Other Dangerous Non-Narcotics Arrest Rate County
Serious Crime Arrest Rate County
Violent Crime Arrest Rate County
Driving under Influence Arrest Rate County
Other Categorical Data
(Description)
Other Categorical Data
(Source)
Other Categorical Data
(Level)
= 1 if Hispanic, = 0 Otherwise National Survey on Drug Use and
Health (NSDUH) Sample
Person
= 1 if Non-Hispanic Black, = 0 Otherwise NSDUH Sample Person
= 1 if Non-Hispanic Other, = 0 Otherwise NSDUH Sample Person
= 1 if Male, = 0 if Female NSDUH Sample Person
= 1 if Metropolitan Statistical Area (MSA) with
≥ 1 Million, = 0 Otherwise
2010 Census County
= 1 if MSA with < 1 Million, = 0 Otherwise 2010 Census County
= 1 if Non-MSA Urban, = 0 Otherwise 2010 Census Tract
= 1 if Urban Area, = 0 if Rural Area 2010 Census Tract
= 1 if No Cubans in Tract, = 0 Otherwise 2010 Census Tract
= 1 if No Arrests for Dangerous Non-Narcotics,
= 0 Otherwise
Uniform Crime Report (UCR) County
= 1 if No Arrests for Opium or Cocaine Possession,
= 0 Otherwise
UCR County
= 1 if No Housing Units Built in 1939 or Earlier,
= 0 Otherwise
American Community Survey (ACS) Tract
= 1 if No Housing Units Built in 1940 to 1949,
= 0 Otherwise
ACS Tract
= 1 if No Households with Public Assistance
Income, = 0 Otherwise
ACS Tract
Miscellaneous Data (Description) Miscellaneous Data (Source) Miscellaneous Data
(Level)
Alcohol Death Rate, Underlying Cause National Center for Health Statistics’ International
Classification of Diseases, 10th revision (NCHS-
ICD-10)
County
Cigarette Death Rate, Underlying Cause NCHS-ICD-10 County
Drug Death Rate, Underlying Cause NCHS-ICD-10 County
Alcohol Treatment Rate National Survey of Substance Abuse Treatment
Services (N‑SSATS)
County
Alcohol and Drug Treatment Rate N-SSATS County
Drug Treatment Rate N-SSATS County
Unemployment Rate Bureau of Labor Statistics (BLS) County
Per Capita Income (in Thousands) Bureau of Economic Analysis (BEA) County
Average Suicide Rate (per 10,000) NCHS-ICD-10 County
Food Stamp Participation Rate Census Bureau County
Single State Agency Maintenance of Effort National Association of State Alcohol and Drug
Abuse Directors (NASADAD)
State
Block Grant Awards Substance Abuse and Mental Health Services
Administration (SAMHSA)
State
Cost of Services Factor Index SAMHSA State
Total Taxable Resources per Capita Index U.S. Department of Treasury State
% Hispanics Who Are Cuban 2010 Census Tract

The predictor variables used in the SAE models were selected from the set of potential predictors given above using the method described in Section B.4.

B.4 Selection of Predictor Variables for the SAE Models

Predictor variable selection was done using the 2022‑2023 data for all measures, using the following multistep process:24

  1. There were about 139,050 respondents in the combined 2022 and 2023 NSDUH data. Any variable selection performed on such a large dataset would result in an excessive number of predictors in the final model. To avoid this and build parsimonious models, the pooled data were partitioned into modeling and validation samples. For more information on how the data was partitioned, see the 2002‑2003 state SAE report (Wright & Sathe, 2005).
  2. For each measure, age group–specific25 SAS® stepwise logistic regression models were fit using the modeling sample data (SAS Institute Inc., 2017). The input list to these models included all linear polynomials (constructed from continuous predictor variables) and other categorical or indicator variables given in Section B.3. All significant predictors were input to step 3.
  3. Using modeling sample, all significant predictors from step 2 then were input to PROC HPSPLIT to identify significant complex (at most three-way) interaction terms. PROC HPSPLIT is a SAS procedure that uses decision-tree algorithms to build classification systems. The exhaustive chi-squared automatic interaction detector algorithm was used to create the trees.
  4. All the significant variables from step 2, along with their corresponding higher-order polynomials (quadratic and cubic), interaction of sex and race, and the significant interactions detected by PROC HPSPLIT in step 3 then were input to SAS stepwise logistic regression models, run on modeling sample. All predictors that remained significant then were input to step 5 of variable selection.
  5. All significant variables from step 4 were input to fit SUDAAN (RTI International, 2013) logistic regression models on the validation sample, and predictors that remained significant were used in the SAE models described in Section B.1. The race and sex predictors were forced in most of the models.

B.5 Benchmarking the Age Group–Specific Small Area Estimates

The self-calibration built into the survey-weighted hierarchical Bayes (SWHB) solution ensures the population-weighted average of the state small area estimates will closely match the national design-based estimates. The national design-based estimates in NSDUH are based entirely on survey-weighted data using a direct estimation approach, whereas the state and census region estimates are model based. Given the self-calibration ensured by the SWHB method, for state reports prior to 2002, the standard Bayes prescription was followed; specifically, the posterior mean was used for the point estimate, and the tail percentiles of the posterior distribution were used for the Bayesian confidence interval limits.

Singh and Folsom (2001) extended Ghosh’s (1992) results on constrained Bayes estimation to include exact benchmarking to design-based national estimates. In the simplest version of this constrained Bayes solution where only the design-based mean is imposed as a benchmarking constraint, each of the 2022‑2023 state-by–age group small area estimates is adjusted by adding the common factor Delta sub a is defined as the national design-based estimate, capital D sub a, minus the national model-based small area estimate, capital P sub a., where capital D sub a is the design-based national estimate and capital P sub a is the population-weighted mean of the state small area estimates (Capital P sub s and a) for age group‑a. The exactly benchmarked state‑s and age group‑a small area estimates then are given by The benchmarked state-s and age group-a small area estimate, Theta sub s and a, is defined as the sum of capital P sub s and a and Delta sub a.. Experience with such additive adjustments suggests that the resulting exactly benchmarked state small area estimates will always be between 0 and 100 percent because the SWHB self-calibration ensures that the adjustment factor is small relative to the size of the state-level small area estimates.

Relative to the Bayes posterior mean, these benchmark-constrained state small area estimates are biased by the common additive adjustment factor. Therefore, the posterior mean squared error for each benchmarked state small area estimate has the square of this adjustment factor added to its posterior variance. To achieve the desirable feature of exact benchmarking, this constrained Bayes adjustment factor was implemented for the state-by–age group small area estimates. The associated Bayesian confidence (credible) intervals can be recentered at the benchmarked small area estimates on the logit scale with the symmetric interval end points based on the posterior root mean squared errors. The adjusted 95 percent Bayesian confidence intervals (Lower sub s and a is the lower bound of the 95 percent Bayesian confidence interval of Theta sub s and a; upper sub s and a is the upper bound of the 95 confidence interval of Theta sub s and a.) are defined as follows:

Equation 5. Click 'D' link to access long description.,     D

where

Equation 6. Click 'D' link to access long description.,     D

Equation 7. Click 'D' link to access long description., and     D

Equation 8. Click 'D' link to access long description..     D

The associated posterior coverage probabilities for these benchmarked intervals are very close to the prescribed 0.95 value because the state small area estimates have posterior distributions that can be approximated exceptionally well by a Gaussian distribution after the logit transformation.

B.6 Calculation of Estimated Number of People Associated with Each Outcome

Tables 1 to 41 of 2022‑2023 National Surveys on Drug Use and Health: Model-Based Estimated Totals (in Thousands) (50 States and the District of Columbia) (CBHSQ, forthcoming b) show the estimated numbers of people associated with each of the 38 measures of interest. To calculate these numbers, the benchmarked small area estimates and associated 95 percent Bayesian confidence intervals are multiplied by the average population across the 2 years (in this case, 2022 and 2023) of the state by the age group of interest (Tables C.1 to C.4 in Section C of this methodology document).

For example, alcohol use in the past month among 18- to 25-year-olds in Alabama was 45.73 percent in 2022‑2023.26 The corresponding Bayesian confidence intervals ranged from 41.52 to 49.99 percent. The population count for 18- to 25-year-olds averaged across 2022 and 2023 in Alabama was 539,025 (see Table C.2). Hence, the estimated number of 18- to 25-year-olds using alcohol in the past month in Alabama was 0.4573 × 539,025, which is 246,496.27 The associated Bayesian confidence intervals ranged from 0.4152 × 539,025 (i.e., 223,803) to 0.4999 × 539,025 (i.e., 269,459). Note that when estimates of the number of people are calculated for Tables 1 to 41 in the 2022‑2023 Model-Based Estimated Totals report (CBHSQ, forthcoming b), the unrounded percentages and population counts are used, then the numbers are reported to the nearest thousand. Hence, the number obtained by multiplying the published estimate with the published population estimate may not exactly match the counts published in these tables because of rounding differences.

The only two exceptions to this calculation are the production of the estimated numbers of marijuana initiates among the population at risk and the estimated number of those not receiving substance use treatment among those classified as needing treatment. Those estimates cannot be directly calculated as the product of the percentage estimate and the population counts available in Section C. That is because the denominator of the marijuana initiation percentage estimate is defined as the number of people at risk for marijuana initiation, which is a combination of people who never used marijuana and one half of the people who initiated in the past 24 months (see Section B.8 for more details). And the denominator of those not receiving substance use treatment who were classified as needing treatment percentage estimate is defined as the number of people classified as needing substance use treatment (see Section B.12 for details).

B.7 Calculation of Aggregate Age Group Estimates and Limitations

Tables 1 to 41 of 2022‑2023 National Surveys on Drug Use and Health: Model-Based Prevalence Estimates (50 States and the District of Columbia) (CBHSQ, 2024c) show estimates for the following age groups: 12 to 17, 18 to 25, 26 or older, 18 or older, and 12 or older. If a user was interested in producing aggregated estimates, such as for those aged 12 to 25, the aggregated estimates could be calculated using prevalence estimates along with the population totals shown in Section C of this document. However, with the information provided in the tables, the confidence intervals cannot be calculated. Below is an example of the calculation of aggregated estimate for a given state.

In 2022‑2023, alcohol use in the past month in Alabama among youths aged 12 to 17 was 6.11 percent, and among young adults aged 18 to 25 it was 45.73 percent.28 The population counts for 12- to 17-year-olds and 18- to 25-year-olds averaged across 2022 and 2023 in Alabama were 395,361 and 539,025, respectively (see Table C.2). Hence, one would calculate the estimate for people aged 12 to 25 by first finding the number of users aged 12 to 25, which is 270,653 ([0.0611 × 395,361] + [0.4573 × 539,025]), then dividing that number by the population aged 12 to 25 (270,653 ÷ [395,361 + 539,025]), which results in a rate of 28.97 percent.

B.8 Calculation of Initiation of Marijuana Use

Initiation29 rates typically are calculated as the number of new initiates of a substance during a period of time (such as in the past year) divided by an estimate of the number of person-years of exposure (in thousands). The initiation definition used here employs a simpler form of the at-risk population based on the model-based methodology. This model-based initiation rate (i.e., first use of marijuana in the past year among people at risk for initiation of marijuana use) is defined as follows:

Equation 9. Click 'D' link to access long description.,     D

where capital X sub 1 is the number of marijuana initiates in the past 24 months, capital X sub 2 is the number of people who never used marijuana, and (0.5 × capital X sub 1 + capital X sub 2) denotes the at-risk population.

The initiation rate is expressed as a percentage or rate per 100 person-years of exposure. Note that this estimate uses a 2-year time period to accumulate initiation cases from the annual survey. By assuming further that the distribution of first use for the initiation cases is uniform across the 2-year interval, the total number of person-years of exposure is 1 year on average for the initiation cases plus 2 years for all the “never users” at the end of the time period. This approximation to the person-years of exposure permits one to recast the initiation rate as a function of two population prevalence rates—namely, the fraction of people who first used marijuana in the past 2 years and the fraction who had never used marijuana. State and census region estimates, along with the 95 percent Bayesian confidence intervals, are based on simultaneous modeling of capital X sub 1 and capital X sub 2 components using the SWHB SAE approach. The associated MCMC chains were used to calculate the posterior variance.

B.9 Underage Estimates

To obtain small area estimates for people aged 12 to 20 for past month alcohol use, binge alcohol use, perceptions of great risk from having five or more drinks of an alcoholic beverage once or twice a week, alcohol use disorder, past month tobacco product use, and past month cigarette use, a separate set of SAE models with predictors selected for the age groups 12 to 17, 18 to 20, 21 to 34, and 35 or older were used. Model-based estimates for people aged 12 to 20 were produced by taking the population-weighted average of the individual age group (12 to 17 and 18 to 20) estimates. Estimates for these underage outcomes were benchmarked to match national design-based estimates for that age group using the process described in Section B.5.

B.10 Marijuana Use

The marijuana section of the 2023 NSDUH questionnaire included questions to assess the variety of ways that people consume marijuana. These questions were first introduced in the 2022 NSDUH questionnaire. The following definitional information preceded the question about the use of marijuana: “The next questions are about marijuana and any cannabis products, sometimes called pot, weed, hashish, or concentrates. Some of the ways these products can be used are smoking (such as in joints, pipes, bongs, blunts, or hookahs), vaping (using vape pens, dab pens, tabletop vaporizers, or portable vaporizers), dabbing, eating or drinking, or applying as a lotion.” Additional questions about marijuana vaping were asked in the emerging issues section of the questionnaire, but the overall marijuana measures did not take marijuana vaping data from the emerging issues section into account since the marijuana section specifically included marijuana vaping as a way marijuana could be used. For additional details on marijuana vaping, please refer to Section 3.4.13 of CBHSQ (2024b).

B.11 Substance Use Disorder (SUD)

The NSDUH questionnaire includes questions to measure SUDs for alcohol and drugs. SUD estimates in the 2022 and 2023 NSDUHs were based on the criteria in the Diagnostic and Statistical Manual of Mental Disorders, 5th edition (DSM‑5; American Psychiatric Association [APA], 2013). For information about the SUD definitions based on criteria from DSM‑5, see Section 3.4.3 of CBHSQ (2024b). Respondents were asked SUD questions separately for any drugs or alcohol they used in the 12 months prior to the survey.30 SUD questions for drugs applied to marijuana, cocaine (including crack), heroin, hallucinogens, inhalants, methamphetamine, and any use of prescription pain relievers, tranquilizers, stimulants, or sedatives.

The following lists the substances that are included in selected SUD measures in the 2022‑2023 NSDUH state small area estimates:

B.12 Substance Use Treatment

The substance use treatment questions underwent considerable revisions for the 2022 NSDUH; these questions remained the same for 2023. Revisions for 2022 were intended to reflect contemporary changes in the delivery of substance use treatment services. For information about these changes, see Section 3.4.4 of CBHSQ (2024b). Because of these changes, the definition for the receipt of substance use treatment changed beginning in 2022. Receipt of substance use treatment includes the receipt of treatment in the past year for the use of alcohol or drugs in an inpatient location; in an outpatient location; via telehealth; or in a prison, jail, or juvenile detention center. The definition also includes the receipt of medication assisted treatment for alcohol use or opioid use.

In 2022 and 2023, relatively large proportions of people who reported that they received inpatient or outpatient treatment in the past 12 months did not indicate the specific substance(s) for which they received treatment in these locations, including treatment for the use of some other drug. Stated another way, these reports of inpatient or outpatient treatment were not substantiated by reports of treatment for the use of specific substances. Specifically, about one quarter of respondents who reported inpatient treatment in the past year did not report the specific substances for which they received treatment as inpatients. Among respondents who reported outpatient treatment in the past year, about one third did not report the specific substances for which they received treatment as outpatients. A “substance unspecified” category was created for these respondents. If respondents in this “substance unspecified” group did not actually receive substance use treatment, then estimates for any substance use treatment and for inpatient or outpatient substance use treatment could be overestimates. Thus, estimates for the overall substance use treatment measure could be overestimates.

Historically, NSDUH data products have included substance use treatment at a “specialty facility” in the past year as part of the definition for whether people needed substance use treatment. With the changes to the questionnaire in 2022, the term “specialty facility” was dropped from 2022 NSDUH data products. Consequently, the definition of the need for substance use treatment was revised beginning with the 2022 NSDUH. Respondents were classified as needing substance use treatment if they had an SUD in the past year or they received substance use treatment in the past year.

The percentage of people not receiving substance use treatment among those classified as needing treatment is defined as follows:

Equation 10. Click 'D' link to access long description.,     D

where capital X sub 1 is the number of people not receiving treatment who needed treatment, capital X sub 2 is the number of people receiving treatment who needed treatment, and (capital X sub 1 + capital X sub 2) denotes the number of people who needed treatment. State and census region estimates, along with the 95 percent Bayesian confidence intervals, are based on simultaneous modeling of capital X sub 1 and capital X sub 2 components using the SWHB SAE approach. The associated MCMC chains were used to calculate the posterior variance.

For more information about the substance use treatment outcomes, see Section 3.4.4 of CBHSQ (2024b).

B.13 Mental Health Measures

Sections 3.4.5, 3.4.7, 3.4.8, and 3.4.12 of CBHSQ (2024b) provide a summary of the measurement issues associated with seven mental health outcome variables such as mental illness, MDE, suicidal thoughts and behaviors, and mental health treatment.

B.13.1 Mental Illness

The binary (0, 1) SMI and AMI measures are generated (predicted) by a logistic regression model where parameter estimates from the 2012 model and annually updated associated predictors from NSDUH (i.e., responses to questions in the NSDUH) are used to predict the respondent’s SMI (or AMI) status. The predicted SMI (or AMI) status for all adult NSDUH respondents was used to compute prevalence estimates of SMI (or AMI) nationally as well as at the state level. For details on the 2012 model, see Section 3.4.7.8 of CBHSQ (2024b). Note that starting from 2021, the measures used in the mental illness models were all imputed. Therefore, the source variables (i.e., 2012 model covariate) used to create the measures of AMI and SMI had no missing data.

B.13.2 Mental Health Treatment

The mental health treatment questions underwent considerable revisions for the 2022 NSDUH; these questions remained the same for 2023. Revisions for 2022 were intended to reflect contemporary changes in the delivery of mental health treatment services. The changes also made the content more similar between the alcohol and drug treatment and the mental health services utilization sections of the questionnaire. For information about these changes see Section 3.4.5 of CBHSQ (2024b). Because of these changes, the definition for the receipt of mental health treatment changed beginning in 2022. Receipt of mental health treatment includes the receipt of treatment in the past year to help people with their mental health, emotions, or behavior that was received in an inpatient location; in an outpatient location; via telehealth; or in a prison, jail, or juvenile detention center. The definition also includes the receipt of prescription medication to help with mental health, emotions, or behavior.

B.13.3 Major Depressive Episode (MDE)

Two sections related to MDE were included in the NSDUH questionnaires: an adult depression section and an adolescent depression section. These sections were originally derived from Diagnostic and Statistical Manual of Mental Disorders, 4th edition criteria for MDE and remained applicable to the more recent DSM‑5 criteria (APA, 1994; 2013). Consistent with the DSM‑5 criteria, NSDUH does not exclude MDEs occurring exclusively in the context of bereavement. In addition, no exclusions were made for MDEs caused by medication, alcohol, illicit drugs, or any medical illness. For information about the differences in the adult and adolescent depression questions, see Section 3.4.8 of CBHSQ (2024b).

According to DSM‑5, people are classified as having had an MDE31 in their lifetime if they had at least five or more of the following symptoms nearly every day (except where noted) in the same 2‑week period, where at least one of the symptoms is a depressed mood or loss of interest or pleasure in daily activities: (1) depressed mood most of the day; (2) markedly diminished interest or pleasure in all or almost all activities most of the day; (3) significant weight loss when not sick or dieting, or weight gain when not pregnant or growing, or decrease or increase in appetite; (4) insomnia or hypersomnia; (5) psychomotor agitation or retardation at a level observable by others; (6) fatigue or loss of energy; (7) feelings of worthlessness or excessive or inappropriate guilt; (8) diminished ability to think or concentrate or indecisiveness; and (9) recurrent thoughts of death or suicidality (i.e., recurrent suicidal ideation without a specific plan, making a specific plan, or making an attempt). Unlike the other symptoms listed previously, recurrent thoughts of death or suicidality did not need to have occurred nearly every day (APA, 2013).

Respondents who have had an MDE in their lifetime are asked if, during the past 12 months, they had a period of depression lasting 2 weeks or longer while also having some of the other symptoms mentioned. Respondents reporting experiences consistent with them having had an MDE in the past year are asked questions from the Sheehan Disability Scale to measure the level of functional impairment in major life activities reported to be caused by the MDE in the past 12 months (Leon et al., 1997).

B.13.4 Suicidal Thoughts and Behavior

The 2022 and 2023 NSDUHs included sets of questions asking adults aged 18 or older whether they had serious thoughts of suicide, made any suicide plans, or had attempted suicide in the past 12 months. All adult respondents were asked whether they made a suicide plan or attempted suicide regardless of whether they reported that they had serious thoughts of suicide in the past 12 months. The two response options were “yes” and “no.” Additionally, beginning in 2021, the adult variables for suicidal thoughts and behaviors among adults were statistically imputed.

Additionally, the 2022 and 2023 NSDUHs included sets of questions that asked youths aged 12 to 17 about the same suicidal thoughts and behaviors. Even though the wording of the youth suicidal thoughts and behavior questions did not change for 2022, questions were moved to the youth experiences section of the questionnaire from the youth mental health utilization section, since that section was removed from the 2022 NSDUH questionnaire. Unlike the questions for adults, the questions about suicidal thoughts and behavior among youths included response choices for “I’m not sure” and “I don’t want to answer,” in addition to standard response choices of “yes” and “no.”

Estimates for suicidal thoughts and behavior among adolescents in national reports and tables for 2023 included estimates for “I’m not sure,” and “I don’t want to answer,” in addition to estimates for “yes” and “no.” Measures for suicidal thoughts and behavior among adolescents were not statistically imputed for 2022 or 2023. For the 2022‑2023 state small area estimates, estimates for suicidal behaviors reflect the percentage that answered “yes” among all respondents. Respondents who answered “no,” “I’m not sure,” and “I don’t want to answer” were grouped together as the “no” category. Thus the 2022‑2023 state small area estimates for suicidal behaviors among adolescents may be underestimated.

Section C: Sample Sizes, Response Rates,
and Population Estimates

Table C.1 – Survey Sample Sizes (Rounded to the Nearest 10), Weighted Screening and Interview Response Rates, and Population Estimates, by State, for People Aged 12 or Older, 2022 and 2023
State Total
Selected DUs
Total
Eligible
DUs
Total
Completed
Screeners
Weighted DU
Screening
Response Rate
Total
Selected
People
Total
Responded
Population
Estimate
Weighted
Interview
Response
Rate
Weighted
Overall
Response
Rate
Total U.S. 1,817,300 1,669,260 415,700 24.91% 286,530 139,050 282,732,485 48.94% 12.19%
Northeast 358,200 330,590 81,520 23.42% 53,890 25,700 49,002,655 50.35% 11.79%
Midwest 425,780 389,560 99,430 26.46% 67,570 32,600 58,200,570 49.86% 13.19%
South 616,780 562,690 140,510 24.94% 94,110 46,870 108,676,183 49.35% 12.31%
West 416,550 386,420 94,250 24.51% 70,960 33,880 66,853,077 46.42% 11.38%
Alabama 25,860 22,950 8,720 38.99% 5,690 2,460 4,287,709 42.77% 16.68%
Alaska 24,950 21,850 5,210 24.80% 3,760 2,010 587,481 57.16% 14.17%
Arizona 28,260 25,390 5,560 21.55% 4,370 2,160 6,285,586 47.69% 10.28%
Arkansas 24,170 20,710 5,300 25.40% 3,990 2,010 2,553,047 50.85% 12.91%
California 101,460 97,700 21,570 22.01% 18,430 8,410 33,130,051 44.95% 9.90%
Colorado 30,920 28,160 7,480 26.25% 5,000 2,240 4,993,121 44.84% 11.77%
Connecticut 28,850 27,050 6,500 23.90% 4,070 1,990 3,123,417 52.43% 12.53%
Delaware 31,590 29,410 7,660 25.77% 4,740 2,010 876,840 44.53% 11.48%
District of Columbia 32,560 30,640 7,040 23.09% 2,790 1,650 576,987 57.87% 13.36%
Florida 90,950 84,350 20,230 23.28% 13,270 6,340 19,317,461 45.75% 10.65%
Georgia 33,740 31,970 7,790 25.37% 6,340 3,360 9,173,397 48.88% 12.40%
Hawaii 28,400 26,400 6,310 21.91% 4,810 2,080 1,189,291 45.01% 9.86%
Idaho 24,790 23,040 7,230 32.10% 5,080 2,430 1,632,889 45.40% 14.57%
Illinois 70,150 65,280 13,640 20.89% 10,340 4,540 10,678,859 45.78% 9.57%
Indiana 25,530 22,970 5,790 26.02% 4,270 2,310 5,738,173 56.56% 14.72%
Iowa 26,770 24,330 5,820 25.56% 3,860 1,950 2,693,293 52.88% 13.52%
Kansas 24,270 22,080 5,470 25.63% 4,360 1,990 2,433,847 47.92% 12.28%
Kentucky 27,330 23,950 6,310 27.96% 3,930 2,150 3,779,544 56.70% 15.85%
Louisiana 24,760 21,470 5,910 29.26% 3,980 1,890 3,797,065 47.53% 13.91%
Maine 27,420 23,570 7,490 28.82% 4,220 1,990 1,217,246 51.92% 14.96%
Maryland 27,890 26,470 6,370 24.65% 4,460 2,060 5,208,094 45.04% 11.10%
Massachusetts 28,600 27,090 6,370 23.19% 4,210 2,080 6,074,415 53.06% 12.31%
Michigan 62,720 56,450 15,410 27.51% 9,360 4,890 8,581,433 55.20% 15.19%
Minnesota 25,060 23,630 6,220 25.82% 3,960 1,820 4,833,275 45.77% 11.82%
Mississippi 23,470 20,920 5,760 28.51% 4,280 2,050 2,444,092 46.78% 13.33%
Missouri 26,280 23,560 6,330 27.47% 3,930 2,010 5,210,954 54.46% 14.96%
Montana 28,310 24,620 5,270 18.95% 3,140 1,590 961,495 56.93% 10.79%
Nebraska 21,200 19,650 5,660 28.50% 4,360 2,200 1,634,872 52.33% 14.91%
Nevada 27,250 25,520 5,810 22.84% 4,800 2,450 2,709,179 51.46% 11.75%
New Hampshire 27,740 25,600 7,760 31.45% 5,050 2,240 1,224,701 48.85% 15.36%
New Jersey 42,140 39,880 9,020 22.48% 6,600 2,980 7,901,504 46.98% 10.56%
New Mexico 25,700 23,380 5,660 24.20% 4,010 2,100 1,793,695 53.98% 13.07%
New York 81,650 76,250 17,680 23.32% 13,070 6,230 16,838,128 49.27% 11.49%
North Carolina 43,740 39,740 7,720 19.90% 4,460 2,430 9,073,788 52.37% 10.42%
North Dakota 27,270 23,720 4,840 19.69% 3,380 1,660 641,182 50.01% 9.84%
Ohio 65,530 62,630 18,130 29.25% 11,790 5,340 9,951,683 44.23% 12.94%
Oklahoma 25,960 23,030 6,000 25.65% 4,180 1,980 3,333,620 53.38% 13.69%
Oregon 25,680 24,560 6,800 29.49% 4,270 1,970 3,668,026 46.66% 13.76%
Pennsylvania 67,220 62,480 13,500 22.20% 8,830 4,380 11,104,726 52.31% 11.61%
Rhode Island 28,590 25,360 5,630 22.90% 3,590 1,740 948,528 51.09% 11.70%
South Carolina 30,590 27,430 5,110 18.16% 3,180 1,590 4,514,026 51.12% 9.28%
South Dakota 25,380 21,860 4,580 19.92% 3,330 1,760 751,352 55.35% 11.03%
Tennessee 26,160 24,490 6,450 27.44% 4,080 1,870 5,987,617 42.74% 11.73%
Texas 75,000 68,320 14,750 21.87% 12,290 6,470 24,945,844 51.99% 11.37%
Utah 19,680 17,990 5,350 31.86% 5,330 2,700 2,782,695 54.53% 17.37%
Vermont 25,980 23,300 7,570 31.56% 4,260 2,070 569,990 50.57% 15.96%
Virginia 43,310 40,440 13,830 34.79% 9,220 4,980 7,289,502 53.28% 18.54%
Washington 26,350 24,900 7,910 32.67% 5,390 2,350 6,627,638 43.11% 14.09%
West Virginia 29,710 26,400 5,560 20.59% 3,200 1,590 1,517,549 51.44% 10.59%
Wisconsin 25,640 23,410 7,550 31.90% 4,630 2,150 5,051,646 49.75% 15.87%
Wyoming 24,800 22,920 4,110 17.44% 2,570 1,400 491,930 56.98% 9.94%
DU = dwelling unit.
NOTE: To compute the pooled 2022‑2023 weighted response rates, two samples were combined, and the individual year weights were used for the pooled sample. Thus, the response rates presented here are weighted across 2 years of data rather than being a simple average of the 2022 and 2023 individual response rates. The 2022‑2023 population estimate is the average of the 2022 and the 2023 population.
Source: SAMHSA, Center for Behavioral Health Statistics and Quality, National Surveys on Drug Use and Health, 2022 and 2023.
Table C.2 – Survey Sample Sizes (Rounded to the Nearest 10), Weighted Interview Response Rates, and Population Estimates, by State and Three Age Groups, 2022 and 2023
State 12‑17
Total
Selected
People
12‑17
Total
Responded
12‑17
Population
Estimate
12‑17
Weighted
Interview
Response
Rate
18‑25
Total
Selected
People
18‑25
Total
Responded
18‑25
Population
Estimate
18‑25
Weighted
Interview
Response
Rate
26+
Total
Selected
People
26+
Total
Responded
26+
Population
Estimate
26+
Weighted
Interview
Response
Rate
Total U.S. 66,490 29,120 25,820,478 43.87% 71,890 33,540 34,409,720 45.66% 148,150 76,390 222,502,287 50.04%
Northeast 11,900 5,080 4,066,290 42.67% 13,450 6,010 5,829,519 45.23% 28,540 14,610 39,106,845 51.92%
Midwest 15,410 6,220 5,414,481 39.76% 17,530 8,210 7,217,337 45.78% 34,630 18,180 45,568,752 51.73%
South 22,930 10,660 10,194,122 46.47% 22,710 11,030 13,230,256 47.47% 48,470 25,190 85,251,805 49.99%
West 16,260 7,170 6,145,585 43.97% 18,200 8,300 8,132,608 42.89% 36,500 18,410 52,574,884 47.25%
Alabama 1,330 560 395,361 41.81% 1,480 620 539,025 40.83% 2,880 1,280 3,353,323 43.21%
Alaska 840 370 58,870 46.98% 1,000 540 65,935 55.23% 1,930 1,090 462,676 58.62%
Arizona 1,090 510 573,720 47.67% 1,180 560 789,025 46.00% 2,110 1,090 4,922,841 47.97%
Arkansas 910 410 248,241 39.19% 1,160 560 320,505 47.73% 1,920 1,040 1,984,301 52.85%
California 4,130 1,820 3,047,049 43.24% 4,480 2,030 4,031,332 42.43% 9,810 4,550 26,051,670 45.53%
Colorado 1,200 520 436,981 41.54% 1,150 480 610,008 39.07% 2,650 1,240 3,946,133 46.11%
Connecticut 1,000 430 268,208 40.99% 860 370 381,646 45.12% 2,210 1,190 2,473,563 54.85%
Delaware 1,210 470 74,551 38.11% 1,100 420 96,228 35.94% 2,430 1,120 706,061 46.41%
District of Columbia 750 410 35,344 52.18% 670 370 79,766 52.68% 1,380 870 461,878 59.18%
Florida 3,260 1,510 1,534,856 46.06% 2,860 1,340 2,045,283 43.93% 7,160 3,500 15,737,322 45.96%
Georgia 1,380 720 914,813 50.69% 1,770 950 1,155,690 49.82% 3,190 1,690 7,102,894 48.48%
Hawaii 1,110 400 96,941 43.51% 1,170 500 118,461 38.78% 2,530 1,180 973,889 45.92%
Idaho 1,290 590 170,227 46.46% 1,380 610 215,994 41.65% 2,410 1,230 1,246,668 45.91%
Illinois 2,440 880 985,456 34.89% 2,630 1,100 1,292,804 41.42% 5,280 2,560 8,400,599 47.79%
Indiana 870 400 558,597 44.57% 1,330 730 748,524 53.17% 2,070 1,190 4,431,052 58.61%
Iowa 980 400 258,498 40.94% 980 490 360,689 49.18% 1,900 1,060 2,074,106 55.14%
Kansas 920 360 247,003 40.33% 1,290 570 328,728 43.49% 2,150 1,070 1,858,116 49.80%
Kentucky 1,020 470 354,325 44.69% 1,010 560 456,436 56.51% 1,900 1,120 2,968,783 58.13%
Louisiana 1,000 450 369,669 41.80% 770 350 464,930 42.24% 2,210 1,100 2,962,467 49.02%
Maine 920 370 90,954 32.06% 1,070 460 124,764 41.15% 2,230 1,160 1,001,528 55.07%
Maryland 1,170 510 477,408 45.76% 1,080 460 587,239 41.16% 2,210 1,090 4,143,446 45.51%
Massachusetts 760 310 479,926 39.34% 1,140 550 778,509 48.79% 2,300 1,210 4,815,980 55.07%
Michigan 2,350 1,070 755,554 45.55% 2,120 1,060 1,053,176 51.10% 4,890 2,770 6,772,703 56.92%
Minnesota 900 350 459,370 34.24% 1,020 440 575,504 38.08% 2,030 1,030 3,798,400 48.50%
Mississippi 1,070 450 247,080 41.55% 920 450 313,746 49.47% 2,290 1,150 1,883,267 46.95%
Missouri 1,000 430 484,087 40.27% 870 450 637,476 53.57% 2,060 1,130 4,089,391 56.26%
Montana 740 280 83,659 33.00% 780 350 115,247 47.27% 1,610 960 762,589 61.01%
Nebraska 1,010 440 166,831 42.81% 930 440 216,665 46.62% 2,420 1,320 1,251,376 54.57%
Nevada 1,040 530 245,685 50.53% 1,290 620 292,853 48.46% 2,480 1,310 2,170,641 51.96%
New Hampshire 1,080 420 93,095 37.25% 1,210 470 135,461 37.88% 2,760 1,360 996,145 51.35%
New Jersey 1,700 730 710,731 39.97% 1,660 760 887,300 45.87% 3,230 1,490 6,303,473 47.92%
New Mexico 830 400 169,349 50.96% 1,210 570 221,239 42.78% 1,970 1,120 1,403,107 56.09%
New York 2,760 1,270 1,373,050 46.85% 3,280 1,480 2,008,808 43.60% 7,030 3,480 13,456,269 50.35%
North Carolina 1,140 540 824,720 45.01% 980 540 1,106,275 47.50% 2,350 1,350 7,142,793 54.00%
North Dakota 680 220 60,483 33.42% 950 510 95,352 49.86% 1,750 930 485,346 52.02%
Ohio 2,590 1,010 908,928 39.56% 3,100 1,340 1,188,683 41.04% 6,100 2,990 7,854,072 45.24%
Oklahoma 1,070 460 337,376 42.97% 1,030 470 440,668 48.17% 2,070 1,050 2,555,576 55.75%
Oregon 980 390 302,902 39.12% 1,220 520 413,827 39.92% 2,080 1,070 2,951,297 48.45%
Pennsylvania 1,920 830 935,433 42.61% 2,330 1,100 1,319,019 46.80% 4,580 2,440 8,850,275 54.17%
Rhode Island 840 360 72,502 39.75% 860 370 122,502 43.24% 1,890 1,010 753,524 53.49%
South Carolina 790 300 405,682 35.64% 810 400 531,020 44.24% 1,590 890 3,577,324 54.04%
South Dakota 730 270 75,567 36.32% 930 500 94,500 50.14% 1,670 980 581,286 58.54%
Tennessee 940 370 545,625 38.11% 1,070 500 716,006 41.82% 2,080 1,000 4,725,986 43.39%
Texas 2,620 1,320 2,644,728 50.55% 2,960 1,550 3,317,864 51.74% 6,710 3,600 18,983,252 52.23%
Utah 1,160 590 337,230 51.03% 1,370 640 453,651 47.88% 2,800 1,470 1,991,815 56.54%
Vermont 910 370 42,392 38.05% 1,040 450 71,510 38.30% 2,310 1,260 456,088 53.55%
Virginia 2,470 1,380 657,289 55.10% 2,140 1,060 884,275 49.60% 4,610 2,530 5,747,938 53.63%
Washington 1,230 510 575,011 40.69% 1,370 550 746,355 40.25% 2,790 1,290 5,306,272 43.83%
West Virginia 810 340 127,053 40.64% 890 430 175,302 46.06% 1,510 820 1,215,194 53.20%
Wisconsin 950 400 454,107 39.96% 1,370 590 625,236 42.54% 2,310 1,160 3,972,303 52.11%
Wyoming 620 270 47,962 40.24% 610 320 58,681 51.46% 1,350 810 385,286 60.03%
NOTE: Computations in this table are based on a respondent’s age at screening. Thus, the data in the Total Responded column(s) could differ from data in other National Survey on Drug Use and Health tables that use the respondent’s age recorded during the interview.
NOTE: To compute the pooled 2022‑2023 weighted response rates, two samples were combined, and the individual year weights were used for the pooled sample. Thus, the response rates presented here are weighted across 2 years of data rather than being a simple average of the 2022 and 2023 individual response rates. The 2022‑2023 population estimate is the average of the 2022 and the 2023 population.
Source: SAMHSA, Center for Behavioral Health Statistics and Quality, National Surveys on Drug Use and Health, 2022 and 2023.
Table C.3 – Survey Sample Sizes (Rounded to the Nearest 10), Weighted Interview Response Rates, and Population Estimates among People Aged 12 to 20, by State, 2022 and 2023
State Total Selected People Total Responded Population Estimate Weighted Interview
Response Rate
Total U.S. 91,500 40,710 38,358,209 44.48%
Northeast 16,450 7,100 6,100,291 43.20%
Midwest 21,500 9,000 8,089,880 41.51%
South 30,840 14,450 14,986,155 46.82%
West 22,700 10,170 9,181,883 44.10%
Alabama 1,900 820 636,846 42.95%
Alaska 1,200 570 84,358 49.64%
Arizona 1,500 720 876,877 47.85%
Arkansas 1,310 590 365,043 40.38%
California 5,740 2,590 4,506,691 43.59%
Colorado 1,580 660 666,699 40.04%
Connecticut 1,300 560 408,300 41.78%
Delaware 1,620 620 110,726 37.86%
District of Columbia 880 470 48,879 50.38%
Florida 4,270 2,000 2,315,857 46.43%
Georgia 1,960 1,030 1,319,959 50.71%
Hawaii 1,530 590 138,916 43.26%
Idaho 1,790 820 262,877 45.41%
Illinois 3,410 1,280 1,481,469 36.63%
Indiana 1,260 600 789,052 46.30%
Iowa 1,360 580 403,772 44.39%
Kansas 1,340 550 357,709 41.29%
Kentucky 1,380 650 509,473 47.16%
Louisiana 1,280 570 553,311 42.68%
Maine 1,260 520 135,665 35.02%
Maryland 1,550 660 668,655 44.39%
Massachusetts 1,130 480 716,403 39.25%
Michigan 3,060 1,420 1,109,169 47.48%
Minnesota 1,270 500 712,944 35.73%
Mississippi 1,410 630 367,029 45.16%
Missouri 1,300 570 713,892 44.22%
Montana 1,010 390 122,467 36.69%
Nebraska 1,330 590 245,540 43.22%
Nevada 1,510 760 363,386 50.27%
New Hampshire 1,490 570 131,961 36.79%
New Jersey 2,320 1,010 1,034,696 41.53%
New Mexico 1,310 630 257,787 47.96%
New York 3,830 1,770 2,098,910 46.28%
North Carolina 1,510 740 1,210,761 44.63%
North Dakota 1,020 390 92,969 38.01%
Ohio 3,680 1,470 1,357,095 40.13%
Oklahoma 1,430 610 454,799 42.65%
Oregon 1,440 580 466,156 39.51%
Pennsylvania 2,720 1,200 1,395,954 44.12%
Rhode Island 1,150 500 116,638 40.73%
South Carolina 1,060 420 591,783 38.34%
South Dakota 1,030 430 104,033 39.71%
Tennessee 1,300 530 811,109 39.47%
Texas 3,650 1,870 3,845,894 50.99%
Utah 1,590 790 489,852 50.58%
Vermont 1,250 500 61,765 37.41%
Virginia 3,240 1,760 986,970 53.33%
Washington 1,700 710 878,568 41.63%
West Virginia 1,110 470 189,059 41.77%
Wisconsin 1,450 630 722,237 41.21%
Wyoming 810 370 67,249 43.11%
NOTE: Computations in this table are based on a respondent’s age at screening. Thus, the data in the Total Responded column(s) could differ from data in other National Survey on Drug Use and Health tables that use the respondent’s age recorded during the interview.
NOTE: To compute the pooled 2022‑2023 weighted response rates, two samples were combined, and the individual year weights were used for the pooled sample. Thus, the response rates presented here are weighted across 2 years of data rather than being a simple average of the 2022 and 2023 individual response rates. The 2022‑2023 population estimate is the average of the 2022 population and the 2023 population.
Source: SAMHSA, Center for Behavioral Health Statistics and Quality, National Surveys on Drug Use and Health, 2022 and 2023.
Table C.4 – Survey Sample Sizes (Rounded to the Nearest 10), Weighted Interview Response Rates, and Population Estimates among People Aged 18 or Older, by State, 2022 and 2023
State Total Selected People Total Responded Population Estimate Weighted Interview
Response Rate
Total U.S. 220,030 109,930 256,912,007 49.45%
Northeast 41,990 20,620 44,936,365 51.05%
Midwest 52,160 26,380 52,786,089 50.90%
South 71,180 36,210 98,482,061 49.65%
West 54,700 26,720 60,707,492 46.67%
Alabama 4,360 1,900 3,892,349 42.87%
Alaska 2,920 1,640 528,611 58.22%
Arizona 3,290 1,650 5,711,867 47.69%
Arkansas 3,080 1,600 2,304,805 52.13%
California 14,300 6,580 30,083,002 45.12%
Colorado 3,790 1,720 4,556,140 45.16%
Connecticut 3,070 1,560 2,855,209 53.53%
Delaware 3,530 1,540 802,289 45.13%
District of Columbia 2,040 1,240 541,643 58.23%
Florida 10,020 4,830 17,782,605 45.72%
Georgia 4,960 2,640 8,258,584 48.67%
Hawaii 3,700 1,680 1,092,350 45.14%
Idaho 3,790 1,840 1,462,663 45.27%
Illinois 7,910 3,660 9,693,403 46.92%
Indiana 3,400 1,920 5,179,576 57.82%
Iowa 2,880 1,550 2,434,796 54.24%
Kansas 3,440 1,640 2,186,844 48.82%
Kentucky 2,910 1,680 3,425,219 57.92%
Louisiana 2,990 1,440 3,427,397 48.12%
Maine 3,300 1,620 1,126,292 53.55%
Maryland 3,290 1,550 4,730,685 44.97%
Massachusetts 3,440 1,760 5,594,489 54.20%
Michigan 7,010 3,830 7,825,879 56.13%
Minnesota 3,060 1,470 4,373,904 47.06%
Mississippi 3,210 1,600 2,197,013 47.31%
Missouri 2,930 1,580 4,726,867 55.88%
Montana 2,390 1,310 877,836 59.19%
Nebraska 3,350 1,760 1,468,041 53.39%
Nevada 3,760 1,930 2,463,495 51.55%
New Hampshire 3,970 1,830 1,131,607 49.79%
New Jersey 4,900 2,250 7,190,773 47.67%
New Mexico 3,180 1,690 1,624,346 54.30%
New York 10,310 4,960 15,465,078 49.49%
North Carolina 3,320 1,890 8,249,067 53.09%
North Dakota 2,700 1,440 580,698 51.68%
Ohio 9,200 4,330 9,042,755 44.70%
Oklahoma 3,100 1,520 2,996,244 54.59%
Oregon 3,300 1,590 3,365,124 47.38%
Pennsylvania 6,910 3,550 10,169,294 53.20%
Rhode Island 2,750 1,380 876,026 52.04%
South Carolina 2,390 1,290 4,108,344 52.73%
South Dakota 2,600 1,480 675,786 57.41%
Tennessee 3,150 1,500 5,441,992 43.18%
Texas 9,670 5,150 22,301,116 52.16%
Utah 4,170 2,110 2,445,465 54.99%
Vermont 3,350 1,710 527,598 51.53%
Virginia 6,750 3,590 6,632,213 53.09%
Washington 4,160 1,840 6,052,627 43.36%
West Virginia 2,400 1,250 1,390,496 52.35%
Wisconsin 3,690 1,750 4,597,540 50.75%
Wyoming 1,960 1,130 443,967 58.87%
NOTE: Computations in this table are based on a respondent’s age at screening. Thus, the data in the Total Responded column(s) could differ from data in other NSDUH tables that use the respondent’s age recorded during the interview.
NOTE: To compute the pooled 2022‑2023 weighted response rates, two samples were combined, and the individual year weights were used for the pooled sample. Thus, the response rates presented here are weighted across 2 years of data rather than being a simple average of the 2022 and 2023 individual response rates. The 2022‑2023 population estimate is the average of the 2022 population and the 2023 population.
Source: SAMHSA, Center for Behavioral Health Statistics and Quality, National Surveys on Drug Use and Health, 2022 and 2023.

Section D: References

American Psychiatric Association. (1994). Diagnostic and statistical manual of mental disorders (DSM‑IV) (4th ed.).

American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (DSM‑5) (5th ed.). https://doi.org/10.1176/appi.books.9780890425787 SAMHSA Exit Disclaimer

Center for Behavioral Health Statistics and Quality. (2023). 2022 National Survey on Drug Use and Health: Methodological summary and definitions. Substance Abuse and Mental Health Administration.

Center for Behavioral Health Statistics and Quality. (2024a). 2023 National Survey on Drug Use and Health: Methodological resource book. Substance Abuse and Mental Health Administration. https://www.samhsa.gov/data/report/nsduh-2023-methodological-resource-book-mrb

Center for Behavioral Health Statistics and Quality. (2024b). 2023 National Survey on Drug Use and Health: Methodological summary and definitions. Substance Abuse and Mental Health Services Administration. https://www.samhsa.gov/data/report/2023-methodological-summary-and-definitions

Center for Behavioral Health Statistics and Quality. (2024c). 2022‑2023 National Surveys on Drug Use and Health: Model-based prevalence estimates (50 states and the District of Columbia). Substance Abuse and Mental Health Administration.

Center for Behavioral Health Statistics and Quality. (forthcoming a). 2022‑2023 National Surveys on Drug Use and Health: Comparison of population percentages from the United States, census regions, states, and the District of Columbia. Substance Abuse and Mental Health Services Administration.

Center for Behavioral Health Statistics and Quality. (forthcoming b). 2022‑2023 National Surveys on Drug Use and Health: Model-based estimated totals (in thousands) (50 states and the District of Columbia). Substance Abuse and Mental Health Services Administration.

Folsom, R. E., Shah, B., & Vaish, A. (1999). Substance abuse in states: A methodological report on model-based estimates from the 1994-1996 National Household Surveys on Drug Abuse. In Proceedings of the 1999 Joint Statistical Meetings, American Statistical Association, Survey Research Methods Section, Baltimore, MD (pp. 371‑375). American Statistical Association.

Ghosh, M. (1992). Constrained Bayes estimation with applications. Journal of the American Statistical Association, 87(418), 533-540. https://doi.org/10.2307/2290287 SAMHSA Exit Disclaimer

Leon, A. C., Olfson, M., Portera, L., Farber, L., & Sheehan, D. V. (1997). Assessing psychiatric impairment in primary care with the Sheehan Disability Scale. International Journal of Psychiatry in Medicine, 27(2), 93-105. https://doi.org/10.2190/t8em-c8yh-373n-1uwd SAMHSA Exit Disclaimer

Raftery, A. E., & Lewis, S. (1992). How many iterations in the Gibbs sampler? In J. M. Bernardo, J. O. Berger, A. P. Dawid, & A. F. M. Smith (Eds.), Bayesian statistics 4 (pp. 763‑774). Oxford University Press.

Rao, J. N. K. (2003). Small area estimation (Wiley Series in Survey Methodology) (1st ed.). John Wiley & Sons.

RTI International. (2013). SUDAAN® language manual, release 11.0.1.

SAS Institute Inc. (2017). SAS/STAT software: Release 14.1.

Shah, B. V., Barnwell, B. G., Folsom, R., & Vaish, A. (2000). Design consistent small area estimates using Gibbs algorithm for logistic models. In Proceedings of the 2000 Joint Statistical Meetings, American Statistical Association, Survey Research Methods Section, Indianapolis, IN (pp. 105‑111). American Statistical Association.

Singh, A. C., & Folsom, R. E. (2001, April 11-14). Hierarchical Bayes calibrated domain estimation via Metropolis-Hastings Step in MCMC with application to small areas. Presented at the International Conference on Small Area Estimation and Related Topics, Potomac, MD.

U.S. Food and Drug Administration. (2021). Rules, regulations and guidance. https://www.fda.gov/tobacco-products/products-guidance-regulations/rules-regulations-and-guidance.

Wright, D. (2003a). State estimates of substance use from the 2001 National Household Survey on Drug Abuse: Volume I. Findings (HHS Publication No. SMA 03‑3775, NHSDA Series H‑19). Substance Abuse and Mental Health Services Administration, Office of Applied Studies.

Wright, D. (2003b). State estimates of substance use from the 2001 National Household Survey on Drug Abuse: Volume II. Individual state tables and technical appendices (HHS Publication No. SMA 03‑3826, NHSDA Series H‑20). Substance Abuse and Mental Health Services Administration, Office of Applied Studies.

Wright, D., & Sathe, N. (2005). State estimates of substance use from the 2002‑2003 National Surveys on Drug Use and Health (HHS Publication No. SMA 05‑3989, NSDUH Series H‑26). Substance Abuse and Mental Health Services Administration, Office of Applied Studies.

Section E: List of Contributors

This National Survey on Drug Use and Health (NSDUH) document was prepared by the Center for Behavioral Health Statistics and Quality (CBHSQ), Substance Abuse and Mental Health Services Administration (SAMHSA), U.S. Department of Health and Human Services (HHS), and by RTI International, Research Triangle Park, North Carolina. Work by RTI was performed under Contract No. 75S20322C00001. Marlon Daniel served as government project officer and as the contracting officer representative.

This document was drafted by RTI and reviewed at SAMHSA. Production of the report at SAMHSA was managed by Rong Cai, Shiromani Gyawali, Jingsheng Yan, Xiaoting Qin, and Chiu-Fang Chou.

Endnotes

1 Use the NSDUH link on the following webpage: https://www.samhsa.gov/data/nsduh/state-reports-NSDUH-2023.

2 RTI International is a trade name of Research Triangle Institute. RTI and the RTI logo are U.S. registered trademarks of Research Triangle Institute.

3 The 2019-2020 state small area estimates were produced, but they have since been removed from SAMHSA’s website. Methodological investigations found that the unusual societal circumstances in 2020 and the resulting methodological revisions to NSDUH data collection have affected the comparability of 2020 estimates with estimates from 2019 and earlier. Consequently, estimates that involve combining data from 2020 with previous years have been removed from the SAMHSA website.

4 National small area estimates = Population-weighted averages of state-level small area estimates.

5 See Tables 1 to 41 in 2022‑2023 National Surveys on Drug Use and Health: Model-Based Estimated Totals (in Thousands) (50 States and the District of Columbia) (CBHSQ, forthcoming b).

6 For major depressive episode (MDE), receipt of mental health treatment, serious thoughts of suicide, suicide plans, and suicide attempts, estimates for people aged 12 or older are not included. For any mental illness (AMI), co‑occurring substance use disorder (SUD) and AMI, serious mental illness (SMI), and co‑occurring SUD and SMI, estimates for adolescents aged 12 to 17 and people aged 12 or older are not included because adolescents are not asked questions about mental illness.

7 Binge drinking is defined as having five or more drinks (for males) or four or more drinks (for females) on the same occasion on at least 1 day in the 30 days prior to the survey.

8 A DU in NSDUH refers to either a housing unit or a group quarter listing unit, such as a dormitory room or a shelter bed.

9 A successfully screened DU is one in which all screening questionnaire items were answered by an adult resident of the DU and either zero, one, or two DU members were selected for the NSDUH interview.

10 The usable case rule requires that a respondent answer “yes” or “no” to the question on lifetime use of cigarettes and “yes” or “no” to at least nine additional lifetime use questions.

11 The SAE expert panel, convened in 1999 and 2000, had six members: Dr. William Bell of the U.S. Bureau of the Census; Partha Lahiri, Professor of the Joint Program in Survey Methodology at the University of Maryland at College Park; Professor Balgobin Nandram of Worcester Polytechnic Institute; Wesley Schaible, formerly Associate Commissioner for Research and Evaluation at the Bureau of Labor Statistics; Professor J. N. K. Rao of Carleton University; and Professor Alan Zaslavsky of Harvard University.

12 See Tables 1 to 41 in 2022‑2023 Model-Based Prevalence Estimates (CBHSQ, 2024c).

13 The use of mixed models (fixed and random effects) allows additional error components (random effects) to be included. These account for differences between states and within-state variations that are not taken into account by the predictor variables (fixed effects) alone. It is also difficult (if not impossible) to produce valid mean squared errors (MSEs) for small area estimates based solely on a fixed-effect national regression model (i.e., synthetic estimation) (Rao, 2003, p. 52). The mixed models produce estimates that are approximately represented by a weighted combination of the direct estimate from the state data and a regression estimate from the national model. The regression coefficients of the national model are estimated using data from all of the states (i.e., borrowing strength), and the regression estimate for a particular state is obtained by applying the national model to the state-specific predictor data. The regression estimate for the state is then combined with the direct estimate from the state data in a weighted combination where the weights are obtained by minimizing the MSE (variance + squared bias) of the small area estimate.

14 To increase the precision of the estimated random effects at the within-state level, three SSRs from the 2022 and 2023 samples were grouped together to form 250 grouped SSRs. California had 12 grouped SSRs; Florida, New York, and Texas each had 10 grouped SSRs; Illinois, Michigan, Ohio, and Pennsylvania each had 8 grouped SSRs; Georgia, New Jersey, North Carolina, and Virginia each had 5 grouped SSRs; and the rest of the states and the District of Columbia each had 4 grouped SSRs.

15 In these cases, comparable 2021‑2022 state estimates are not available.

16 For details on how this outcome is calculated, see Section B.8 of this document.

17 Estimates of underage (aged 12 to 20) alcohol use were also produced.

18 Estimates of underage (aged 12 to 20) binge alcohol use were also produced.

19 Estimates of underage (aged 12 to 20) perceptions of great risk from having five or more drinks of an alcoholic beverage once or twice a week were also produced.

20 Estimates of underage (aged 12 to 20) tobacco product use were also produced.

21 Estimates of underage (aged 12 to 20) cigarette use in were also produced.

22 Estimates of underage (aged 12 to 20) alcohol use disorder in the past year were also produced.

23 Claritas is a market research firm headquartered in Cincinnati, Ohio (see https://claritas.com/ SAMHSA Exit Disclaimer).

24 To build parsimonious models, the combined 2022‑2023 data were partitioned into modeling and validation samples. For more information on how the data was partitioned, see the 2002‑2003 state SAE report (Wright & Sathe, 2005). Steps 2 to 4 were conducted on the modeling sample, whereas step 5 used the validation sample. Depending on the step, measure, and age group, significance levels were 1, 3, 5, or 10 percent.

25 Generally, age groups are 12 to 17, 18 to 25, 26 to 34, and 35 or older. For underage alcohol and tobacco related outcomes, the age group is 12 to 20.

26 See Table 15 in 2022‑2023 National Surveys of Drug Use and Health: Model-Based Prevalence Estimates (50 States and the District of Columbia) (CBHSQ, 2024c).

27 See Table 15 in 2022‑2023 NSDUH: Model-Based Estimated Totals (CBHSQ, forthcoming b).

28 See Table 15 in the 2022‑2023 NSDUH: Model-Based Prevalence Estimates report (CBHSQ, 2024c).

29 In NSDUH SAE documents prior to 2016-2017, the term “initiation” was referred to as “incidence.”

30 NSDUH respondents were asked the respective questions for alcohol use disorder or marijuana use disorder if they reported use of these substances on 6 or more days in the past year. Respondents were asked SUD questions for other substances if they reported any use in the past year.

31 “An MDE” refers to the occurrence of at least one MDE, rather than only one MDE. Similarly, reference to “the MDE” in a given period (e.g., the past 12 months) does not mean an individual had only one MDE in that period.

Long Descriptions—Equations

Long description, Equation 1. Capital S R R is equal to the ratio of two quantities. The numerator is the summation of the product of w sub d u and complete sub d u. The denominator is the summation of the product of w sub d u and eligible sub d u.

Long description end. Return to Equation 1.

Long description, Equation 2. Capital I R R is equal to the ratio of two quantities. The numerator is the summation of the product of w sub i and complete sub i. The denominator is the summation of the product of w sub i and selected sub i.

Long description end. Return to Equation 2.

Long description, Equation 3. Capital O R R is equal to the product of capital S R R and capital I R R.

Long description end. Return to Equation 3.

Long description, Equation 4. The model is given by the following equation: log of pi sub a, i, j, k divided by 1 minus pi sub a, i, j, k is equal to the sum of three terms. The first term is given by x transpose sub a, i, j, k times beta sub a. The second term is eta sub a, i. And the third term is nu sub a, i, j.

Long description end. Return to Equation 4.

Long description, Equation 5. Lower sub s and a is defined as the exponent of capital L sub s and a divided by the sum of 1 and the exponent of capital L sub s and a. And upper sub s and a is defined as the exponent of capital U sub s and a divided by the sum of 1 and the exponent of capital U sub s and a.

Long description end. Return to Equation 5.

Long description, Equation 6. Capital L sub s and a is defined as the difference of two quantities. The first quantity is the natural logarithm of the ratio of Theta sub s and a and 1 minus Theta sub s and a. The second quantity is the product of 1.96 and the square root of MSE sub s and a, which is the mean squared error for state-s and age group‑a.

Long description end. Return to Equation 6.

Long description, Equation 7. Capital U sub s and a is defined as the sum of two quantities. The first quantity is the natural logarithm of the ratio of Theta sub s and a and 1 minus Theta sub s and a. The second quantity is the product of 1.96 and the square root of MSE sub s and a, which is the mean squared error for state-s and age group‑a.

Long description end. Return to Equation 7.

Long description, Equation 8. The mean squared error, MSE sub s and a, is defined as the sum of two quantities. The first quantity is the square of the difference of two parts. Part 1 is defined as the natural logarithm of the ratio of capital P sub s and a and 1 minus capital P sub s and a. Part 2 is defined as the natural logarithm of the ratio of Theta sub s and a and 1 minus Theta sub s and a. The second quantity is the posterior variance of the natural logarithm of the ratio of capital P sub s and a and 1 minus capital P sub s and a.

Long description end. Return to Equation 8.

Long description, Equation 9. The average annual rate is defined as 100 times quantity q divided by 2. Quantity q is defined as capital X sub 1 divided by the sum of 0.5 times capital X sub 1 plus capital X sub 2.

Long description end. Return to Equation 9.

Long description, Equation 10. The average annual rate is defined as 100 times quantity q. Quantity q is defined as capital X sub 1 divided by the sum of capital X sub 1 plus capital X sub 2.

Long description end. Return to Equation 10.

Go to Top of Page