Skip To Content Table Of Contents
Click for DHHS Home Page
Click for the SAMHSA Home Page
Click for the OAS Drug Abuse Statistics Home Page
Click for What's New
Click for Recent Reports and HighlightsClick for Information by Topic Click for OAS Data Systems and more Pubs Click for Data on Specific Drugs of Use Click for Short Reports and Facts Click for Frequently Asked Questions Click for Publications Click to send OAS Comments, Questions and Requests Click for OAS Home Page Click for Substance Abuse and Mental Health Services Administration Home Page Click to Search Our Site

2001 National Household  Survey on Drug Abuse

Appendix B: Statistical Methods and Limitations of the Data


B.1 Target Population

An important limitation of the National Household Survey on Drug Abuse (NHSDA) estimates of drug use prevalence is that they are only designed to describe the target population of the survey—the civilian, noninstitutionalized population aged 12 or older. Although this population includes almost 98 percent of the total U.S. population aged 12 or older, it excludes some important and unique subpopulations who may have very different drug-using patterns. For example, the survey excludes active military personnel, who have been shown to have significantly lower rates of illicit drug use. Persons living in institutional group quarters, such as prisons and residential drug treatment centers, are not included in the NHSDA and have been shown in other surveys to have higher rates of illicit drug use. Also excluded are homeless persons not living in a shelter on the survey date, another population shown to have higher than average rates of illicit drug use. Appendix E describes other surveys that provide data for these populations.


B.2 Sampling Error and Statistical Significance

The national estimates, along with the associated variance components, were computed using a multiprocedure package, SUrvey DAta ANalysis (SUDAAN) Software for Statistical Analysis of Correlated Data, which was designed for the statistical analysis of sample survey data from stratified, multistage cluster samples (RTI, 2001). The final, nonresponse-adjusted, and poststratified analysis weights were used to compute unbiased design-based drug use estimates.

The sampling error (i.e., the standard error [SE]) of an estimate is the error caused by the selection of a sample instead of conducting a census of the population. Sampling error is reduced by selecting a large sample and by using efficient sample design and estimation strategies, such as stratification, optimal allocation, and ratio estimation.

With the use of probability sampling methods in the NHSDA, it is possible to develop estimates of sampling error from the survey data. These estimates have been calculated in SUDAAN for all estimates presented in this report using a Taylor series linearization approach that takes into account the effects of the complex NHSDA design features. The sampling errors are used to identify unreliable estimates and to test for the statistical significance of differences between estimates.


B.2.1 Variance Estimation for Totals

Estimates of proportions, , such as drug use prevalence rates, take the form of nonlinear statistics where the variances cannot be expressed in closed form. Variance estimation for nonlinear statistics in SUDAAN is performed using a first-order Taylor series approximation of the deviations of estimates from their expected values.

Corresponding to proportion estimates, , the number of drug users, , can be estimated as


where is the estimated population total for domain d, and p hat sub d is the estimated proportion for domain d. The SE for the total estimate is obtained by multiplying the SE of the proportion by , that is,


This approach is theoretically correct when the domain size estimates, , are among those forced to Census Bureau population projections through the weight calibration process. In these cases, is clearly not subject to sampling error.

For domain totals, , where is not fixed, this formulation may still provide a good approximation if it can be reasonably assumed that the sampling variation in is negligible relative to the sampling variation in . In most analyses conducted for prior years, this has been a reasonable assumption.

For a subset of the tables produced from the 2001 data, it was clear that the above approach yielded an underestimate of the variance of a total because was subject to considerable variation. In these cases, a different method was used to estimate variances. SUDAAN provides an option to directly estimate the variance of the linear statistic that estimates a population total. Using this option did not affect the SE estimates for the corresponding proportions presented in the same sets of tables.


B.2.2 Suppression Criteria for Unreliable Estimates

As has been done in past NHSDA reports, direct survey estimates considered to be unreliable due to unacceptably large sampling errors are not shown in this report and are noted by asterisks (*) in the tables containing such estimates found in the appendices. The criterion used for suppressing all direct survey estimates was based on the relative standard error (RSE), which is defined as the ratio of the standard error (SE) over the estimate.

Proportion estimates () within the range [0 < < 1], rates, and corresponding estimated number of users were suppressed if

RSE[(-ln()] > 0.175 when < 0.5


RSE[(-ln(1 - )] > 0.175 when > 0.5.

Using a first-order Taylor series approximation to estimate RSE[(-ln()] and RSE[(-ln(1 - )], the following was obtained and used for computational purposes:


> 0.175 when < 0.5



> 0.175 when > 0.5   .

The separate formulas for < 0.5 and > 0.5 produce a symmetric suppression rule (i.e., if is suppressed, then so will 1 - ). This ad hoc rule requires an effective sample size in excess of 50. When 0.05 < < 0.95, the symmetric property of the rule produces a local maximum effective sample size of 68 at = 0.5. Thus, estimates with these values of along with effective sample sizes falling below 68 are suppressed. A local minimum effective sample size of 50 occurs at = 0.2 and again at = 0.8 within this same interval, so estimates are suppressed for values of with effective sample sizes below 50.

Prior to the 2000 NHSDA, these varying sample size restrictions sometimes produced unusual occurrences of suppression for a particular combination of prevalence rates. For example, in some cases, lifetime prevalence rates near = 0.5 were suppressed (effective sample size was < 68 but > 50), while not suppressing the corresponding past year or past month estimates near = 0.2 (effective sample sizes were > 50). To reduce the occurrence of this type of inconsistency, a minimum effective sample size of 68 was added to the NHSDA suppression criteria starting in 2000. As approached 0.00 or 1.00 outside the interval (0.05, 0.95), the suppression criteria still required increasingly larger effective sample sizes. For example, if = 0.01 and 0.001, the effective sample size must exceed 152 and 684, respectively.

Also new to the NHSDA starting in 2000 were minimum nominal sample size suppression criteria (n = 100) that protect against unreliable estimates caused by small design effects and small nominal sample sizes. Prevalence estimates were also suppressed if they were close to 0 or 100 percent (i.e., if < .00005 or if > .99995).

Estimates of other totals (e.g., number of initiates) along with means and rates (both not bounded between 0 and 1) were suppressed if RSE() > 0.5. Additionally, estimates of the mean age at first use were suppressed if the sample size was smaller than 10 respondents; moreover, the estimated incidence rate and number of initiates were suppressed if they rounded to 0.

The suppression criteria for various NHSDA estimates are summarized in Table B.1 at the end of this appendix.


B.2.3 Statistical Significance of Differences

This section describes the methods used to compare prevalence estimates in this report. Customarily, the observed difference between estimates is evaluated in terms of its statistical significance. "Statistical significance" refers to the probability that a difference as large as that observed would occur due to random error in the estimates if there were no difference in the prevalence rates for the population groups being compared. The significance of observed differences in this report is generally reported at the 0.05 and 0.01 levels. When comparing 2000 and 2001 prevalence estimates, the null hypothesis (no difference in the 2000 and 2001 prevalence rates) can be tested against the alternative hypothesis (there is a difference in prevalence rates) using the standard difference in proportions test expressed as follows:

where 1 = 2000 estimate, 2 = 2001 estimate, var(1) = variance of 2000 estimate, var(2) = variance of 2001 estimate, and cov(1, 2) = covariance between 1 and 2.

Under the null hypothesis, Z is asymptotically distributed as a normal random variable. Calculated values of Z can therefore be referred to as the unit normal distribution to determine the corresponding probability level (i.e., p value). Because there is a 50 percent overlap in the sampled segments between the 2000 and 2001 NHSDAs, the covariance term in the formula for Z will, in general, be greater than 0. Estimates of Z, along with its p value, were calculated using SUDAAN, using the analysis weights and accounting for the sample design as described in Appendix A. A similar procedure and formula for Z were used for estimated totals and for comparing prevalence estimates for different population subgroups from the same data year.

When examining the effects of subgroup variables with more than two levels on a prevalence measure, a X  2 test of independence of the subgroup and the prevalence variables was conducted first to control the error level for multiple comparisons. If the X  2 test indicated some significant differences, the significance of each particular subgroup comparison discussed in the report was tested as indicated above. SUDAAN analytic procedures were used in all tests to properly account for the sample design.


B.3 Nonsampling Error

Nonsampling errors can occur from nonresponse, coding errors, computer processing errors, errors in the sampling frame, reporting errors, and other errors not due to sampling. Nonsampling errors are reduced through data editing, statistical adjustments for nonresponse, close monitoring and periodic retraining of interviewers, and improvement in various quality control procedures.

Although nonsampling errors can often be much larger than sampling errors, measurement of most nonsampling errors is difficult or impossible. However, some indication of the effects of some types of nonsampling errors can be obtained through proxy measures, such as response rates and from other research studies.


B.3.1 Screening and Interview Response Rate Patterns

Response rates for the NHSDA were stable for the period from 1994 to 1998, with the screening response rate at about 93 percent and the interview response rate at about 78 percent (response rates discussed in this appendix are weighted). In 1999, the computer-assisted interviewing (CAI) screening response rate was 89.6 percent, and the interview response rate was 68.6 percent. A more stable and experienced field interviewer (FI) workforce improved these rates in 2000 and continued in 2001. Of the 171,519 eligible households sampled for the 2001 NHSDA main study, 157,471 were successfully screened for a weighted screening response rate of 91.9 percent (Table B.2). In these screened households, a total of 89,745 sample persons were selected, and completed interviews were obtained from 68,929 of these sample persons, for a weighted interview response rate of 73.3 percent. A total of 13,478 (16.5 percent) sample persons were classified as refusals or parental refusals, 4,681 (5.3 percent) were not available or never at home, and 2,657 (4.9 percent) did not participate for various other reasons, such as physical or mental incompetence or language barrier (Table B.3). Tables B.4 and B.5 show the distribution of the selected sample by interview code and age group. The weighted interview response rate was highest among 12 to 17 year olds (82.2 percent), females (74.6 percent), blacks and Hispanics (75.0 and 78.8 percent, respectively), in nonmetropolitan areas (76.7 percent), and among persons residing in the Midwest (74.4 percent) (Table B.6).

The overall weighted response rate, defined as the product of the weighted screening response rate and weighted interview response rate, was 61.5 percent in 1999, 68.6 percent in 2000, and 67.3 percent in 2001. Nonresponse bias can be expressed as the product of the nonresponse rate (1-R) and the difference between the characteristic of interest between respondents and nonrespondents in the population (Pr - Pnr). Thus, assuming the quantity (Pr - Pnr) is fixed over time, the improvement in response rates in 2000 and 2001 over 1999 will result in estimates with lower nonresponse bias.


B.3.2 Inconsistent Responses and Item Nonresponse

Among survey participants, item response rates were above 97 percent for most questionnaire items. However, inconsistent responses for some items, including the drug use items, were common. Estimates of substance use from the NHSDA are based on the responses to multiple questions by respondents, so that the maximum amount of information is used in determining whether a respondent is classified as a drug user. Inconsistencies in responses are resolved through a logical editing process that involves some judgment on the part of survey analysts and is a potential source of nonsampling error. Because of the automatic routing through the CAI questionnaire (e.g., lifetime drug use questions that skip entire modules when answered "no"), there is less editing of this type than in the paper-and-pencil interviewing (PAPI) questionnaire used prior to the NHSDA redesign in 1999.

In addition, logical editing is used less often because with the CAI data, statistical imputation is relied upon more heavily to determine the final values of drug use variables in cases where there is the potential to use logical editing to make a determination. The combined amount of editing and imputation in the CAI data is still considerably less than the total amount used in prior PAPI surveys. For the 2001 CAI data, for example, 6.7 percent of the estimate of past month hallucinogen use was based on logically edited cases and 6.6 percent on imputed cases, for a combined amount of 13.3 percent. In the 1998 NHSDA (administered using PAPI), the amount of editing and imputation for past month hallucinogen use was 60 and 0 percent, respectively, for a total of 60 percent. The combined amount of editing and imputation for the estimate of past month heroin use was 5.7 percent for the 2001 CAI and 37.0 percent for the 1998 PAPI data.


B.3.3 Validity of Self-Reported Use

NHSDA estimates are based on self-reports of drug use, and their value depends on respondents' truthfulness and memory. Although many studies have generally established the validity of self-report data and the NHSDA procedures were designed to encourage honesty and recall, some degree of underreporting is assumed (Harrell, 1997; Harrison & Hughes, 1997; Rouse, Kozel, & Richards, 1985). No adjustment to NHSDA data is made to correct for this. The methodology used in the NHSDA has been shown to produce more valid results than other self-report methods (e.g., by telephone) (Aquilino, 1994; Turner, Lessler, & Gfroerer, 1992). However, comparisons of NHSDA data with data from surveys conducted in classrooms suggest that underreporting of drug use by youths in their homes may be substantial (Gfroerer, 1993; Gfroerer, Wright, & Kopstein, 1997).


B.4 Incidence Estimates

For diseases, the incidence rate for a population is defined as the number of new cases of the disease, N, divided by the person time, PT, of exposure or

IR =  N .

The person time of exposure can be measured for the full period of the study or for a shorter period. The person time of exposure ends at the time of diagnosis (e.g., Greenberg, Daniels, Flanders, Eley, & Boring, 1996, pp. 16-19). Similar conventions are applied for defining the incidence of first use of a substance.

Beginning in 1999, the NHSDA questionnaire allows for collection of year and month of first use for recent initiates. Month, day, and year of birth are also obtained directly or imputed in the process. In addition, the questionnaire call record provides the date of the interview. By imputing a day of first use within the year and month of first use reported or imputed, the key respondent inputs in terms of exact dates are known. Exposure time can be determined in terms of days and converted to an annual basis.

Having exact dates of birth and first use also allows the person time of exposure during the targeted period, t, to be determined. Let the target time period for measuring incidence be specified in terms of dates; for example, the period 1998 would be specified as

t = [t1, t2) = [1 Jan 1998, 1 Jan 1999),

a period that includes 1 January 1998 and all days up to but not including 1 January 1999. The target age group can also be defined by a half-open interval as a = [a1, a2). For example, the age group 12 to 17 would be defined by a = [12, 18) for persons at least age 12, but not yet age 18. If person i was in age group a during period t, the time and age interval, Lt,a,i, can then be determined by the intersection:

Lt,a,i = [t1, t2) [DOBi MOBiYOBi + a1,DOBiMOBiYOBi + a2),

assuming the time of birth can be written in terms of day (DOBi), month (MOBi), and year (YOBi). Either this intersection will be empty (Lt,a,i = Ø) or it will be designated by the half-open interval, Lt,a,i = [m1,i, m2,i), where

m1,i = Max{t1, (DOBiMOBiYOBi + a1)}


m2,i = Min{t2, (DOBiMOBiYOBi + a2).}

The date of first use, tfu,d,i, is also expressed as an exact date. An incident of first drug d use by person i in age group a occurs in time t if tfu,d,i [m1,i,m2,i). The indicator function Ii (d,a,t) used to count incidents of first use is set to 1 when tfu,d,i [m1,i,m2,i) and to 0 otherwise. The person-time exposure measured in years and denoted by ei(d,a,t) for a person i of age group a depends on the date of first use. If the date of first use precedes the target period (tfu,d,i < m1,i), then ei(d,a,t) = 0. If the date of first use occurs after the target period or if person i has never used drug d, then

If the date for first use occurs during the target period Lt,a,i, then

Note that both Ii (d,a,t) and ei(d,a,t) are set to 0 if the target period Lt,a,i is empty (i.e., person i is not in age group a during any part of time t). The incidence rate is then estimated as a weighted ratio estimate:

where the wi are the analytic weights.

Prior to the 1999 survey, exact date data were not available for computing incidence rates. For these rates, a person was considered to be of age a during the entire time interval t, if his/her ath birthday occurred during time interval t (generally, a single year). If the person initiated use during the year, the person-time exposure was approximated as one-half year for all such persons rather than computing it exactly for each person.

Because of the new methodology, the incidence estimates discussed in Chapter 5 are not strictly comparable with the estimates before the 1999 NHSDA. The estimates in this report are based on retrospective reports of age at first drug use by survey respondents interviewed during 1999 to 2001. Because they are based on retrospective reports as was the case for earlier estimates, they may be subject to some of the same kinds of biases.

Bias due to differential mortality occurs because some persons who were alive and exposed to the risk of first drug use in the historical periods shown in the tables died before the 1999-2001 NHSDAs were conducted. This bias is probably very small for estimates shown in this report. Incidence estimates are also affected by memory errors, including recall decay (tendency to forget events occurring long ago) and forward telescoping (tendency to report that an event occurred more recently than it actually did). These memory errors would both tend to result in estimates for earlier years (i.e., 1960s and 1970s) that are downwardly biased (because of recall decay) and estimates for later years that are upwardly biased (because of telescoping). There is also likely to be some underreporting bias due to social acceptability of drug use behaviors and respondents' fear of disclosure. This is likely to have the greatest impact on recent estimates, which reflect more recent use and reporting by younger respondents. Finally, for drug use that is frequently initiated at age 10 or younger, estimates based on retrospective reports 1 year later underestimate total incidence because 11-year-old (and younger) children are not sampled by the NHSDA. Prior analyses showed that alcohol and cigarette (any use) incidence estimates could be significantly affected by this. Therefore, for these drugs only 2000 age-specific, and not overall, estimates were made. Likewise for these drugs, 1999 estimates were made using 2001 NHSDA data and 1998 estimates were made using 2000 and 2001 NHSDA data.


B.5 Serious Mental Illness Estimates

For the 2001 NHSDA, mental health among adults was measured using a scale to ascertain serious mental illness (SMI). This scale consisted of six questions that ask respondents how frequently they experienced symptoms of psychological distress during the 1 month in the past year when they were at their worst emotionally. The use of this scale is based on a methodological study designed to evaluate several screening scales for measuring SMI in the NHSDA. These scales consisted of a truncated version of the World Health Organization (WHO) Composite International Diagnostic Interview Short Form (CIDI-SF) scale (Kessler, Andrews, Mroczek, Üstün, & Wittchen, 1998), the K10/K6 scale of nonspecific psychological distress (Furukawa, Andrews, Slade, & Kessler, in press), and the WHO Disability Assessment Schedule (WHO-DAS) (Rehm et al., 1999).

The methodological study to evaluate the scales consisted of 155 respondents selected from a first-stage sample of 1,000 adults aged 18 or older. First-stage respondents were selected from the Boston metropolitan area and screened on the telephone to determine whether they had any emotional problems. Respondents reporting emotional problems at the first stage were oversampled when selecting the 155 respondents at the second stage. The selected respondents were interviewed by trained clinicians their home using both the NHSDA methodology and using a structured clinical interview. The first interview included the three scales described above using audio computer-assisted self-interviewing (ACASI). Respondents completed the ACASI portion of the interview without discussing their answers with the clinician. After completing the ACASI interview, respondents were then interviewed using the 12-month nonpatient version of the Structured Clinical Interview for DSM-IV (SCID) (First, Spitzer, Gibbon, & Williams, 1997) and the Global Assessment of Functioning (GAF) (Endicott, Spitzer, Fleiss, & Cohen, 1976) to classify respondents as either having or not having SMI.

The data from the 155 respondents were analyzed using logistic regression analysis to predict SMI from the scores on the screening questions. Analysis of the model fit indicated that each of the scales alone and in combination were significant predictors of SMI and the best fitting models contained either the CIDI-SF or the K6/K10 alone. Receiver operating characteristic (ROC) curve analysis was used to evaluate the precision of the scales to discriminate between respondents with and without SMI. This analysis indicated that the K6 was the best predictor. The results of the methodological study are described in more detail in a forthcoming paper (Kessler et al., in press).

To score the items on the K6 scales, they were first coded from 0 to 4 and summed to yield a number between 0 and 24. This involved transforming response categories for the six questions (DSNERV1, DSHOPE, DSFIDG, DSNOCHR, DSEFFORT, and DSDOWN) given below so that "all of the time" is coded 4, "most of the time" is coded 3, "some of the time" 2, "a little of the time" 1, and "none of the time" 0, with "don't know" and "refuse" also coded 0. Summing across the transformed responses obtains a score with a range from 0 to 24. Respondents with a total score of 13 or greater were classified as having a past year SMI. This cutpoint was chosen to equalize false positives and false negatives.

The questions comprising the K6 scale are given below:

DSNERV1 Most people have periods when they are not at their best emotionally. Think of one month in the past 12 months when you were the most depressed, anxious, or emotionally stressed. If there was no month like this, think of a typical month.

During that month, how often did you feel nervous?

  1. All of the time
  2. Most of the time
  3. Some of the time
  4. A little of the time
  5. None of the time

Response categories are the same for the following questions:     
DSHOPE During that same month when you were at your worst emotionally . . .how often did you feel hopeless?
DSFIDG During that same month when you were at your worst emotionally . . . how often did you feel restless or fidgety?
DSNOCHR During that same month when you were at your worst emotionally . . . how often did you feel so sad or depressed that nothing could cheer you up?
DSEFFORT During that same month when you were at your worst emotionally . . . how often did you feel that everything was an effort?
DSDOWN During that same month when you were at your worst emotionally . . . how often did you feel down on yourself, no good, or worthless?


Table B.1 Summary of 2001 NHSDA Suppression Rules


Suppress if:

Prevalence rate,  , with nominal sample size, n, and design effect, deff

The estimated prevalence rate, , is < 0.00005 or > 0.99995, or

> 0.175 when < 0.5, or
> 0.175 when > 0.5, or

Effective n < 68, or

n < 100

where Effective n =

Note: The rounding portion of this suppression rule for prevalence rates will produce some estimates that round at one decimal place to 0.0 or 100.0 percent but are not suppressed from the tables.

Estimated number
(numerator of )
The estimated prevalence rate, , is suppressed.

Note: In some instances when is not suppressed, the estimated number may appear as a 0 in the tables; this means that the estimate is > 0 but < 500 (estimated numbers are shown in thousands).

Mean age at first use, , with nominal sample size, n

RSE() > 0.5, or

n < 10

Incidence rate,

Rounds to < 0.1 per 1,000 person-years of exposure, or

RSE() > 0.5

Number of initiates,

Rounds to < 1,000 initiates, or

RSE() > 0.5

Source: SAMHSA, Office of Applied Studies, National Household Survey on Drug Abuse, 2001.


Table B.2 Weighted Percentages and Sample Sizes for 1999 to 2001 NHSDAs, by Screening Result Code

Screening Result 1999 NHSDA 2000 NHSDA 2001 NHSDA
Sample Size Weighted Percentage Sample Size Weighted Percentage Sample Size Weighted Percentage
Total Sample 223,868 100.00 215,860 100.00 203,544 100.00
    Ineligible cases
36,026 15.78 33,284 15.09 32,025 15.40
    Eligible cases
187,842 84.22 182,576 84.91 171,519 84.60
Ineligibles 36,026 100.00 33,284 100.00 32,025 100.00
18,034 49.71 16,796 50.76 16,489 51.71
    Not a primary residence
4,516 12.90 4,506 13.26 4,706 14.69
    Not a dwelling unit
4,626 12.70 3,173 9.33 2,913 8.66
    All military personnel
482 1.22 414 1.21 327 0.93
    Other, ineligible
8,368 23.46 8,395 25.43 7,590 24.00
Eligible Cases 187,842 100.00 182,576 100.00 171,519 100.00
    Screening complete
169,166 89.63 169,769 92.84 157,471 91.86
      No one selected
101,537 54.19 99,999 55.36 90,530 52.11
      One selected
44,436 23.63 46,981 25.46 43,601 25.94
      Two selected
23,193 11.82 22,789 12.03 23,340 13.82
    Screening not complete
18,676 10.37 12,807 7.16 14,048 8.14
      No one home
4,291 2.38 3,238 1.82 3,383 1.90
      Respondent unavailable
651 0.36 415 0.24 392 0.24
      Physically or mentally incompetent
419 0.24 310 0.16 357 0.20
      Language barrier—Hispanic
102 0.06 83 0.05 130 0.09
      Language barrier—other
486 0.28 434 0.27 590 0.39
11,097 5.92 7,535 4.14 8,525 4.93
      Other, access denied
1,536 1.08 748 0.45 613 0.35
      Other, eligible
38 0.02 7 0.00 9 0.00
      Other, problem case
56 0.03 37 0.02 49 0.03

Source: SAMHSA, Office of Applied Studies, National Household Survey on Drug Abuse, 1999, 2000, and 2001.


Table B.3 Weighted Percentages and Sample Sizes for 1999 to 2001 NHSDAs, by Final Interview Code, among Persons Aged 12 or Older

Final Interview Code 1999 NHSDA 2000 NHSDA 2001 NHSDA
Sample Size Weighted Percentage Sample Size Weighted Percentage Sample Size Weighted Percentage
Total Selected Persons 89,883 100.00 91,961 100.00 89,745 100.00
Interview complete 66,706 68.55 71,764 73.93 68,929 73.31
No one at dwelling unit 1,795 2.13 1,776 2.02 1,728 2.00
Respondent unavailable 3,897 4.53 3,058 3.52 2,953 3.30
Breakoff 50 0.07 72 0.09 79 0.12
Physically/mentally incompetent 1,017 2.62 1,053 2.57 1,020 2.43
Language barrier—Spanish 168 0.12 109 0.08 190 0.17
Language barrier—Other 480 1.46 441 1.06 470 1.30
Refusal 11,276 17.98 10,109 14.99 10,961 15.60
Parental refusal 2,888 1.01 2,655 0.88 2,517 0.92
Other 1,606 1.53 924 0.86 898 0.86

Source: SAMHSA, Office of Applied Studies, National Household Survey on Drug Abuse, 1999, 2000, and 2001.


Table B.4 Weighted Percentages and Sample Sizes for 1999 to 2001 NHSDAs, by Final Interview Code, among Youths Aged 12 to 17

Final Interview Code 1999 NHSDA 2000 NHSDA 2001 NHSDA
Sample Size Weighted Percentage Sample Size Weighted Percentage Sample Size Weighted Percentage
Total Selected Persons 32,011 100.00 31,242 100.00 28,188 100.00
Interview complete 25,384 78.07 25,756 82.58 23,178 82.18
No one at dwelling unit 322 1.09 278 0.86 254 0.92
Respondent unavailable 872 3.04 617 2.05 551 2.13
Breakoff 13 0.03 18 0.05 17 0.05
Physically/mentally incompetent 244 0.76 234 0.76 219 0.79
Language barrier—Spanish 15 0.03 10 0.03 18 0.08
Language barrier—Other 58 0.18 50 0.20 34 0.11
Refusal 1,808 5.97 1,455 4.52 1,247 4.14
Parental refusal 2,885 9.50 2,641 8.35 2,517 8.95
Other 410 1.33 183 0.59 153 0.64

Source: SAMHSA, Office of Applied Studies, National Household Survey on Drug Abuse, 1999, 2000, and 2001.


Table B.5 Weighted Percentages and Sample Sizes for 1999 to 2001 NHSDAs, by Final Interview Code, among Persons Aged 18 or Older

Final Interview Code 1999 NHSDA 2000 NHSDA 2001 NHSDA
Sample Size Weighted Percentage Sample Size Weighted Percentage Sample Size Weighted Percentage
Total Selected Persons 57,872 100.00 60,719 100.00 61,557 100.00
Interview complete 41,322 67.41 46,008 72.92 45,751 72.29
No one at dwelling unit 1,473 2.25 1,498 2.16 1,474 2.12
Respondent unavailable 3,025 4.71 2,441 3.69 2,402 3.43
Breakoff 37 0.07 54 0.09 62 0.13
Physically/mentally incompetent 773 2.85 819 2.78 801 2.62
Language barrier—Spanish 153 0.13 99 0.09 172 0.18
Language barrier—Other 422 1.62 391 1.16 436 1.43
Refusal 9,468 19.41 8,654 16.22 9,714 16.92
Parental refusal 3 0.00 14 0.01 0 0.00
Other 1,196 1.55 741 0.89 745 0.88

Source: SAMHSA, Office of Applied Studies, National Household Survey on Drug Abuse, 1999, 2000, and 2001.


Table B.6 Response Rates and Sample Sizes for the 1999 to 2001 NHSDAs, by Demographic Characteristics

  1999 NHSDA 2000 NHSDA 2001 NHSDA
Selected Persons

Completed Interviews

Weighted Response Rate Selected Persons Completed Interviews Weighted Response Rate Selected Persons Completed Interviews Weighted Response Rate

89,883 66,706 68.55% 91,961 71,764 73.93% 89,745 68,929 73.31%
Age in Years
32,011 25,384 78.07% 31,242 25,756 82.58% 28,188 23,178 82.18%
30,439 22,151 71.21% 29,424 22,849 77.34% 30,304 22,931 75.51%
    26 or older
27,433 19,171 66.76% 31,295 23,159 72.17% 31,253 22,820 71.75%
43,883 31,987 67.12% 44,899 34,375 72.68% 43,949 33,109 71.92%
46,000 34,719 69.81% 47,062 37,389 75.09% 45,796 35,820 74.58%
11,203 8,755 74.59% 11,454 9,396 77.95% 10,885 8,777 78.78%
63,211 46,272 67.98% 64,517 49,631 73.39% 63,228 48,016 72.65%
10,552 8,044 70.39% 10,740 8,638 76.19% 10,584 8,295 74.98%
    All other races
4,917 3,635 59.28% 5,250 4,099 67.31% 5,048 3,841 66.65%
16,794 11,830 64.03% 18,959 14,394 71.68% 19,180 14,444 71.02%
24,885 18,103 69.63% 25,428 19,355 73.23% 25,560 19,212 73.25%
27,390 21,018 70.93% 27,217 22,041 76.38% 26,278 20,609 74.44%
20,814 15,755 67.47% 20,357 15,974 72.68% 18,727 14,664 73.51%
County Type
    Large metropolitan
36,101 25,901 65.15% 37,754 28,744 71.77% 35,395 26,403 71.00%
    Small metropolitan
30,642 22,612 69.98% 31,400 24,579 74.96% 31,740 24,575 74.66%
23,140 18,193 74.97% 22,807 18,441 77.58% 22,610 17,951 76.72%

Source: SAMHSA, Office of Applied Studies, National Household Survey on Drug Abuse, 1999, 2000, and 2001.

Table Of Contents

This page was last updated on June 16, 2008.