|
|
|||||||||||
|
|
|
|||||||||||
|
1998 National Household Survey on Drug Abuse |
||||||||||||
An important limitation of the NHSDA estimates of drug use prevalence is that they are only designed to describe the target population of the survey, the civilian noninstitutionalized population. Although this includes more than 98% of the total U.S. population, it does exclude some important and unique subpopulations who may have very different drug-using patterns. The survey excludes active military personnel, who have been shown to have significantly lower rates of illicit drug use. Persons living in institutional group quarters, such as prisons and residential drug treatment centers, are not covered in the NHSDA and have been shown in other surveys to have higher rates of illicit drug use. Also excluded are homeless persons not living in a shelter on the survey date, another population shown to have higher than average rates of illicit drug use. Appendix 3 describes other surveys that provide data for these populations.
The sampling error of an estimate is the error caused by the selection of a sample instead of conducting a census of the population. Sampling error is reduced by selecting a large sample and by using efficient sample design and estimation strategies such as stratification, optimal allocation, and ratio estimation.
With the use of probability sampling methods in the NHSDA, it is possible to develop estimates of sampling error from the survey data. These estimates have been calculated for all prevalence estimates presented in this report using a Taylor series linearization approach that takes into account the effects of the complex NHSDA design features. The sampling errors are used to identify unreliable estimates and to test for the statistical significance of differences between estimates.
Estimates considered to be unreliable due to unacceptably large sampling error are not shown in this report, and are noted by asterisks (*) in the tables in the appendix. The criterion used for suppressing estimates was based on the relative standard error (RSE), which is defined as the ratio of the standard error over the estimate. The log transformation of the proportion estimate (p) was used to calculate the RSE. Specifically, rates and corresponding estimated number of users were suppressed if:
RSE[-ln(p)] > 0.175 when p < .5
or RSE[-ln(1-p)] > 0.175 when p $ .5.
Estimates were also suppressed if they rounded to zero or 100 percent. This occurs if p < .0005 or if p $.9995. Statistical tests of significance have been computed for comparisons of estimates from 1998 with prior years. Results are shown in the appendix 5 tables. As indicated in the footnotes, significant differences are noted by "a" (significant at the .05 level of significance) and "b" (significant at the .01 level of significance). All changes described in this report as increases or decreases were tested and found to be significant at least at the .05 level, unless otherwise indicated.
Nonsampling errors such as nonresponse and reporting errors may affect the outcome of significance tests. Also, keep in mind that while a level of significance equal to .05 is used to determine statistical significance in these tables, large differences associated with slightly higher p-values (specifically those between .05 and .10) may be worth noting along with the p-values. Furthermore, statistically significant differences are not always meaningful, because the magnitude of difference may be small or because the significance may have occurred simply by chance. In a series of twenty independent tests, it is to be expected that one test will indicate significance merely by chance even if there is no real difference in the populations compared. In making more than one comparison among three or more percentages (comparing percentages within a table), there has been no attempt to adjust the level of significance to account for making simultaneous inferences (often referred to as multiple comparisons). Therefore, the probability of falsely rejecting the null hypothesis at least once in a family of k comparisons is higher than the significance level given for individual comparisons (in this report, either .01 or .05).
When making comparisons of estimates for different population subgroups from the same data year, the covariance term, which is usually small and positive, has typically been ignored. This results in somewhat conservative tests of hypotheses that will sometimes fail to establish statistical significance when in fact it exists.
Nonsampling errors occur from nonresponse, coding errors, computer processing errors, errors in the sampling frame, reporting errors, and other errors. Nonsampling errors are reduced through data editing, statistical adjustments for nonresponse, and close monitoring and periodic retraining of interviewers.
Although nonsampling errors can often be much larger than sampling errors, measurement of most nonsampling errors is difficult or impossible. However, some indication of the effects of some types of nonsampling errors can be obtained through proxy measures such as response rates and from other research studies.
Of the 80,866 eligible households sampled, 75,167 were successfully screened for a screening response rate of 93.0%. In these screened households, a total of 33,128 sample persons were selected, and completed interviews were obtained from 25,500 of these sample persons, for an interview response rate of 77.0%. Of the sample persons, 3,937 (11.9%) were classified as refusals, 2,300 (6.9%) were not available or never at home, and 1,304 (3.9%) did not participate for various other reasons, such as physical or mental incompetence or language barrier. The response rate was highest among the 12-17 year old age group (82%). Response rates were also higher among Hispanics (81%) than among blacks (80%) and whites (74%).
Among survey participants, item response rates were above 98% for most questionnaire items. However, inconsistent responses for some items, including the drug use items, are common. Estimates of drug use from the NHSDA are based on the responses to multiple questions by respondents, so that the maximum amount of information is used in determining whether a respondent is classified as a drug user. Inconsistencies in responses are resolved through a logical editing process that involves some judgement on the part of survey analysts and is a potential source of nonsampling error. A typical occurrence is when a respondent reports their most recent use of a drug as more than a month ago, but in a later question they reporthaving used in the past month. (This could occur because the interviewer may have developed greater rapport with the respondent in the latter stages of the interview, leading to more openness on the part of the respondent.) This respondent would be considered a past month user. For 1998, 21% of the estimate of past month marijuana use and 43% of the past month cocaine use estimate is based on such cases. Editing accounts for a smaller portion of past year estimates (16% for marijuana and 28% for cocaine), and generally accounts for similar proportions from year to year. An exception to this occurred for estimates of past year heroin use. The percent of this estimate that was based on editing was 27% in 1996, 45% in 1997, and 26% in 1998.
NHSDA estimates are based on self-reports of drug use, and their value depends on respondents' truthfulness and memory. Although many studies have generally established the validity of self-report data and the NHSDA procedures were designed to encourage honesty and recall, some degree of underreporting is assumed. No adjustment to NHSDA data is made to correct for this (Appendix 4 lists a number of references addressing the validity of self-reported drug use data). The methodology used in the NHSDA has been shown to produce more valid results than other self-report methods (e.g., by telephone) (Turner, Lessler, and Gfroerer 1992; Aquilino 1994). However, comparisons of NHSDA data with data from surveys conducted in classrooms suggest that underreporting of drug use by youth in their homes may be substantial (Gfroerer, Wright, and Kopstein 1997).
IV. Incidence Estimates
The incidence estimates discussed in section 8 of this report are based on retrospective reports of age at first drug use by survey respondents interviewed during 1994-98, and may be particularly subject to several biases.
Bias due to differential mortality occurs because some persons who were alive and exposed to the risk of first drug use in the historical periods shown in the tables died before the 1994-1998 NHSDAs were conducted. This bias is probably very small for estimates shown in this report. Incidence estimates are also affected by memory errors, including recall decay (tendency to forget events occurring long ago) and forward telescoping (tendency to report that an event occurred more recently than it actually did). These memory errors would both tend to result in estimates for earlier years (i.e., 1960s and 1970s) that are downwardly biased (because of recall decay) and estimates for later years that are upwardly biased (because of telescoping). There is also likely to be some underreporting bias due to social acceptability of drug use behaviors and respondents' fear of disclosure. This is likely to have the greatest impact on recent estimates, which reflect more recent use and reporting by younger respondents. Finally, for drug use that is frequently initiated at age 10 or younger, estimates based on retrospective reports one year later underestimate total incidence because 11 year old children are not sampled by the NHSDA. Prior analyses showed that alcohol and cigarette (any use) incidence estimates could be significantly affected by this. Therefore, for these drugs no 1997 estimates were made, and 1996 estimates were based only on the 1998 NHSDA.
A recent study (Johnson, Gerstein, and Rasinski 1998) concluded that the marijuana incidence trend from the NHSDA was biased because the reporting of initiation declines as the length of time between initiation and the survey increses. However, this study did not address very recent estimates, i.e., 1995-97, which could be biased because they reflect recent drug useand because they are heavily based on the reports of adolescents. In order to better understand the size of the biases and to assess the reliability of estimates for recent years, OAS performed an analysis of estimates based on single years of NHSDA data. This analysis focused on three drugs: cocaine, heroin, and marijuana. Using the survey data from 1994 to 1998, estimates were made of the number of initiates, the rate of initiation for youths age 12-17, and the rate of initiation for persons age 18-25. For the 1994 survey, an estimate was made for the year 1993. For the 1995 survey, another estimate was made for the year 1993. In this way, two recent estimates of the same year could be compared. Similarly, the 1995 and 1996 data provided two estimates for 1994, the 1996 and 1997 surveys provided two estimates for 1995, the 1997 and 1998 surveys provided two estimates for 1996. Since these calculations represent two measurements of the same population characteristic, they would ideally be the same. Examples of these estimates are shown in the following table:
Year of Initiation |
Avg. of Ratio of 1 Year Recall to 2-Year Recall | ||||||||
1993 |
1994 |
1995 |
1996 | ||||||
Year of Survey | |||||||||
1994 |
1995 |
1995 |
1996 |
1996 |
1997 |
1997 |
1998 | ||
Rate for Age 12-17 |
|||||||||
Marijuana |
59.2 |
53.7 |
74.2 |
75.2 |
75.7 |
73.6 |
83.2 |
75.6 |
1.055 |
Cocaine |
8.9 |
5.0 |
10.2 |
5.7 |
10.6 |
8.0 |
11.3 |
11.0 |
1.480 |
Heroin |
0.7 |
0.5 |
2.1 |
1.4 |
2.5 |
1.8 |
3.9 |
1.5 |
1.722 |
Rate for Age 18-25 |
|||||||||
Marijuana |
46.9 |
41.4 |
42.1 |
55.9 |
47.7 |
53.4 |
53.6 |
50.5 |
0.960 |
Cocaine |
12.8 |
12.8 |
9.9 |
11.8 |
13.8 |
14.7 |
14.8 |
13.9 |
0.961 |
Heroin |
0.1 |
1.4 |
1.4 |
2.1 |
2.4 |
1.9 |
2.3 |
3.0 |
0.692 |
Number of Initiates |
|||||||||
Marijuana |
2,035 |
1,783 |
2,251 |
2,548 |
2,368 |
2,443 |
2,540 |
2,384 |
1.015 |
Cocaine |
595 |
538 |
533 |
530 |
652 |
654 |
675 |
664 |
1.031 |
Heroin |
41 |
62 |
122 |
97 |
141 |
93 |
171 |
127 |
1.195 |
Drug initiation rates for youths age 12-17 for the more hard core drugs (like cocaine and heroin) appear to be most prone to bias. For example, on average across the four survey years, the estimate for the rate of initiation of cocaine use among youths age 12-17 was 48% higher the first time the estimate could be made than the second time. This indicates a probable bias in the estimation; however, it is unclear which estimate is the correct one. As a result, one should be cautious in interpreting any changes between the prior year and the most recent year in the initiation rates for youth of the more stigmatized drugs. Since there are only five years of data to estimate how the rate of incidence changes between the first year it can be estimated and thesecond, one should be cautious about inferring the magnitude of the bias (for example, that it is 48% for cocaine). In 1999 and thereafter the youth and young adult samples will be much larger, and more precise estimates of the bias will be possible.
The average rate of incidence for cocaine for persons age 18-25 for those same years is only 4% lower in the first year estimate compared to the second year. The overall number of cocaine initiates was only slightly higher (3%) the first year than the second.
For heroin, the rate of incidence among youths was 72% higher the first time the estimate could be made than the second. For persons age 18-25, the rate was 31% lower the first year than the second. The overall number of new initiates was 20% higher the first year it could be estimated than the second year. For marijuana, estimates of the number of initiates, the rate of initiation for youth 12-17, and the rate of initiation for persons age 18-25 all are fairly stable, with second year estimates varying no more than plus or minus 5% from the first year they were estimated.
Thus, analyses seem to show that the pattern of reporting varies by drug and implicitly by the degree of stigma. The more prevalent drugs that are based on much larger numbers of first users generally have rates of initiation that are more stable than the rarer, more stigmatized drugs like heroin and cocaine.
While the NHSDA collects data on the most severely affected drug users, the survey design is less suited to estimate these problems. The limitations that preclude more accurate estimates are primarily the sample size, coverage, and the use of a self-report. Because heavy drug use is relatively rare in the general population, the NHSDA captures a small number of these users, resulting in a relatively large sampling error. In addition to this instability resulting from the small sample, underestimation is believed to occur because many heavy drug users may not maintain stable addresses and, if located, may not be available for an interview. Finally, as with all NHSDA respondents, heavy drug users who participate in the survey may not always report their drug use accurately during the interview.
A new estimation procedure was designed at OAS to produce improved estimates of heavy drug use (Wright, Gfroerer and Epstein, 1997). This procedure uses external counts of the number of people in treatment for drug problems (from the Uniform Facility Data Set) and the number of arrests for non-traffic offenses (from the F.B.I.'s Uniform Crime Reports) to adjust NHSDA data. This ratio estimation procedure provides a partial adjustment that accounts for undercoverage of hard-to-reach populations and also adjusts for underreporting of drug use by survey respondents. However, it does not reduce sampling error.
Applications of this adjustment have resulted in 40-80 percent higher estimates of past month and past year heroin use and 20-40 percent higher estimates of frequent cocaine use.
The NHSDA is an important source of data for policy makers, not only because it provides measures of substance abuse for a single year, but also because the series of surveys over the last several years provides a measure of change in substance abuse in the population over time. Beginning in 1994, the NHSDA began using an improved questionnaire and estimation procedure based on a series of studies and consultations with drug survey experts and data users. Because this new methodology produces estimates that are not directly comparable to previous estimates, the 1979-1993 NHSDA estimates presented in this report were adjusted to account for the new methodology that was begun in 1994.
Nearly all of the 1979-1993 substance use prevalence estimates presented in this report were adjusted using a simple ratio correction factor that was estimated at the total population level using data from the pooled 1993 and 1994 NHSDAs. The remaining substance use prevalence estimates were adjusted by formally modeling the effect of the new methodology, relative to the old methodology, using data from the 1994 NHSDA. The modeling procedure was used for the more prevalent substance use measures that changed significantly between the old-and new-version NHSDA questionnaires. The modeling procedure was particularly desirable for the more prevalent measures because the procedure was able to use a greater number of potentially significant explanatory variables in the adjustment compared to the simple ratio correction factor. Each of the procedures are discussed in prior NHSDA reports.
This page was last updated on June 01, 2008. |