Item Nonresponse

Introduction
Additional nonresponse information
1. Item Nonresponse by Section
2. Item Nonresponse by Respondents
3. Item Nonresponse within Problem Sections

This section examines and quantifies the extent of missing data, formally called item nonresponse, in the NLSY79. To provide readers with a detailed view of this problem, six surveys are analyzed. Nonresponse rates are examined first in the 1979 survey and then in the surveys that occur at roughly five-year intervals (1984, 1989, 1994, 1998, and 2004). These years were chosen to capture the major changes in the NLSY79. Examining the 1979 survey shows the initial levels of nonresponse. Examining the 1984 survey shows the amount of nonresponse in the survey just before one part of the respondent pool was dropped. The 1989 data show nonresponse after the first set of NLSY79 respondents was dropped. The 1994 data show what occurred after users and interviewers were switched from paper-and-pencil interviewing (PAPI) to computer-assisted personal interviewing (CAPI). While no major survey changes occurred during the 1998 and 2004 surveys, these surveys show nonresponse rates after many respondents had participated around 20 times.

This section focuses on the three types of missing data: refusals, invalid skips, and don't knows. Overall, the section shows that in these six rounds of the NLSY79, 20 million questions were asked. Out of all the questions asked to respondents, about 1.5 percent do not have valid answers and are missing data. Of the three missing data categories, about half the missing data are don't knows and about half are invalid skips. Given the vast majority of invalid skips occur in paper-and-pencil years, the percentage of problems attributed to this category has been steadily falling as more computer survey rounds are fielded.

Introduction

Missing data, or nonresponse, happens in a number of ways in the NLSY79. First, a number of respondents do not participate at all, causing all information in that particular survey to be missing. Participation rates and reasons for noninterview in each survey round are discussed in the section on Retention & Reasons for Noninterview.

A second reason missing data occurs is that respondents do not provide a valid answer to a question. When this happens, interviewers make a determination about whether to mark the answer as a refusal or don't know value. Users should be cautioned that the assignment of refusals and don't knows is likely to vary across interviewers. Moreover, some respondents may believe it is impolite to refuse a question and decline to answer by saying they do not know. Hence, whether a question is marked either a refusal or a don't know is somewhat arbitrary. Note: Financial questions may often elicit the "refusal" or "don't know" responses. For more information about nonresponse to financial questions, see Appendix 26: Non-Response to Financial Questions and Entry Points.

The last major way missing data can occur is when the interviewer incorrectly follows the survey instrument's flow. Incorrect flows result in some respondents being skipped over a set of questions that should be answered while others answer questions that they should not have been asked. Data archivists have removed from the data most of the extraneous question responses. While extra information can be removed, missing data is not imputed in the NLSY79. Missing data caused by this reason is flagged with a special "invalid skip" code. The number of invalid skipped drops precipitously beginning in 1993 with the introduction of CAPI. Nevertheless, invalid skips are still possible in CAPI data. If the CAPI survey contains a programming mistake, the instrument could incorrectly sequence a respondent. When these errors are found, the CAPI survey is patched in the field to prevent further invalid skips but the incorrect cases are not asked the questions again.

All missing data are clearly flagged in the NLSY79 data set. Five negative numbers are used to indicate to users that the variable does not contain useful information. The five values are (-1) refusal, (-2) don't know, (-3) invalid skip, (-4) valid skip, and (-5) noninterview. These five numbers are reserved as missing value flags and, with a few exceptions (see Appendix 5: Supplemental Fertility and Relationship Variables), are rarely used in the NLSY79 for valid data values.

In the tables that follow, every attempt has been made to look at only variables in a given survey year that were filled in by either a respondent or an interviewer. The goal was to eliminate all created, machine check, date and time stamp, and variables generated in data post-processing from the analysis. Given there is no automatic way to check every question to see if it meets these criteria, the number of questions analyzed by the below tables overstates the number of questions actually filled in by the respondent or interviewer. The overstatement occurs because some questions with meaningful titles are actually hidden machine checks. While every effort was made to eliminate these questions it is impossible to eliminate all of them.

This section is not the only research on the extent of missing data in the NLS. Olsen (1992) investigated the effect of switching from PAPI to CAPI interviewing. His research shows fewer interviewer errors occur from navigating the instrument as well as fewer don't knows in the CAPI survey. More importantly, CAPI respondents appeared more willing to reveal sensitive material in the alcohol use section. Mott (1985, 1984, and 1983) examines the NLSY79's fertility data. In these reports, he examines the 1982 and 1983 surveys and finds very low refusal rates for the data in general. However, by shifting to a confidential abortion reporting method, the willingness to respond greatly increases. Mott (1998) examines the amount of missing data about the children of NLSY79 females. He finds that Hispanics or Latinos and, to a smaller extent blacks, have a much higher probability of not finishing the child assessments after starting the interview.

Additional nonresponse information

The Item Nonresponse by Section examines which sections of the NLSY79 have high nonresponse rates; the Item Nonresponse by Respondents examines how many times individuals do not respond to questions; and the Item Nonresponse within Problem Sections examines which particular questions in sections with high nonresponse rates are causing problems.

Click below to read more about each nonresponse topic.

This section examines and quantifies the extent of missing data, formally called item nonresponse, in each section of the NLSY79. The six tables below show which areas of the NLYS79 respondents are least likely to answer by tracking the total number and percentage of questions that have missing data for each group of respondents. To provide readers with a detailed view of this problem, six surveys are analyzed. Nonresponse rates are examined first in the 1979 survey and then in the surveys that occur at roughly five-year intervals (1984, 1989, 1994, 1998, and 2004). These years were chosen to capture the major changes in the NLSY79. Examining the 1979 survey shows the initial levels of nonresponse. Examining the 1984 survey shows the amount of nonresponse in the survey just before one part of the respondent pool was dropped. The 1989 data show nonresponse after the first set of NLSY79 respondents was dropped. The 1994 data show what occurred after users and interviewers were switched from paper-and-pencil interviewing (PAPI) to computer-assisted personal interviewing (CAPI). While no major survey changes occurred during the 1998 and 2004 surveys, these surveys show nonresponse rates after many respondents had participated around 20 times.

The first column of the tables contains the section names within the survey. The second column shows the total number of questions that all respondents and all interviewers should have answered in that section. This number is determined by first calculating within each section the number of questions each respondent should answer. A question is considered answerable if it does not have a valid skip (-4) or noninterview (-5) as its answer. A total for the section is obtained by summing up the answers for all NLSY79 respondents.

The third (don't know), fourth (refusal), and fifth (invalid skip) columns show the total number of nonresponses found in each section. Columns six, seven, and eight show the same information except in percentage form. The ninth column shows the total percentage of questions missed and is the sum of the previous three percentages. The last column, labeled rank, shows which sections have the most (closer to 1) and least (further from 1) amount of nonresponse.

The bottom row of each table combines the information and shows totals. For example, the bottom of the "Number Questions Asked" column in the 1979 survey shows that almost four million questions (3,975,146) were expected to be filled in by respondents or interviewers. While the 1979 survey contains many questions, other years are not far behind. In 1984, there were 3 million questions, 1989 had 1.8 million, 1994 had 3.7 million questions, 1998 had had 4.1 million questions and 2004 had 3.7 million. Readers are cautioned that each year of NLSY79 data contains far more data points since the tables exclude questions obviously labeled as machine checks, date and time stamps, and questions with valid skip or noninterview data flags.

The six tables show that the overall rate of missing data for many years dropped steadily over time. In 1979, 2.7 percent of the questions in the survey were not answered. This number drops to 1.9 percent in 1984 and then falls to 0.9 percent in 1989 and reaches a low point of 0.7 percent in 1994. After 1994 the number rises again with 0.92 percent in 1998 and 1.42 percent in 2004. Hence, nonresponse problems are of slightly less concern after the initial round of surveying.

Combining the data from all sections in all the tables shows the majority of nonresponse is caused by don't knows and invalid skips. The surveys examined asked a total of 20 million questions. Of these questions more than 140,000 or 0.7 percent were don't knows and slightly more than 127,000, or 0.6 percent were invalid skips. The last category, refusal, contains about 26,000 questions which is roughly 0.1 percent of all questions asked.

Examining the tables over time shows a steady decrease in the amount of data missing due to invalid skips. In 1979, invalid skips accounted for 2.1 percent of the questions asked. This number dropped sharply to 1.2 percent by 1984 and then down to 0.25 percent by 1989. Analysis indicated that CAPI dramatically lowered the problem of invalid skips with only 57 questions out of almost 3.7 million incorrectly skipped in 1994 and 75 questions out of 4 million in 1998.

While invalid skips fall over time, the percentage of refusals has increased slightly. Refusals accounted for 0.01 percent in 1979, 0.07 percent in 1984, 0.10 percent in 1989, 0.16 percent in 1994, 0.19 percent in 1998, and 0.20 percent in 2004. Nevertheless, while refusals steadily increase over time in absolute terms the numbers are still quite small.

While invalid skips fall and refusals are rising over time, the trend in don't knows is more complex. Don't knows accounted for 0.6 percent in 1979, 0.6 percent in 1984, 0.5 percent in 1989, 0.5 percent in 1994, 0.7 percent in 1998, and 1.1 percent in 2004. These figures suggest that don't knows are making a U-shaped pattern over time.

The last column, labeled rank, shows that missing data are not confined to a single section or area of the survey. Table 1.1 shows that in 1979 the work experience section, with 14.5 percent of the questions missing valid data, had the most problems. Fourteen percent of all questions asked in this section are labeled as invalid skips and only 0.5 percent of the questions were either refusals or don't knows. Military experience, the second most problematic section had almost half the rate of missing data (7.8 percent) as work experience. The table shows the problem of invalid skips is not related to subject matter since the section (rank 21 out of 21) with the least problems, titled "On Jobs," also focuses on labor market issues, like work experience.

While the "On Jobs" section of the survey consistently has the least problems in these surveys, the section with the most problems changes. Table 1.2, which examines the 1984 survey, shows the most problems in the "Fertility" section. Of the almost half-million questions asked in the fertility section, 5.6 percent contain missing data. While the majority of problems (3.4 percent) were due to invalid skips, a surprisingly large 2 percent of the missing responses are don't knows. The second most problematic section in the 1984 survey was "Drug Use", where 2.7 percent of the questions have missing data. Like "Fertility," the major portion of the problem is invalid skips (1.8 percent), but don't knows (0.8 percent) also account for a significant share. Interestingly, refusals account for only 0.1 percent, a relatively small proportion for a sensitive topic, suggesting that some of the don't knows were hidden refusals.

Scroll right to view additional table columns or click the link at the bottom of each table to open in a new window.

Table 1.1. Extent of refusals, don't knows, and invalid skips in 1979
Section Name	Number Questions Asked	Number Don't Knows	Number Refused	Number Invalid Skipped	Percent Don't Knows	Percent Refused	Percent Invalid Skipped	Total Percent Missed	Rank
Family Background	660803	6196	90	12292	0.94%	0.01%	1.86%	2.81%	7
Marital Status	32995	131	25	467	0.40%	0.08%	1.42%	1.89%	14
Fertility	82141	679	23	624	0.83%	0.03%	0.76%	1.61%	17
Schooling	402134	994	14	5592	0.25%	0.00%	1.39%	1.64%	16
Pay	211504	22	0	3482	0.01%	0.00%	1.65%	1.66%	15
World of Work	220185	2220	31	2883	1.01%	0.01%	1.31%	2.33%	10
Military	145619	491	24	10885	0.34%	0.02%	7.47%	7.83%	2
CPS	396697	862	8	10969	0.22%	0.00%	2.77%	2.98%	5
On Jobs	230982	135	2	903	0.06%	0.00%	0.39%	0.45%	21
Employer Supplement	291836	2009	69	3575	0.69%	0.02%	1.23%	1.94%	13
Last Job	44504	31	0	261	0.07%	0.00%	0.59%	0.66%	20
Work Experience	67695	288	15	9476	0.43%	0.02%	14.00%	14.45%	1
Gov't Training	36728	62	28	2124	0.17%	0.08%	5.78%	6.03%	3
Other Training	103662	52	0	2936	0.05%	0.00%	2.83%	2.88%	6
Not at Work	90768	79	7	5019	0.09%	0.01%	5.53%	5.62%	4
Health	67869	358	2	545	0.53%	0.00%	0.80%	1.33%	18
Significant Others	58816	669	0	585	1.14%	0.00%	0.99%	2.13%	12
Residences	52845	94	7	1029	0.18%	0.01%	1.95%	2.14%	11
Rotter Scale	202976	1277	15	521	0.63%	0.01%	0.26%	0.89%	19
Income & Assets	321685	1667	216	6813	0.52%	0.07%	2.12%	2.70%	8
Expectations	252702	3824	20	2092	1.51%	0.01%	0.83%	2.35%	9
Total	3975146	22140	596	83073	0.56%	0.01%	2.09%	2.66%	-

Table 1.2. Extent of refusals, don't knows, and invalid skips in 1984
Section Name	Number Questions Asked	Number Don't Knows	Number Refused	Number Invalid Skipped	Percent Don't Knows	Percent Refused	Percent Invalid Skipped	Total Percent Missed	Rank
Calendar	88462	8	0	4	0.01%	0.00%	0.00%	0.01%	15
Marital Status	50206	273	18	561	0.54%	0.04%	1.12%	1.70%	4
Schooling	324139	1031	469	2164	0.32%	0.14%	0.67%	1.13%	9
Military	123126	337	41	1352	0.27%	0.03%	1.10%	1.41%	7
CPS	333267	467	5	4270	0.14%	0.00%	1.28%	1.42%	6
On Jobs	140382	0	0	17	0.00%	0.00%	0.01%	0.01%	16
Gaps in Jobs	120601	15	0	175	0.01%	0.00%	0.15%	0.16%	13
Gov't Training	31226	38	0	59	0.12%	0.00%	0.19%	0.31%	12
Other Training	45002	7	0	736	0.02%	0.00%	1.64%	1.65%	5
Fertility	462288	9141	891	15739	1.98%	0.19%	3.40%	5.57%	1
Child Care	114317	201	13	1157	0.18%	0.01%	1.01%	1.20%	8
Health	52866	35	3	29	0.07%	0.01%	0.05%	0.13%	14
Alcohol	314511	33	47	2234	0.01%	0.01%	0.71%	0.74%	11
Drug Use	414007	3464	300	7454	0.84%	0.07%	1.80%	2.71%	2
Income & Assets	439646	2945	241	938	0.67%	0.05%	0.21%	0.94%	10
Attitudes	13427	214	2	29	1.59%	0.01%	0.22%	1.82%	3
Total	3067473	18209	2030	36918	0.59%	0.07%	1.20%	1.86%	-

Table 1.3 shows the amount of nonresponse in the 1989 survey. The most problematic section is "Income", missing data in 1.3 percent of its questions, with the CPS section a close second with 1.2 percent. Unlike earlier years, the major missing data problem in both the "Income" (1 percent) and CPS (0.8 percent) sections are don't knows, not invalid skips (0.1 percent income and 0.4 percent CPS).

Table 1.3. Extent of refusals, don't knows, and invalid skips in 1989
Section Name	Number Questions Asked	Number Don't Knows	Number Refused	Number Invalid Skipped	Percent Don't Knows	Percent Refused	Percent Invalid Skipped	Total Percent Missed	Rank
Intro	14647	20	1	41	0.14%	0.01%	0.28%	0.42%	7
Marital Status	86563	372	121	450	0.43%	0.14%	0.52%	1.09%	3
Schooling	76999	179	39	217	0.23%	0.05%	0.28%	0.56%	6
Military	33579	1	1	40	0.00%	0.00%	0.12%	0.13%	10
CPS	406265	3320	52	1650	0.82%	0.01%	0.41%	1.24%	2
On Jobs	39749	0	0	1	0.00%	0.00%	0.00%	0.00%	12
Gaps in Jobs	91565	91	1	894	0.10%	0.00%	0.98%	1.08%	4
Gov't Training	49657	118	35	233	0.24%	0.07%	0.47%	0.78%	5
Fertility	152546	6	35	92	0.00%	0.02%	0.06%	0.09%	11
Health	154024	120	74	168	0.08%	0.05%	0.11%	0.24%	9
Alcohol	217441	74	400	201	0.03%	0.18%	0.09%	0.31%	8
Income	470686	4761	1124	439	1.01%	0.24%	0.09%	1.34%	1
Total	1793721	9062	1883	4426	0.51%	0.10%	0.25%	0.86%	-

Table 1.4 shows that the most problematic area in the 1994 survey includes the asset questions, which are missing 2.5 percent of their answers (75 percent of those missing being don't knows). The second most problematic area includes income questions, which are missing 1.3 percent of their answers. While in the three previous surveys refusal rates were not an issue, the 1994 survey shows refusals are becoming significant. Slightly more than half a percent (0.6 percent) of the "Asset" section questions and more than one fifth of a percent (0.2 percent) of the "Income" section questions were refused.

Table 1.4. Extent of refusals, don't knows, and invalid skips in 1994
Section Name	Number Questions Asked	Number Don't Knows	Number Refused	Number Invalid Skipped	Percent Don't Knows	Percent Refused	Percent Invalid Skipped	Total Percent Missed	Rank
Intro	36251	62	14	0	0.17%	0.04%	0.00%	0.21%	12
Marital Status	137540	1522	193	0	1.11%	0.14%	0.00%	1.25%	3
School	60166	302	2	0	0.50%	0.00%	0.00%	0.51%	7
Military	27372	6	1	0	0.02%	0.00%	0.00%	0.03%	15
CPS	269452	28	9	0	0.01%	0.00%	0.00%	0.01%	17
On Jobs	79567	6	7	0	0.01%	0.01%	0.00%	0.02%	16
Employer Supplement	1060679	7092	1342	8	0.67%	0.13%	0.00%	0.80%	5
Training	194147	246	29	47	0.13%	0.01%	0.02%	0.17%	13
Fertility	450871	1859	763	0	0.41%	0.17%	0.00%	0.58%	6
Child Care	26453	109	12	0	0.41%	0.05%	0.00%	0.46%	9
Relationship	81477	285	113	0	0.35%	0.14%	0.00%	0.49%	8
Health	282702	623	199	0	0.22%	0.07%	0.00%	0.29%	11
Alcohol	164663	46	61	0	0.03%	0.04%	0.00%	0.06%	14
Income	305693	3176	672	1	1.04%	0.22%	0.00%	1.26%	2
Program Participation	118305	297	63	0	0.25%	0.05%	0.00%	0.30%	10
Assets	169301	3239	930	1	1.91%	0.55%	0.00%	2.46%	1
Drugs	204621	772	1626	0	0.38%	0.79%	0.00%	1.17%	4
Total	3669260	19670	6036	57	0.54%	0.16%	0.00%	0.70%	-

Table 1.5 examines the 1998 survey. Since the survey is fielded every other year in the late 1990s there is no 1999 interview, which would exactly continue the every five-year pattern. The 1998 survey is used as the closest substitute. This table, like the one for 1994, shows that the most problematic area is again the asset questions, which are missing 3.6 percent of their answers (75 percent of those missing being don't knows). The second most problematic area is the marital history questions, which added a new section that asked detailed questions about the work history and past life of the respondent's spouse. This expanded section is missing 1.8 percent of its answers. In the 1998 survey only two sections have relatively high refusal rates; assets (almost 0.6 percent) and drug use (0.79 percent).

Table 1.5. Extent of refusals, don't knows, and invalid skips in 1998
Section Name	Number Questions Asked	Number Don't Knows	Number Refused	Number Invalid Skipped	Percent Don't Knows	Percent Refused	Percent Invalid Skipped	Total Percent Missed	Rank
Intro	10060	6	4	0	0.06%	0.04%	0.00%	0.10%	12
Marital Status	207805	3296	520	1	1.59%	0.25%	0.00%	1.84%	2
School	53928	197	45	0	0.37%	0.08%	0.00%	0.56%	10
Military	25691	0	0	0	0.00%	0.00%	0.00%	0.00%	15
CPS	301160	44	12	0	0.01%	0.00%	0.00%	0.02%	13
On Jobs	117144	2	0	1	0.00%	0.00%	0.00%	0.00%	14
Employer Supplement	1081493	10265	1441	1	0.95%	0.13%	0.00%	1.08%	3
Training	241013	1559	143	1	0.65%	0.06%	0.00%	0.71%	7
Fertility	578831	3180	1097	50	0.55%	0.19%	0.01%	0.75%	6
Child Care	23241	57	11	1	0.25%	0.05%	0.00%	0.30%	11
Relationship	86632	371	154	0	0.43%	0.18%	0.00%	0.61%	9
Health	350533	2460	223	0	0.70%	0.06%	0.00%	0.77%	5
Income	608849	3410	847	10	0.56%	0.14%	0.00%	0.70%	8
Assets	174570	4702	1566	10	2.69%	0.90%	0.01%	3.60%	1
Drugs	217175	419	1485	0	0.19%	0.68%	0.00%	0.88%	4
Total	4078125	29968	7548	75	0.73%	0.19%	0.00%	0.92%	-

Table 1.6 examines the 2004 survey. This survey has two new sections that are not seen in the previous tables. The first section is found in the employer supplement and asks the respondent detailed questions about the pensions available from their employer and the respondent's participation in these pensions. This new section is ranked first in problems and has missing responses to 2.5% of all questions. The second new section is the over 40 health module. The goal of this section is to provide researchers with a baseline health measure that will be updated at ten year intervals. The health section is ranked 8^th out of 13 sections and has a nonresponse rate slightly more than three-quarters of one percent.

Table 1.6. Extent of refusals, don't knows, and invalid skips in 2004
Section Name	Number Questions Asked	Number Don't Knows	Number Refused	Number Invalid Skipped	Percent Don't Knows	Percent Refused	Percent Invalid Skipped	Total Percent Missed	Rank
Intro	91277	39	16	4	0.04%	0.02%	0.00%	0.06%	12
Marital Status	77954	371	66	106	0.48%	0.08%	0.14%	0.70%	9
School	56716	554	39	4	0.98%	0.07%	0.01%	1.05%	7
Military	39772	20	5	0	0.05%	0.01%	0.00%	0.06%	13
Employer Supplement	734366	7729	1001	275	1.05%	0.15%	0.04%	1.23%	6
Pensions	189861	3753	508	485	1.98%	0.27%	0.26%	2.50%	1
Training	307708	2943	887	322	0.96%	0.29%	0.10%	1.35%	5
Fertility	521658	5801	733	1216	1.11%	0.14%	0.23%	1.49%	3
Child Care	34561	12	4	7	0.03%	0.01%	0.02%	0.07%	11
Relationship	1004	2	0	0	0.20%	0.00%	0.00%	0.20%	10
Over 40 Health	622644	4386	402	14	0.70%	0.06%	0.00%	0.77%	8
Income	412656	4382	1199	39	1.06%	0.29%	0.01%	1.36%	4
Assets	626393	12726	2634	233	2.03%	0.42%	0.04%	2.49%	2
Total	3716570	42718	7494	2705	1.15%	0.20%	0.07%	1.42%	-

This section provides details on the amount of missing data associated with each respondent. Each table in this section shows the number of respondents who are missing data in one of the surveys. The tables are split into two parts. The left-hand part, columns one to four, shows the total number of questions that have missing data for each group of respondents. The right-hand part, columns five to nine, shows the percentage of questions that have missing data.

The top line of Tables 2.1.1 shows that in the 1979 survey, 12,527 respondents never refused to answer questions. While refusals are quite rare in this survey round, don't knows and incorrect skips are quite frequent. The top line shows that only 5,084 respondents had zero don't know responses and only 2,347 respondents were sent through the entire questionnaire without any sequencing errors. Subtracting these numbers from the 12,686 total respondents means that 60 percent, or 7,602 respondents, stated they did not know the answer to at least one question and 81. 5 percent, or 10,339 respondents, were incorrectly skipped somewhere in that questionnaire.

The top line of Table 2.1.2, which examines the percentage of questions missing data, shows a similar picture. Refusal rates are relatively low. There are 12,620 respondents who refused less than one percent of their questions, which means only 66 respondents refused one percent or more of the questions they were expected to answer. Thirty-five percent, or 8,185 respondents, answered don't know to less than one percent of their questions. Again, the largest group was respondents who were incorrectly skipped over questions. Only 4,313 respondents were incorrectly skipped over less than one percent of the questions, but 8,373 of the respondents were illegally skipped over one percent or more of their questions and 227 were skipped over more than 10 percent.

Refusal rates have increased steadily over time even though the more difficult respondents have presumably left the survey. Tables 2.2.1 and 2.2.2, which examine the 1984 survey, shows an increase over the 1979 refusal rates. While the number of respondents answering the survey is shrinking, the number refusing to answer questions is increasing. For example, while in 1979 only 10 respondents refused to answer more than 10 questions, in 1984 there were 41 respondents. This pattern of increase is evident in Tables 2.3.1 and 2.3.2, which examine 1989, through to Tables 2.6.1 and 2.6.2, which examine 2004. By 2004, there were 185 respondents who refused to answer more than 10 questions.

Increasing refusal rates are also seen in the percentage side of the table. In 1979, only 66 respondents refused to answer one percent or more of the questions they were asked. This increased in subsequent surveys to 320 respondents in 1984, 355 respondents in 1989, 480 respondents in 1994, 549 respondents in 1998, and 655 respondents in 2004.

"Don't know" rates have also risen over time. In the 1979 survey, 8,185 respondents had less than one percent of their questions labeled as don't knows. This number drops in 1984 to 7,003 respondents and further drops to 6,423 in 1989 and 5,942 in 1994, 4,741 in 1998 and 3,185 in 2004. While rates have risen, relatively few individuals have high levels of don't knows. In 1979, only 68 respondents didn't know the answer to more than five percent of the questions they were asked. This number falls to 19 respondents in 1984 and then rises to 66 in 1989 before falling back to 46 respondents in 1994 and then jumps back to 66 in 1998, and ends with 149 in 2004.

While don't know and refusal rates have risen, incorrect skip problems have clearly shrunk over time. In 1979, there were only 2,347 respondents who were correctly sequenced through the entire survey. In 1984, this number rises to 7,802 respondents, followed by a rise to 9,334 respondents in 1989. In 1994 and 1998 almost every respondent was correctly sequenced. Only 57 and 46 respondents were incorrectly skipped through part of the survey in each year respectively. Moreover, most of the respondents were only incorrectly skipped in a single question. In 2004 there were 349 respondents who were incorrectly skipped through one percent of their questions and 22 who were incorrectly skipped through 2 percent or more.

Nonresponse by Respondents in 1979 survey

Table 2.1.1 Number of respondents with missing data by number of questions in 1979 survey
Number of Questions	Number of Respondents
Number of Questions	Refused	Didn't Know	Was Incorrectly Skipped Over
0	12527	5084	2347
1	91	2974	1897
2	26	1723	1393
3	13	1016	1158
4	5	629	838
5	2	376	596
6	1	228	489
7	3	173	502
8	3	131	420
9	1	84	340
10	4	57	308
> 10	10	211	2398

Table 2.1.2 Number of respondents with missing data by percent of questions in 1979 survey
Percent of Questions	Number of Respondents
Percent of Questions	Refused	Didn't Know	Was Incorrectly Skipped Over
0%	12620	8185	4313
1%	43	3247	3421
2%	7	773	1733
3%	5	264	989
4%	5	101	621
5%	0	48	397
6%	2	27	312
7%	1	18	278
8%	1	6	206
9%	0	7	118
10%	0	2	71
> 10%	2	8	227

Nonresponse by Respondents in 1984 survey

Table 2.2.1 Number of respondents with missing data by number of questions in 1984 survey
Number of Questions	Number of Respondents
Number of Questions	Refused	Didn't Know	Was Incorrectly Skipped Over
0	11222	4549	7802
1	610	3012	1289
2	73	1901	622
3	44	1136	413
4	38	668	252
5	13	345	369
6	6	177	174
7	1	108	93
8	7	63	115
9	4	38	73
10	10	28	64
> 10	41	44	803
Note: Not included in this table are 617 respondents who did not answer the survey.

Table 2.2.2 Number of respondents with missing data by percent of questions in 1984 survey
Percent of Questions	Number of Respondents
Percent of Questions	Refused	Didn't Know	Was Incorrectly Skipped Over
0%	11749	7003	8956
1%	207	3807	1267
2%	44	944	674
3%	13	213	284
4%	15	62	133
5%	13	21	84
6%	10	11	139
7%	4	2	137
8%	5	3	107
9%	2	0	68
10%	2	3	36
> 10%	5	0	184
Note: Not included in this table are 617 respondents who did not answer the survey.

Nonresponse by Respondents in 1989 survey

Table 2.3.1 Number of respondents with missing data by number of questions in 1989survey
Number of Questions	Number of Respondents
Number of Questions	Refused	Didn't Know	Was Incorrectly Skipped Over
0	10221	6135	9334
1	171	2517	781
2	59	1036	189
3	37	395	35
4	20	194	20
5	21	131	16
6	7	75	7
7	10	34	125
8	10	24	18
9	4	10	9
10	7	6	3
> 10	38	48	68
10%	3	8	3
Note: Not included in this table are 2,081 respondents who did not answer the survey.

Table 2.3.2 Number of respondents with missing data by percent of questions in 1989 survey
Percent of Questions	Number of Respondents
Percent of Questions	Refused	Didn't Know	Was Incorrectly Skipped Over
0%	10250	6423	9461
1%	193	3221	843
2%	58	561	51
3%	35	219	69
4%	13	76	86
5%	10	39	24
6%	4	24	10
7%	4	17	10
8%	3	1	5
9%	3	3	9
> 10%	29	13	34
Note: Not included in this table are 2,081 respondents who did not answer the survey.

Nonresponse by Respondents in 1994 survey

Table 2.4.1 Number of respondents with missing data by number of questions in 1994 survey
Number of Questions	Number of Respondents
Number of Questions	Refused	Didn't Know	Was Incorrectly Skipped Over
0	7168	3559	8832
1	1129	1780	57
2	191	1082	0
3	87	693	0
4	41	443	0
5	28	334	0
6	29	232	0
7	22	171	0
8	21	115	0
9	17	105	0
10	18	72	0
> 10	138	303	0
Note: Not included in this table are 3,797 respondents who did not answer the survey.

Table 2.4.2 Number of respondents with missing data by percent of questions in 1994 survey
Percent of Questions	Number of Respondents
Percent of Questions	Refused	Didn't Know	Was Incorrectly Skipped Over
0%	8409	5942	8889
1%	246	2060	0
2%	81	558	0
3%	41	165	0
4%	31	79	0
5%	20	39	0
6%	19	16	0
7%	6	15	0
8%	10	4	0
9%	9	2	0
10%	4	2	0
> 10%	13	7	0
Note: Not included in this table are 3,797 respondents who did not answer the survey.

Nonresponse by Respondents in 1998 survey

Table 2.5.1 Number of respondents with missing data by number of questions in 1998 survey
Number of Questions	Number of Respondents
Number of Questions	Refused	Didn't Know	Was Incorrectly Skipped Over
0	7248	2497	8353
1	473	1355	21
2	162	1020	23
3	83	729	0
4	60	589	2
5	42	447	0
6	35	343	0
7	26	277	0
8	19	201	0
9	23	169	0
10	12	120	0
> 10	216	652	0
Note: Not included in this table are 4,287 respondents who did not answer the survey.

Table 2.5.2 Number of respondents with missing data by percent of questions in 1998 survey
Percent of Questions	Number of Respondents
Percent of Questions	Refused	Didn't Know	Was Incorrectly Skipped Over
0%	7850	4741	8385
1%	254	2441	13
2%	86	712	0
3%	58	283	1
4%	54	110	0
5%	27	46	0
6%	30	25	0
7%	14	11	0
8%	4	7	0
9%	8	9	0
10%	2	5	0
> 10%	12	9	0
Note: Not included in this table are 4,287 respondents who did not answer the survey.

Nonresponse by Respondents in 2004 survey

Table 2.6.1 Number of respondents with missing data by number of questions in 2004 survey
Number of Questions	Number of Respondents
Number of Questions	Refused	Didn't Know	Was Incorrectly Skipped Over
0	6531	1524	6539
1	298	993	440
2	194	755	334
3	171	624	145
4	78	592	42
5	45	486	98
6	51	387	29
7	45	360	13
8	29	314	3
9	23	235	5
10	11	178	7
> 10	185	1213	6
Note: Not included in this table are 5,025 respondents who did not answer the survey.

Table 2.6.2 Number of respondents with missing data by percent of questions in 2004 survey
Percent of Questions	Number of Respondents
Percent of Questions	Refused	Didn't Know	Was Incorrectly Skipped Over
0%	7006	3185	7290
1%	384	2399	349
2%	106	1122	18
3%	48	477	2
4%	40	226	1
5%	18	103	0
6%	16	68	0
7%	10	29	0
8%	8	14	0
9%	8	17	0
10%	3	6	1
> 10%	14	15	0
Note: Not included in this table are 5,025 respondents who did not answer the survey.

How much missing data are associated with particular questions? This section provides readers with an in-depth view of the questions within survey sections having a high amount of missing data. Like the previous parts, this section provides tables for each of the selected survey years. The first table (Table 3.1) examines questions from the 1979 survey's "Work Experience" section. This section has more missing data (14.5 percent) than any other 1979 survey section. The second set of tables (Tables 3.2 through 3.6) examines the most problematic section of the 1984 survey, "Fertility and Abortion." The third set of tables (Tables 3.7 and 3.8) examines the most problematic 1989 survey section, "Income and Assets." Since the 1994 "Income and Asset" section again ranked first in missing data, the next set of tables (Tables 3.9 and 3.10) substitutes the "Drug and Alcohol Use Supplements," given the high degree of research interest in understanding nonresponse in these sections. Table 3.11 highlights nonresponse in 1998 in the Marital History section. Table 3.12 tracks nonresponse problems in the over-40 health section.

To ensure the sets of tables are not overwhelming, all sections that could be naturally divided are split (Fertility, for instance). Additionally, only the most important question or questions with high rates of nonresponse are shown. Table 3.1, which examines the amount of missing data in the 1979 survey, shows the highest amount of missing data are associated with a pair of retrospective questions that asked respondents to remember what happened two years earlier. Interviewers incorrectly skipped slightly less than 1,750 respondents over R01150., weeks worked in 1977, and R01153., hours worked per week in 1977. Examining the 1979 questionnaire shows that these questions appear at the bottom of a page. Prior to these questions is a fairly complicated half page of instructions and questions that the interviewer must read, understand, and partially speak. It seems likely that many interviewers did not understand the instructions and skipped to the next page.

Table 3.1. Amount of missing data per question in the Work Experience section in 1979 survey
Reference Number	Variable Title	Invalid	Don't Know	Refusal
R01150.	Weeks Work in 1977	1735	11	1
R01151.	Weeks Work in 1976	418	18	1
R01152.	Weeks Work in 1975	240	11	0
R01153.	Hours/Week Work in 1977	1749	13	0
R01154.	Hours/Week Work in 1976	459	16	0
R01165.	Industry of 1^st Job after School	628	4	1
R01166.	Occupation at 1^st Job after School	627	3	1
R01167.	Hours/Week Work at 1^st Job after School	631	6	1
R01168.	Hours/Day at 1^st Job after School	632	6	1
R01169.	Rate of Pay at 1^st Job after School	632	32	2

Tables 3.2-3.6, which examine the "Fertility" section, show a much lower number of invalid skips in all parts except in the abortion questions. While invalid skips do not reach the level seen in Table 3.1, on average 190 female respondents were not asked each abortion question (190 is an average from all abortion questions, not just those shown in the tables). The table also shows a number of other trends. First, respondents have higher levels of don't know answers the more precise the question being asked. For example, in Table 3.2, when males were asked the date of birth of their first child, only one did not know the year, three did not know the month and 10 did not know the day. This phenomena is most clearly seen in Table 3.5, which shows the year and month of the respondent's first sexual encounter. Only 43 respondents did not know the year, but 1,410 respondents did not know the month. This problem with dates is also seen in the abortion data where only four respondents did not know the year when they had their first abortion, but 13 did not know the month.

Refusal rates in the "Fertility" section are quite low except for a number of key questions. Asking the number of times they had sex in the last month elicited high rates of refusal for males and females. This question elicited 167 male and 135 female refusals. Interestingly, most individuals were willing to answer if they ever had sex since only 45 males and 54 females refused to answer these questions. Birth control questions did not have exceptionally high rates of refusal. Seventeen female respondents and no males refused to answer the birth control questions. Table 3.6 shows that 28 females refused to answer if they ever had an abortion and 28 more refused to state if they dropped out of school before they terminated the pregnancy.

Table 3.2. Amount of missing data per question in male Fertility section in 1984 survey
Reference Number	Variable Title	Invalid	Don't Know	Refusal
R13017.	Ever Had Any Children	0	3	0
R13019.	Month Birth Child#1 Born	41	3	0
R13020.	Day Birth Child #1 Born	45	10	0
R13021.	Year Birth Child#1 Born	39	1	0
R13022.	Sex of Child#1 Born	3	0	0
R13115.	Total #Children Expect to Have	12	45	3
R13117.	#Years Expect Have 1^st/Next Child	22	120	0
R13118.	Had Any Children/Expecting	0	7	0
R13119.	Current Pregnancy Planned	131	0	0
R13121.	Ever Had Sexual Intercourse	12	0	45
R13122.	Age @First Sexual Intercourse	28	19	23
R13123.	#Times Sexual Intercourse Past Month	11	68	167
R13124.	Is Partner Now Pregnant	0	1	0
R13125.	Use Any Birth Control During Last Month	15	2	0
R13126.	#Times Try Prevent Pregnancy	65	0	0
R13127.-R13141.	Method of Birth Control	16	0	0
R13142.	Ever Have a Sex Education Course	10	0	12
R13148.	Month Took Sex-Ed Course	73	564	0
R13149.	Year Took Sex-Ed Course	36	58	0
R13150.	Time When Pregnancy Most Likely	19	1480	20

Table 3.3. Amount of missing data per question in female Fertility section in 1984 survey
Reference Number	Variable Title	Invalid	Don't Know	Refusal
R13191.	#Pregnancies	8	0	0
R13251.	Use Any Birth Control before Preg#1	18	0	1
R13254.	Want Be Pregnant before Preg#1	20	0	0
R13255.	Husband/Partner Want Preg#1	19	20	0
R13283.	Get Prenatal Care Preg#1	57	0	0
R13286.	Frequency Alcohol Use Preg#1	58	0	0
R13288.	#Cigarettes Smoked Preg#1	56	0	0
R13297.	X-Rays Taken Preg#1	57	0	0
R13302.	Sonogram Preg#1	57	6	0
R13358.	Amniocentesis Preg#1	57	0	0
R13411.	Took Vitamins Preg#1	57	0	0
R13443.	C-Section Child#1 Born	52	0	0
R13445.	Weight at Delivery, Preg#1	53	5	1
R13446.	Weight before Preg#1	51	5	1
R13449.	Length Child#1 Born at Birth	53	20	0
R13667.	Weight of Child#1 @Birth Lbs	25	6	0

Table 3.4. Amount of missing data per question in feeding part of Fertility section in 1984 survey
Reference Number	Variable Title	Invalid	Don't Know
R13670.	Child#1 Breastfed	27	0
R13672.	Month Age Child#1 Breast Fed Ended	27	1
R13674.	Month Age Child#1 Formula Fed	38	3
R13693.	Wk Age Child#1 Formula Fed Ended	57	0
R13694.	Month Age Child#1 Formula Fed Ended	57	6
R13696.	Months Age Child#1 - Cow's Milk	81	10
R13698.	Months Age Child#1 - Solid Food	86	10

Table 3.5. Amount of missing data per question in child part of Fertility section in 1984 survey
Reference Number	Variable Title	Invalid	Don't Know	Refusal
R13791.	Age Had 1^st Menstrual Period	8	14	22
R13792.	Year 1^st Menstrual Period	0	7	0
R13793.	Month Had 1^st Menstrual Period	17	2207	1
R13794.	R Ever Been Pregnant	0	1	0
R13795.	Ever Had Sexual Intercourse	4	0	54
R13796.	Age First Sexual Intercourse	5	26	78
R13797.	Year 1^st Sexual Intercourse	0	43	66
R13798.	Month Sexual Intercourse 1^st Time	19	1410	75
R13799.	#Times Sexual Intercourse Past Month	9	104	135
R13802.	#Times Try Prevent Pregnant Past Month	17	0	2

Table 3.6. Amount of missing data per question in abortion questions of Fertility section in 1984 survey
Reference Number	Variable Title	Invalid	Don't Know	Refusal
R13827.	Ever Had An Abortion	135	0	28
R13828.	# of Abortions	143	0	0
R13830.	Year of 1^st Reported Abortion	196	4	0
R13837.	Drop out School #1 Pregnant	155	0	28
R13839.	Year Left School 1^st Time Pregnant	164	0	0
R13841.	Year Return School Time#1 after Pregnant	258	0	0

Tables 3.7 and 3.8 examine the "Income and Assets" section of the 1989 survey. While invalid skips are relatively rare in this section, refusals and don't know answers are fairly prevalent. The question with the highest amount of missing income data is R29822., which asks how much income was earned by other adults living in the household who were related to the respondent. While the previous questions showed that most respondents knew the type of income received by these family members, 958 could not come up with a specific amount. The second most problematic question with 11 invalid skips, 155 don't knows, and 113 refusals was R29714., which asked the respondent how much they earned from wages, salary, and tips.

Other questions with high numbers of don't knows are R29813., which asked about the amount of money received from other sources like interest and dividends, R29825., which asks about a partner's income, and R29827., which asks the number of exemptions used when filing a Federal tax return.

The asset table (Table 3.8) also shows invalid skips are rare but don't know and refusal rates are not. Surprisingly, one of the questions with the highest amount of missing data (315 missing answers) asks, "how much is your car worth (R29852.)?" Another question missing many observations asks the amount of the respondent's savings (R29835.). While the car worth question primarily elicits don't knows, the savings question resulted in 160 refusals. Three other questions elicited high numbers of don't knows: value of stocks and bonds (R29837.) - 219 don't knows; amount taken out of savings last year (R29842.) - 222 don't knows; and the market value of other items such as jewelry (R29854.) - 151 don't knows.

Table 3.7. Amount of missing data per question in Income section in 1989 survey
Reference Number	Variable Title	Invalid	Don't Know	Refusal
R29714.	Amount Rec from Wages/Salary/Tips	11	155	113
R29715.	In 1988 Receive Income from Own Business	1	0	11
R29717.	How Much Did R Receive after Expenses	6	49	23
R29732.	Amount Rec'd Per Week from Unemployment	0	5	1
R29736.	Amount Sp Rec'd 1988 from Wages	16	17	70
R29754.	How Much Did Sp Receive from Unemployment	8	12	0
R29758.	R/Spouse Rec'd Money for Child Support	1	1	10
R29759.	Amount R/Spouse Rec'd Child Support	2	14	2
R29760.	R/Spouse Rec'd AFDC Payments	0	4	9
R29774.	R/Spouse Rec'd Food Stamps	0	2	10
R29788.	R/Spouse Rec'd SSI/Public Assistance	0	4	9
R29808.	Rec'd Veteran Benefits	1	1	10
R29812.	R/Spouse Rec'd Money from Oth So	0	2	16
R29822.	Income Rec'd by Adults Related To R	7	958	8
R29825.	Total Income Rec'd before Deduct	2	200	4
R29826.	Sp File Federal Income Tax R	0	2	13
R29827.	R'S Filing Status on Federal Ret	11	8	2
R29828.	Exemptions Filed on 1988 Federal Tax	62	92	3

Table 3.8. Amount of missing data per question in Asset section in 1989 survey
Reference Number	Variable Title	Invalid	Don't Know	Refusal
R29831.	Amount Property Selling for on Today	5	53	10
R29832.	Amount R Owes on Property	4	85	25
R29833.	Amount Other Debt R Owes on Property	12	26	27
R29835.	Amount of Savings	7	166	160
R29837.	Current Market Value of Stocks	2	219	23
R29838.	R/Spouse Have Rights to Estate	2	3	18
R29839.	Total Value of Estate	3	90	6
R29840.	Put Money in/out of Savings	1	3	28
R29841.	How Much More Money Put in	6	110	53
R29842.	How Much More Money Take out	5	222	21
R29843.	R Have Business Investment	0	1	12
R29844.	R Have Investment in a Farm	4	0	0
R29847.	Total Market Value of Business	4	75	10
R29848.	Total Amount of Business Debt	1	55	8
R29851.	How Much Does R Owe on Vehicle	0	56	17
R29852.	Amount Vehicle Sells for Today	11	293	11
R29854.	Market Value of Other Items	5	151	25
R29856.	Total Amount R Owes	1	73	13

Table 3.9 and 3.10 examine the drug and alcohol use supplements in the 1994 survey. In these CAPI modules, there are no invalid skips. Interestingly, there are extremely low refusal and don't know rates within the "Alcohol" section (Table 3.9). The question with the highest refusals (nine respondents) asks if the individual had a drink since the 1989 interview. The typical question in the "Alcohol" section received only two refusals. Don't know rates are also low. The maximum number of don't knows at nine occurs in R49803., which asks if the respondent needs to drink more alcohol now in order to get drunk. On average, the "Alcohol" section records only 1.5 don't knows per question.

Table 3.9. Amount of missing data per question in Alcohol Use section in 1994 survey
Reference Number	Variable Title	Don't Know	Refusal
R49791.	R Had Drink of Alcohol since 1989	3	9
R49792.	Had Alcoholic Beverage in Last 30	0	5
R49793.	Times Had 6/More Drinks Last	0	1
R49794.	How Many of Last 30 Days Drank A	6	2
R49795.	No. of Drinks on Avg. Day When R	8	3
R49803.	Need More to Get Drunk Than Before	9	0
R49808.	Arrested, in Police Trouble	0	3
R49809.	Drink More Than Before	4	3

These low numbers of refusals and don't knows are not seen in Table 3.10, which examines the "Drug Use" section. On average, the typical question in this supplement elicited 23 don't knows and 48 refusals. Readers should understand that this supplement was generally filled in directly by the respondent, not by the interviewer. To provide respondents with practice using a computer, the questionnaire asked them two practice questions not related to drug use. Refusal rates are even high for these two test questions, which ask how many more children the respondent expects to have and what type of entertainment, such as movies, concerts, or plays, the respondent went to last year.

The highest number of refusals (119) occurs in R50532., which asks the age the respondent first used marijuana. The second largest number of refusals occurs in a similar question, R50536., which asks the age of first cocaine use. These same questions have very high don't know responses (113 marijuana and 48 cocaine). One other question with a very high don't know rate is R50525., which asks if the respondent ever smoked cigarettes daily. Almost 80 individuals did not know the answer to this question. Given that the question wording is straightforward, it is likely a number of respondents are using don't know as a polite way of refusing to answer the question.

Table 3.10. Amount of missing data per question in Drug Use section in 1994 survey
Reference Number	Variable Title	Don't Know	Refusal
R50524.	R Smoked at Least 100 Cigrtts in Life?	24	38
R50525.	R Ever Smoked Daily?	79	49
R50526.	Age When R 1^st Started Smoking Daily?	33	12
R50531.	Total Occasion R Use Marijuana	33	89
R50532.	Age 1^st Time Used Marijuana	113	119
R50533.	Most Recent Time Used Marijuana	35	89
R50535.	How Many Occasions Used Cocaine	19	86
R50536.	Age 1^st Time Used Cocaine	48	103
R50537.	Most Recent Time Used Cocaine	15	78
R50539.	How Many Occasions Used Crack	15	77
R50540.	Age 1^st Time Used Crack	33	82
R50541.	Most Recent Time Used Crack	16	74
R50553.	R Used Heroin w/o Doctor's Instr	9	53

The top ten questions show that a large number of respondents (ranging from 119 to 181 respondents, depending on the question) have difficulty with questions asking them about their spouse's rate and amount of pay, hours worked and weeks worked. In addition, questions which ask details about a spouse's previous marriage are also quite difficult for many respondents to answer.

Table 3.11. Amount of missing data per question in Marital History section in 1998 survey
Reference Number	Variable Title	Don't Know	Refusal
R58067.	Rate of Pay for Spouse Main Job (Time Unit)	181	49
R58204.	Age of Spouse at 1^st Marriage	213	2
R58125.	Spouse's Weekly Earnings at Main Job	159	29
R58068.	Spouse Receive Overtime at Main Job	151	26
R58127.	Estimate Spouse's Weekly Earning Main Job	149	26
R58178.	House Spouse Works Per Week Usually	170	1
R58177.	Number of Weeks Worked by Spouse in Last Year	140	24
R58179.	Number Weeks Not Working by Spouse Last Year	130	24
R58176.	Spouse Hourly Rate of Pay	119	28
R58208.	Duration of Spouse's Previous Marriage?	109	16

Table 3.12 examines the top questions with missing data problems from the health section in 2004. In this table, reference numbers starting with "R" are for questions asked of all respondents in the survey, while reference numbers starting with "H" represent questions in the "over 40 health module." This module was designed to provide researchers with more information about the health of the respondent when they turned 40 years old and is asked of respondents in the first interview after they turn 40.

While other data from the survey show that many people know if they are covered by health insurance, Table 3.12 reveals that many do not know details about this coverage. For example, one question with a large number of don't knows is R83036., which asks if the respondent's health insurance plan is an HMO, a preferred provider plan (PPO) or a network of affiliated doctors. This question had 428 missing responses out of 6,175 total responses (a 7% missing response rate). Other questions with high don't know rates ask if the respondent's children are covered by health insurance. The health question with the highest refusal rate asks the respondent how much they weigh, with 114 people refusing to divulge the number. Finally, in the 40+ health module a number of NLSY79 respondents have difficulty answering questions about the health and life status of their biological father. This is not surprising given a small but significant number of respondents stated in the past that they have never met their biological father.

Table 3.12. Amount of missing data per question in Health section in 2004 survey
Reference Number	Variable Title	Invalid	Don't Know	Refusal
R83036.	Primary Insurance Plan HMO, Network, PPO	0	426	2
R83037.	Is Primary Plan a PPO?	0	388	2
R83070.	Children Have Health/Hospitalization Plan?	0	328	15
R83038.	R's Primary Plan Need Authorization?	0	301	0
H00015.	Date Most Recent General Physical Exam	0	189	0
R82983.	How Much Does R Weigh?	0	50	114
H00014.	Ever Had A General Physical Exam?	0	147	2
H00017.	Cause Of Biological Dads Death	0	133	10
H00019.	Bio Dad Have Major Health Problems?	0	134	8
R82982.	Since What Date R Had This Health Limit	0	120	0
R82992.	Length Light Moderate Activities 10 Min	0	105	5
H00047.	Date Hypertension Diagnosed	0	91	0
H00016.	Is R's Biological Dad Living?	0	83	4
R82989.	Frequency of Light Mod Exercise 10 > Min	0	75	6
H00018.	Age Of Biological Dad At Death	0	68	1
H02445.	Date Most Recent Visit to Health Professional	0	52	11
H00012.	R Ever Visit Health Care Professional?	0	58	0
R83042.	Spouse Have Health/Hospital Plan	0	32	24
R83048.	Spouse Employer Pay All Health Plan Cost?	0	49	2
Note: Reference numbers that begin with the letter H are variables that are combined from different years of the over-40 health module. Researchers wanting to see the results from just the 2004 survey should use variable H00002.00, which is titled "Source Year for 40+ Health Module Data." Use this variable to select just those cases which answered the questions in 2004.

References

Mott, Frank L. "Patterning of Child Assessment Completion Rates in the NLSY: 1986-1996." CHRR, The Ohio State University, 1998.

Mott, Frank L. "Evaluation of Fertility Data and Preliminary Analytical Results from the 1983 (5^th round) Survey of the National Longitudinal Survey of Work Experience of Youth." CHRR, The Ohio State University, 1985.

Mott, Frank L. "The Patterning of Female Teenage Sexual Behaviors and Attitudes." CHRR, The Ohio State University, 1994.

Mott, Frank L. "Fertility-Related Data in the 1982 National Longitudinal Surveys of Work Experience of Youth: An Evaluation of Data Quality and Some Preliminary Analytical Results." CHRR, The Ohio State University, 1983.

Olsen, Randall J. "The Effects of Computer Assisted Interviewing on Data Quality." CHRR, The Ohio State University, 1992.