Skip to main content

NLSY79

Sexual Activity & Composition

Sexual activity

In 1982, 1983, 1984, and 1985 NLSY79 respondents were asked about their sexual activity. Once the respondent answered the questions, he or she was not asked the series in subsequent surveys. Out of a total of 12,686 respondents, 310 did not answer any of the questions in these years.

The first activity question asked respondents whether they had ever had sexual intercourse. This question was skipped if the respondent had previously reported either being a parent or being pregnant. Out of the 11,797 respondents, 889 or 7.5 percent stated that they had never had sexual intercourse. The youngest respondents answered this question when they were 18. The validity of these items is evaluated in Mott (1985). These data should be used cautiously because inconsistencies in reports of first intercourse with subsequent fertility events can sometimes occur. Respondents who had ever engaged in sex were then asked at what age they first had intercourse. Table 1 includes frequencies broken down by race and sex, as well as a total for the whole cohort.

Table 1. Reported age at which respondents first sexual activity by race and sex
Age Male Female Total
Hispanic
or Latino
Black Non-Black/ Non-Hispanic Hispanic
or Latino
Black Non-Black/ Non-Hispanic Number Percent Cumulative Percent
<10 16 75 29 0 2 7 129 1.1 1.1
10 3 50 17 0 3 8 81 0.7 1.8
11 12 58 19 0 4 7 100 0.9 2.7
12 30 140 91 1 14 14 290 2.5 5.2
13 38 158 160 10 35 50 451 3.9 9.2
14 87 207 249 32 73 132 780 6.8 16.0
15 139 273 415 71 181 275 1354 11.8 27.8
16 178 276 742 125 313 584 2218 19.4 47.2
17 177 136 621 143 293 625 1995 17.4 64.6
18 117 90 527 145 282 711 1872 16.3 81.0
19 52 24 216 136 118 367 913 8.0 88.9
20 25 11 155 63 74 233 561 4.9 93.8
21 12 11 72 60 32 140 327 2.9 96.7
22 8 3 56 31 14 79 191 1.7 98.4
23 4 0 22 9 4 49 88 0.8 99.1
24 0 1 14 3 3 26 47 0.4 99.5
25 3 2 10 6 3 9 33 0.3 99.8
26 3 0 3 5 1 3 15 0.1 100
27 1 0 1 0 0 3 5 0.0 100
All 905 1515 3419 840 1449 3322 11450 100 100

The numbers are based on responses from the 1983, 1984, and 1985 NLSY79 surveys (R09880., R10207., R13122., R13796., R16916., and R17600.). Universe includes only respondents ever reporting sexual intercourse.

Questions in this section also ask NLSY79 respondents if they have had sex in the past month. In 1983, respondents provided a yes/no answer. However, in 1984 and 1985 they were asked the number of times they had sex in the last month. The data contain some outlying entries; users should carefully decide how to interpret outlying data points in these distributions. Additionally, there is evidence of a significant number of youth who changed their "ever had sex" status from yes to no between 1983 and subsequent survey years.

In 1984, the survey also asked if respondents had taken a sex education course. If so, they were asked at what date they took the course. Respondents also stated whether the first course they took covered:

  • The female menstrual cycle
  • Contraception methods
  • Where to obtain contraception
  • Effects of contraception
  • Sexual diseases

To test reproductive knowledge, respondents were also asked (Table 2) when pregnancy is most likely to occur during the female monthly menstrual cycle. Answers in 1984 show a substantial portion of respondents did not choose the medically correct answer of about two weeks after the period. Approximately 25 percent of men and 12 percent of women stated they did not know the answer.

Table 2. Knowledge of when pregnancy most likely to occur
Answer

Males

Females

Right before Period

1283

1185

During Period

352

128

Right after Period

963

806

About 2 Weeks after Period

1441

2549

Anytime

496

597

Don't Know

1480

725

Source: The data are from the 1984 survey (males R13150. and females R13826.).

Contraception

Since 1982, NLSY79 respondents have regularly been asked about their use of contraception. These contraception questions appear in the "Fertility" section of the questionnaire. The questions were asked annually from 1982 to 1986. After 1986 the questions were asked only in even years corresponding with the child supplements. After 1994, the surveys became biennial and the questions were included in each survey.

The respondents are asked if they use birth control. If they state that they do, the second question asks the frequency of these methods using a three-point scale of "always," "sometimes," or "almost never." Finally, respondents are asked to state all the methods they have used in the past month from the following list:

  • Pill
  • Condom, rubber
  • Foam
  • Jelly or cream alone
  • Suppository or insert
  • Diaphragm with or without jelly or cream
  • Douching after intercourse
  • IUD, coil, loop
  • Operation-Female sterilization, tubes tied
  • Operation-Male sterilization, vasectomy
  • Natural family planning, safe period by temperature and cervical mucus test
  • Rhythm or safe period by calendar
  • Withdrawal/pulling out
  • Contraceptive sponge
  • Other method

Over time the list has been expanded and now includes four additional choices:

  • Abstinence
  • Norplant
  • Cervical cap
  • Depo-Provera
  • Hysterectomy

Beginning in 2002 a specific individual question was added asking respondents if they have been surgically sterilized. If they have been sterilized they will not be asked this series of birth control questions again. Previously, sterilization was an option among a list of birth control options (see list above), but it was not a separate question.

In most years, at the end of this three-question series, the interviewer fills in a question that states if anyone else was present during this part of the questioning. Small children under three years old are not counted by interviewers when filling in their response.

In addition to these questions, female respondents who are pregnant or who have given birth since the last survey year are asked about their contraception methods before conceiving. Respondents are asked if before becoming pregnant they used one of the birth control methods listed, or if they stopped all birth control methods before becoming pregnant. Finally, they were asked if the reason they stopped using birth control was because they wanted to become pregnant.

Reference

Mott, Frank L. "Evaluation of Fertility Data and Preliminary Analytical Results from the 1983 (Round 5) Survey of the National Longitudinal Surveys of Work Experience of Youth." Columbus, OH: CHRR, The Ohio State University, 1985.

Comparison to Other NLS Surveys NLSY79 young adults have been asked about both sex and birth control methods in the "Confidential" section of each survey. In NLSY97 rounds 1 and 2, questions on sexual activity were asked of respondents who were at least 14 years old; all respondents were eligible in later rounds. These respondents were first asked if they had ever had sexual intercourse and if they had intercourse since the last interview (later rounds). If a respondent reported ever having sexual intercourse, questions were asked about the first occurrence, such as the respondent's age and whether the respondent and the partner used birth control. For more precise details about the content of each survey, consult the appropriate cohort's User's Guide using the tabs above for more information.
Survey Instruments & Documentation The questions on sexual activity and contraception can be found in the "Fertility" section of each questionnaire. In 1983, the fertility section was fielded as a separate supplement for those respondents who were not interviewed in 1982.
Areas of Interest
  • Birth Record
  • Contraceptive Use and Efficacy
  • Fertility and Relationship History/Created

Childcare

Data on childcare have been collected within various topical sections of the NLSY79 questionnaires. The main data collection on types of childcare arrangements, discussed in detail below, occurred during the 1982-86, 1988, 1992, and 1994-2014 surveys. Starting with the 2016 survey, childcare data were no longer gathered, due to the age of the sample.

A range of both retrospective and cross-sectional (past four weeks) childcare information is available in the NLSY79. The mother-report child care sections from the NLSY79 provide the types of current childcare arrangements used for each child in the household, a retrospective of childcare experiences during the first three years of life for all children (of at least 1 year of age) born to the respondent, and in some survey years, the overall family expenditure for current care.

Retrospective

Retrospective data were gathered during the 1986, 1988, 1992 and 1994-2014 survey years for up to three childcare arrangements used by NLSY79 mothers for each of their children during the first three years of life. For each year, mothers were asked about whether the child lived with them and, if so, whether that child was in child care. If so, questions about specific child care arrangements during that year of the child's life were asked about. (While childcare information was not collected in the 1990 NLSY79 survey round, it was updated in 1992.)

The data about childcare in the first three years of life have been concatenated into cross-round variables that allow users to study the use of child care during each child's first 3 years of life. For each of the first three years of life for each child there is a variable indicating whether or not that child was in child care during their first year or why this information is unavailable (i.e., the child did not live with the mother, the child or the mother was deceased, the mother was part of a dropped sample type, etc.) If the child was in childcare for a given year, there are variables for up to 3 types of arrangements and the number of months spent in each arrangement.

Cross-sectional

Current childcare information was first asked in the 1982 survey about the youngest and next youngest child living in the mother’s household. In the 1983, 1984, and 1985 NLSY79 surveys, questions were asked about current child care arrangements used in the past four weeks for the youngest child by parents who were either employed, in school, or in training at the survey date. Location and type of primary and secondary care and hours of use were initially asked in 1983, and these questions were expanded in 1984 to include nature of payment and grandmother care. Location, type, payment, detail on group arrangements, and hypothetical care are available for 1985. In both 1984 and 1985, respondents who are not currently employed but who have an employed spouse report limited information on location and type of care.

In both 1986 and 1988, NLSY79 mothers are asked a limited number of questions about current child care arrangements for all of their children living in the household.

The retrospective information collected in 1986 and the current child care information collected between 1984 and 1988 relate to different universes of children and utilize different child care definitions. Additional child care information was also collected in the 1982 and 1983 NLSY79 surveys.

There are marked differences in the universes of respondents, reference children, kinds of questions asked, and reference periods across survey years (see Table 1 below). Universes of respondents vary widely both within and across survey years, from respondents--both male and female--engaged in some educational or labor market pursuit, to all women with a child in the household, to not-employed respondents with an employed spouse. The focus during the initial survey years was on collecting information on childcare arrangements used over the past month for only the youngest child(ren) in the household.

  • A limited number of childcare questions, fielded within the 1987, 1989, and 1990 fertility series, obtained information from select universes on:
    • Whether respondents made use of regular childcare arrangement
    • Whether respondents encountered childcare problems that affected their employment
    • The extent of responsibility for childcare assumed by female respondents during recent pregnancies
  • Out-of-the-labor-force respondents could specify within various sections of the yearly questionnaires, such as the 1980-98 "CPS or Current Labor Force Status" or 1979-2014, "Periods not Working" sections, that lack of available childcare or family responsibilities was a reason they were not looking for work or did not want a job now

In addition, information on whether childcare services were provided as a supportive service through Federally funded government employment and training programs was gathered during the 1979-87 surveys. For those respondents residing in households with children under age 14, the special 1981 "Household Chore and Childcare" time use survey provides information on the amount of responsibility for providing childcare at home and actual time spent on a given day in specific childcare tasks. The Fringe Benefits section of this guide details the collection of information on childcare as a benefit made available by employers of NLSY79 respondents. Finally, a special experimental Childcare Supplement, administered to 347 NLSY79 mothers who were interviewed during the first month of the 1989 fielding, collected a wide range of information, including data on every childcare arrangement used for at least 10 hours per week since the date of last interview. Researchers interested in the 1989 special childcare supplement data should contact NLS User Services. A report evaluating the quality of various childcare data items is available from CHRR (Mott and Baker 1992).

Types and locations of childcare arrangements

Data on types and locations of childcare arrangements are available for the 1982-86, 1988, 1992, and 1994-2014 survey years. Typical categories of childcare arrangements include self-care; care by relatives (the child's other parent, a step-parent, siblings, or grandparents); care by nonrelatives; and care provided by institutions such as day care centers, nurseries, or preschools. Information on care provided by individuals usually differentiates between that occurring in the child's home and that occurring in another private home. Information was gathered during select survey years (1983-86 and 1988) on both primary and secondary childcare arrangements.

Additional characteristics of childcare arrangements

In addition, supplementary information has been collected during certain survey years on the number of hours that childcare services were required/provided, the nature of the payments (cash or noncash), total cost per child/per provider, and the effect of available childcare services on such activities as employment, job search, and training.

Table 1. Child Care Questions: 1982-2014 NLSY79 Surveys Including the Special 1989 Child Care Supplement

Year

Universe

Universe Description Nature of Questions Time Reference Reference Children

1982

1

All respondents in school, in training, employed, or on active duty with children in the household Type and location of care; total weekly expenditures; days/hours in child care Usually Youngest and next youngest child

2

Unemployed respondents (in school or training) with children in the household Hypothetical type and location of child care if employed Future Youngest and next youngest child

3

All respondents in school, in training, employed, or on active duty with children in the household Would availability of child care affect hours spent/ attendance at schooling, training, employment or job search Future  

1983

1

All respondents in school, in training, employed or on active duty with children in the household Types and location of primary and secondary care; hours; nature of payments Last 4 weeks Youngest child

2

Women in school, training, employed or on active duty with children in the household Would availability of child care affect hours spent/ attendance at schooling, training, employment or job search Future  

1984

1

All respondents in school, in training, employed or on active duty with children in the household Type, location, and hours of primary & secondary care; nature of payments; use of grandmother Last 4 weeks Youngest child in household

2

Not employed respondents with employed spouse Type and location of primary care Last 4 weeks Youngest child in household

3

Women with a child in the household Would availability of child care affect hours spent/attendance at school, training, employment or job search Future  

1985

1

All respondents in school, in training, employed, or on active duty with children in the household Type, location, and hours of primary & secondary care; nature of payments; total expenditures; retrospective on current arrangement; detail on primary group care Last 4 weeks Child in household who was youngest active in household in 1984

2

Respondents employed or in active forces with children in the household Hypothetical additional cost of primary care arrangement for respondents wanting to work more hours Future Youngest and next youngest child

3

Respondents employed or in active forces with children in the household Type, location, and hours of primary & secondary care; nature of payments; total expenditures; retrospective on current arrangement; detail on primary group care Last 4 weeks Youngest child in household - no care data collected in 1984

4

Respondents employed or in active forces with children in the household Type and location of primary & secondary care Last 4 weeks Youngest child in household

5

Respondents not in school, in training, or unemployed with employed spouse Type and location of primary care and shift worked by spouse Last 4 weeks Youngest child in household

1986

1

All women with children in the household Type, location and hours of primary & secondary care; detail on primary group care; nature of payment; expenditures for all care Last 4 weeks All children in the household

2

All mothers Type and location of up to 5 arrangements at each age First 3 years of life All biological children at least one year old who resided with mother during most of 1st, 2nd, and/or 3rd years of life

1987

1

All respondents with children in the household Use of a regular child care arrangement Last 4 weeks Any (not individually) children in the household

1988

1

All women with a biological child in the household Location, type, and hours of primary & secondary care; detail on primary group care; nature of payment; expenditures for all care Last 4 weeks Any (not individually) children in the household

2

All mothers Location and type of up to 3 arrangements at each age and extent of usage First 3 years of life All biological children at least one year old who resided with mother during most of 1st, 2nd, and/or 3rd years of life

3

Female respondents Extent of responsibility for child care during recent pregnancies During pregnancy Any existing already during the pregnancy

1989

1

Respondents who were employed or on active duty in past four weeks with children under age 14 in the household Problems with regular child care arrangements that affected respondent's work Last 4 weeks All children under age 14 in household

1989 Supplement

1

Women with at least one child under age 14 in the household Types and location of all child care arrangements lasting at least one hour; hours; characteristics of care giver; month/year began; reason needed child care Last week Any child

2

Women with more than one child under age 14 in the household Care giver used; hours; costs Last week Youngest and next youngest child

3

Women with at least one child under age 14 in the household (unemployed only) Main reason not working; if child care available, would work; caretaker usually use when go out    

4

Women with at least one child under age 14 in the household (employed only) Impact of sick child on work hours; type of child care for sick child; impact of problems with regular child care on job, training, school; types of child care problems; proximity of relatives Last month/last 12 months  

5

Women with at least one child under age 14 in the household (except with children ages 5 and older) Usual child care arrangements for children after school Regular school year Youngest and next youngest child

6

Women with at least one child under age 14 in the household Characteristics of up to 5 types of child care arrangements used for at least 10 hours in any week; dates and reason started/stopped; costs Date of last interview  

1990

1

Female respondents Extent of responsibility for child care during recent pregnancies During pregnancy Any existing already during the pregnancy

1992, 1994-2014

1

All mothers Location and type of up to 3 arrangements at each age and extent of usage First 3 years of life All biological children at least 1 year old who lived with mother during most of first three years of life

References

Mott, Frank L. and Baker, Paula. "Evaluation of the 1989 Child Care Supplement in the National Longitudinal Surveys of Youth." Discussion Paper 92-6. Washington, DC: U.S. Department of Labor, Bureau of Labor Statistics.

Comparison to Other NLS Surveys Childcare history in the first three years is on the NLSY79 Child Data File. The NLSY97 Parent Questionnaire collected information from one of the youth's biological parents or another household adult on background information including childcare. An expanded set of NLSY97 childcare questions was asked in select rounds. In some years the Young Women and Mature Women answered questions concerning types and locations of childcare arrangements and the extent of childcare responsibility. For more precise details about the content of each survey, consult the appropriate cohort's User's Guide using the tabs above for more information.
Survey Instruments and Documentation Childcare questions are located in the "Childcare" sections of the questionnaires: Section 16 (1982), Section 14 (1983), Section 11 (1984-87), and Section 10 (1988, 1992, and 1994-2014). The limited set of childcare questions asked during 1987 and 1989 can be found within the Section 9 "Fertility" series. The "Time Use - Household Chores & Childcare" questions are located in Section 19 of the 1981 questionnaire. Users interested in childcare services provided within government training programs should reference the "Training" section of this guide.
Areas of Interest The 1982-2014 childcare questions can be found within the "Childcare" area of interest. The 1980-1998 CPS series have been placed in the "CPS" areas of interest; "Periods Not Working within Job Tenure" includes the variables relating to childcare as a reason for not working for an employer. Variables relating to childcare services provided to respondents holding government jobs or participating in government training programs are located within the "Government Jobs" and "Government Training" areas of interest. The special set of 1981 time use questions relating to childcare can be found in "Time Use."

Marital Status, Transitions & Attitudes

Created variables

  • MARITAL STATUS: These variables contain the respondent's marital status at each interview date.
  • MARITAL STATUS COLLAPSED: These variables contain the respondent's marital status at each interview date, collapsed into three primary categories ("never married," "married spouse present," or "other").
  • AGE 1ST/2ND/ETC. MARRIAGE BEGAN/ENDED: These XRND variables contain constructed marital transition data out to the month and year for all marriages.
  • RELATIONSHIP HISTORY (SPOUSE/PARTNER ON HOUSEHOLD ROSTER, ID NUMBER OF SPOUSE/PARTNER): These XRND variables contain a sequential identification number for a spouse/partner if one is listed on the household roster, and a code reflecting whether the person listed is a spouse or a partner.
  • NUMSPPTR##: The number of the spouse/partners reported as of most recent interview date.
  • RELSPPTR##: Relationship code for spouse/partner at most recent interview date.

Note: The MARSTAT variables, NUMSPPTR, and RELSPPTR have been created for opposite-sex individuals only. See information on Same-Sex Partners below.

Important Information: Using marital status, transitions and attitudes data

Collection of information on partners as distinguished from spouses has varied over time in the NLSY79. To some extent, the term "Partner" is used more interchangeably in sections of the questionnaire such as "Fertility" and "Childcare" than in sections such as "Marriage" and "Income and Assets," where items may specifically include or exclude partners. Do not to assume the interchangeability of terms and carefully check the wording of all the questionnaire items of this type that are of interest.

A detailed memo, "Inconsistencies in the NLSY79 Marital History Data" (Haurin 1988), identifies those respondents whose marital histories through the 1986 survey contained inconsistencies; it also summarizes the edits made, if any, to each case during preparation of the 1986 Supplemental Fertility File (area of interest "Fertility and Relationship History/Created"). Researchers using the constructed marriage dates should be aware that there is a very small percentage of people who have ended their third marriage or have entered a subsequent one. The use of information from the Information Sheet to designate respondents' current marital status is not encouraged because this information is dated. However, Information Sheet data do permit users to detect inconsistencies reported over time.

The program statements used to create marital status for the 1992 survey years are listed in Figure 1. Creation procedures for other years after 1988 are almost identical to the procedure presented here, except that different reference numbers are used in the 1989-1996 PL/1 code and the code was converted to SPSS in 1998 and is now in SAS. Prior to 1988, interview checks verifying marital status at the previous interview did not exist in the questionnaire; marital status was created based on the last actual stated change rather than an interviewer check or verification of the status last reported. General information regarding creation of the Supplemental Fertility File (area of interest "Fertility and Relationship History/Created"), including marital transition data, age at first marriage, and months between first marriage and first birth, is found in Appendix 5: Supplemental Fertility Files in the NLSY79 Codebook Supplement.

Marital status

Marital status information for NLSY79 respondents is available from:

  • responses to questions fielded during the annual or biennial surveys
  • two sets of created variables specifying marital status as of the interview date
  • an item on the marital status of each respondent as of the previous interview derived from the interviewing aid called the Information Sheet
Table 1. Year-by-year variations in marital status data collection

Survey Years

Marital Status data collected
1978 (household screener) The marital status of household members is available from the 1978 household screening only, for example 'Household Screener: Family Member #1 - Marital Status.' Categories include "presently married," "widowed," "divorced," "separated," and "never married/annulled."
1979 The marital status of each respondent, that is, whether he or she was married, widowed, divorced, separated, or never married, was collected during the 1979 survey and is available as a single variable, 'Marital Status.'
1980-1987 Interviews collected change in marital status information. Prior to 1982, presence of a partner was indicated by an interview check coded "yes" if the respondent lived with one unrelated adult of the opposite sex ('Int Check - Does R Live With Opposite Sex Adult as a Partner'). Since 1987, only one version of the Household Interview Form has been used; all respondents not living with a spouse have been asked about opposite-sex partners. The partner variable originating from the household interview is titled 'Currently Living as Partner with Opposite Sex Adult'.
Beginning in 1988 'Current Marital Status' interview checks are included in the questionnaire to verify separately the marital status of respondents who report a change in status since the date of last interview and respondents who do not.

Marital history/transitions

A series of edited Supplemental Fertility File variables (area of interest "Fertility and Relationship History/Created") that reflects the beginning and ending dates of marriages was constructed for all marriages through the present. These variables include the month and year the respondent began a marriage and the month and year a marriage ended, for example, 'Month Began 1st Marriage.' The unedited data items include the month and year of each reported marital transition and type of transition. Users should note that a separate category for the transition to "reunited" was not added until the 1981 survey. Evaluations of the marital history data for the NLSY79 can be found in Haurin (1988, 1994). A related variable, also located in the "Fertility and Relationship History/Created" area of interest, provides the number of months between the respondent's first marriage and the birth of the respondent's first child.

Cohabitation

The following cohabitation information is available from the 1990 and 1992-2016 surveys:

  • the month and year the respondent and his or her opposite-sex partner began living together
  • whether the respondent lived with his or her spouse before marriage
  • the month and year the respondent and his or her spouse began living together
  • whether the respondent and his or her spouse lived together continuously until marriage

A household member's relationship to the respondent may be listed as "partner" in the Household Record. This is true regardless of whether the "partner" is of the same or opposite sex as the respondent. However, through 1996 only opposite-sex partners are referenced during the interview for questions relating to household, income, and dating/relationship. After 1998 information on all partners is collected.

Beginning in 2002, respondents were asked detailed questions about their cohabitation experiences during any unmarried spell lasting three months or longer. First, respondents were asked about marital status and changes in marital status. If any unmarried gap lasting three months or more was identified, respondents were asked if they ever lived with anyone as a domestic partner for three months or more since the last interview. If respondents answered affirmatively, they were then asked specifically about each unmarried gap of three months or greater.

For each unmarried gap, respondents were asked the following series of questions: First, respondents were reminded of the marital status they reported during the gap and asked if they lived with someone as a domestic partner during that period. If a partner was reported, respondents provided information on the date on which they began cohabiting, whether they cohabited with this partner continuously until the current interview date and, if not, the date they stopped cohabiting with this partner during the specific gap. After these questions were answered, the dates were compared. If the unaccounted-for remainder of the specific gap was three months or more, the respondent was asked if they lived with any other partner during that gap for three months or more. If the answer was "yes," the questions were repeated for the next partner during that gap. This entire series of questions was then repeated for each eligible three-month-plus unmarried gap.

Same-sex partners

Beginning in 2000, the question titled “IS THERE A SAME SEX PARTNER IN THE HOUSEHOLD?” (Q9-64FB) indicates a same-sex partner reported.

Prior to 2000, the questions on NLSY79 partners were limited to an opposite sex adult; however, it may be possible to find some same-sex partners prior to 2000 by looking at the Household Record variables. To do so, users can identify the household member who is the spouse or partner by using the relationship variables HHI_FINAL_RELCODE, then identify the sex for that same member# using HHI_FINAL_GENCODE.

NLSY79 respondents have never been asked specifically to identify their sexual orientation as straight or LGBT.

Spousal characteristics

Information collected as part of the Household Record is available for spouses and partners at each survey point if they are listed as members of the household. Users first need to identify the appropriate relationship to the respondent (that is, code "1" for spouse; code "33" for partner) through variables that are specific for this purpose. Typically, information on the age, relationship to respondent, highest grade of schooling completed, and employment in the past calendar year is collected during each survey. See Household Composition for information on specific characteristics. In addition to information available from the Household Record, additional information has been collected on the spouse as part of the Marital History section of the questionnaire. Users should consult each relevant survey year's questionnaire or codebook to ascertain the specific items collected in the given year. Information on income has been selectively collected for the respondent's spouse or partner. Users should check the Income section for further details.

Scroll right to view additional table columns.

Table 2. Spouse specific questions by survey year (spouse/partner collected in same question series beginning in 1994)

Variable

1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1996 1998 2000 2002 2004 2006 2008 2010 2012 2014 2016 2018 2020 2022

Spouse present on household enumeration; if present, relationship to youth confirmed, highest grade completed, age, sex, work in last calendar year

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

Dates/types of marital status change (marriage, separation, divorce, widowed)

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

Is spouse a new spouse

                              * * * * * * * * * * * * * * *

Past spouse's race

                                            *   * * * * * *

Month/year of birth of spouse

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

Religious affiliation (raised in and present) and current attendance

      *                             * * * * * * * * * * * *

Cohabit with spouse prior to marriage, date began living together, live together continuously before marriage

                      * * * * * * * * * * * * * * * * * * *

Current/new spouse's prior marriage history (# prior marriages, age at 1st marriage, duration of most recent prior marriage, how most recent prior marriage ended) (1982 - # marriages including to R only)

      *                           * * * * * * * * * * * * *

Spouse enrolled in regular school since last interview

  * * *                                                    

Spouse occupation

* * * * * * * * * * * * * * * * * * * * * * * * * * * *    

Rate of pay

* * * * * * * * * * * * * * * * * * * * * * * * * * * *    

Shift worked by spouse

      *       * * * * * * * * * * * * * * * * * * *        

Labor force activity survey week

                  * * * * * *                              

Weeks not working past calendar year

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

Weeks looking for work/on layoff past calendar year

      * * * * * * * * * * * *                              

Number of weeks worked, number of hours usually worked in past calendar year

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

Source of health insurance coverage

                    * *   * * * * * * * * * * * * * * * * *

Health limitations of spouse/effect on r's work

      *                                                    

Contraceptive use r/spouse/partner

      *   * * *   *   *   *   * * * * * * * * * * * * * * *

Pregnancy wantedness (ask of females about husbands/partners)

      * * *   *   *   *   *   * * * * * * * * * * * * * * *

Partner pregnant (asked of males about partners)

          * * *   *   *       *                            

Child care when spouse working

          * *                                              

Relationship quality (mothers in 1988, females in other years)

                  *       * * * * * * * * * * * * * *      

Unemployment compensation (past calendar year through 1992, event history since last interview after 1992)

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

Income -military (in 2018 retirement income from military)

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

Income (wages and earnings)

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

Income (business/farm)

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

Educational benefits received

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

Child support (questions vary through survey rounds)

                            * * * * * * * * * * * * * * * *

Define retirement (choice = when spouse stops working/reduces hours) (2006 experimental group)

                                          * * * * *        

Immigration - was R the spouse of a principle entrant; status of principle entrant visa, relationship to sponsor

                      *                                    

Sequential id # of spouse/partner, identification as spouse/partner (created variables) Note 2.1

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

Note 2.1: See the Relationship History Variables section of Appendix 5 in the NLYS79 Codebook Supplement for more information.

Table 3. Respondent/spouse (not separable) variables by survey years

Variable

1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1996 1998 2000 2002 2004 2006 2008 2010 2012 2014 2016 2018 2020 2022

Money/time donated to various charitable entities

                                              * * *        

Lose any time at work in past 4 weeks due to no child care

              *   *                                        

AFDC/TANF, Food Stamps, SSI (past calendar year through 1992, even history since last interview after 1992)(SSI & SSDI differentiated in 2018)

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

Targeted cash or non-cash assistance (event history from beginning of survey year)

                                  *                        

Targeted benefits from public assistance program (gas vouchers, child care and so forth)

                                    *                      

Receive veap/educational benefits from military

* * * * * * * * * * * * * * * * * * * * * * * * * * *      

Value, amount owed on vehicles owned

* * * * * * * * * * * * * * * * * * *   *   *   *   *   *  

Market value residential property

            * * * * * *   * * * * * *   *   *   *   *   * *

Amount child support paid past calendar year

      * * * * * * *           * * * * * * * * * * * *      

Alimony paid past calendar year

      * * * * * * *                                        

Possession of various assets

            * * * * * *   * * * *   *   *   *   *   *   *  

Assets market value/debt

* * * * * * * * * * * *   * * * *   *   *   *   *   *   *  

Amount spent on food, other than food stamps

                      * * * * *                            

Rights to/income or property from inheritance or trust (questions vary)

                  * * * * * * * * * * * * * * * * * * * * *

Attend meetings organized by r/spouse employer (2006 experimental group)

                                          * * * * *        
Table 4. Marriage expectations and dating behavior variables by survey years

Variable

1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1996 1998 2000 2002 2004 2006 2008 2010 2012 2014 2016 2018 2020 2022

Age R expects to marry

*                                                          

Dating behaviors and attitudes (unmarried females) (attitudes oldest child)

                  *       *   * * * * * * * * * * *        

Likelihood of marriage in future (not currently married)

                  *       *   * * * * * * * * * * *        

Marital attitudes and expectations

A series of relationship satisfaction questions were asked during the 1988, 1992, and 1994-2016 interviews of those mothers living with a spouse or opposite-sex partner. These questions dealt with both positive and negative interactions in the relationship, for example, 'Frequency R and Husband/Partner Calmly Discuss Something' and 'Frequency R and Husband/Partner Argue About - Money.' In addition, mothers without a spouse or partner were asked for information on 'Frequency R Goes Out on Dates,' whether the 'Oldest Child Encourage(s) R's Dating,' and 'Likelihood of Marriage in the Future.' In 1979, all never married respondents were asked for information on 'Age Expects to Marry.'

Environmental characteristic data

For the 1979-1982 interview years, the restricted-use Geocode file includes marriage and divorce rates and percent of families with female heads of household for the county and SMSA in which the respondent resided. These statistics are taken from the 1972 and 1977 County & City Data Books. From 1983 until 1998, marriage and divorce rates and the number of families with a female head are included for county of residence only, based on statistics from the 1983 and 1988 County & City Data Books. The 2000 release includes the same information based on statistics from the 1988 and 1994 County & City Data Books.

Figure 1. Expanded and collapsed marital status: NLSY79 1992

/* EXPANDED & COLLAPSED MARITAL STATUS 1992 */
MARST_EXPAND=-4;
MARST_COLLAP=-4;
IF WEIGHT92=0 THEN DO;
MARST_EXPAND=-5;
MARST_COLLAP=-5;
END;
ELSE DO;
IF R(37017.)>=0 THEN MARST_EXPAND=R(37017.);
ELSE IF R(37030.)>0 THEN MARST_EXPAND=R(37030.);
IF MARST_EXPAND=0 THEN MARST_COLLAP=1;
ELSE IF MARST_EXPAND=1 & SPOU92=1 THEN MARST_COLLAP=2;
ELSE IF MARST_EXPAND>0 THEN MARST_COLLAP=3;
ELSE MARST_COLLAP=-3;
END;
MARST_COLLAP=R(40072.);
MARST_EXPAND=R(40073.);

/* SPOU92 INDICATES IF A SPOUSE IS LIVING IN THE HOUSEHOLD (0=NO, 1=YES). */
TO CREATE SPOU92:

  1. INITIALIZE SPOU92=0.

  2. SET SPOU92=-5 IF NOT INTERVIEWED IN 1992 (I.E., IF WEIGHT92=0).

SEARCH THROUGH THE HOUSEHOLD ENUMERATION AND COMPUTE SPOU92=1
IF THE RELATIONSHIP TO YOUTH IS CODED AS A SPOUSE (CODE=1). */

Survey Instruments Marital information can be found in the Marital History section of the questionnaire.
Areas of Interest
  • Marital/Relationship Quality & Dating
  • Marriage

Race, Ethnicity & Immigration

Important information: Using race, ethnicity and immigration data

  • Race and ethnicity variables for household members are based on information collected on the Household Screener; in which race and one ethnic background for each household member were recorded. 
  • The interviewer's identification of the respondent's race can be subjective. Each interview from 1979-1986 and 1988-1998 collected information on the interviewer's direct observation of the race of the respondent ("black," "white," or "other"). 
  • No special instructions are provided within the Question by Question Specifications as to how the interviewer is to code race.

Additional instructions for coding race, ethnic origin, and the racial/ethnic identifier variable can be found in the Household Screener and Interviewer's Reference Manual (1978) and in a NORC memo dated 10/4/78 available from NLS User Services.

The following race and ethnicity variables are available for NLSY79 respondents: 

  1. a racial/ethnic variable based on the sample identification code assigned by NORC
  2. a series of self-reported ethnic origin variables collected during the 1979 and 2002 surveys
  3. a set of interviewer identifications of the race of the respondent at the time of the interview
  4. racial/ethnic identification for current and past spouse/partners
  5. variables representing the respondent's immigration history and status collected during the 1990 survey
  6. a 1979 variable indicating whether a foreign language was spoken in the house during the respondent's childhood
  7. a series of variables recorded by the interviewer indicating whether the survey was administered in English or another language
  8. race/ethnicity variables for each family member listed during the 1978 screener
  9. race of the interviewer where available at each interview
  10. country of origin of the respondent's parents and the respondent's country of birth, available on the restricted Geocode release

Race and ethnic origin information is also available for each household member identified during the 1978 household screening. In 2002 respondents were asked to identify their race/ethnicity using questions that conformed to Federal government definitions. Of related interest is a series of immigration questions, fielded in 1990, that included the collection of information on country of citizenship at the time that foreign-born respondents entered the U.S.

Race/ethnicity

The variable 'Racial/Ethnic Cohort from Screener' (R02147.) designates the respondent as "Hispanic," "black," or "nonblack/non-Hispanic" and provides the basis for weighting NLSY79 data. This variable is collapsed from R01736., 'Sample Identification Code,' which includes such values as "supplemental male black" or "cross-sectional female Hispanic." This code was assigned by NORC to each respondent based on information gathered during the 1978 household screening. In the creation of the 'Sample Identification Code' and thus the 'Racial/Ethnic Cohort' variable, both race and ethnic origin information collected at the time of the 1978 household screening were used. Interviewers conducting the screening were instructed to:

  1. code race by observation into three categories, "nonblack/non-Hispanic," "black," or "other"
  2. inquire about the ethnicity of all household members age 14 or above
  3. but assign ethnicity, without asking, to those members who were under age 14

Coding procedures used by NORC to assign the "Hispanic," "black," and "nonblack/non-Hispanic" identifications to respondents included the following classification guidelines:

"Hispanics"
were those who self-identified as Hispanic, whose ethnicity screener code was 1-4
  1. Mexican American, Chicano, Mexican, Mexicano
  2. Cuban, Cubano
  3. Puerto Rican, Puertorriqueno, Boriccua
  4. Latino, Other Latin American, Hispano, or Spanish descent. Persons who did not self-identify as Hispanic but who met the following conditions were also classified as "Hispanic":
    • those who identified themselves in the ethnic origin categories that included Filipino (code 6) or Portuguese (code 13)
    • those whose householder or householder's spouse reported speaking Spanish at home as a child
    • those whose family surname is listed on the Census list of Spanish surnames
"Blacks"
  • Included those for whom race was coded "black" and ethnic origin was "non-Hispanic" or those whose ethnic origin was coded black, Negro, or Afro-American (code 5) regardless of race coding
"Nonblack/non-Hispanics"
  • Included those whose race was coded "white" or "other" and who did not identify themselves as either black or Hispanic in answer to the ethnicity question. Instructions to interviewers for coding race included coding in the "other" category those persons who were Japanese, Chinese, Vietnamese, Asian Indian, Native American, Korean, Eskimo, Pacific Islander, or of another race besides black or white.
  • Father's race was to be used to assign race for those of mixed descent except for some cases of those under age 14 of Spanish descent. Users should note that this decision rule is different from that applied to the NLSY79 children, for whom the mother's race is used. Spanish origins were to be given preference; if at least one ethnicity mentioned was of Spanish origin, the Spanish origin was to be coded (or, for those under 14, if at least one parent was Hispanic, the Hispanic parent's ethnicity was assigned).

A series of ethnic identification variables, '1st-6th Racial/Ethnic Origin' and 'Racial/Ethnic Origin with Which R Identifies Most Closely' (R00096.-R00102.), provide extensive ethnicity information. Respondents were asked during the 1979 interviews to name the racial/ethnic origins with which they identified. A listing of more than 20 categories, including "Black," "English," "French," "German," "American Indian," "Irish," "Mexican," "Mexican-American," and "Puerto Rican," were presented on a Show Card. If a respondent offered more than one origin, he or she was also asked for the ethnic group with which he or she most closely identified. Users should be aware that frequency counts for the coding category "Indian American, or Native American are unusually high. About 5 percent of respondents reported this racial/ethnic origin, compared to Census estimates of approximately 0.5 percent of the population. This may have resulted from some respondents' misinterpretation of the term "Native American." Table 1 compares frequencies of the 1979 first (or most closely held) ethnic identification with the NORC assigned racial/ethnic identification.

Table 1. Ethnicity by racial or ethnic cohort from screener (unweighted data)
Respondent's Self-Identification NORC-Assigned Race/Ethnicity

Racial/Ethnic Group
Note 1.1

Total NonBlack/
Non-Hispanic
Non-Hispanic
Black
Hispanic 
or Latino

Total

12686 7510 3174 2002

Black

3049 19 3017 13

Total Hispanic or Latino

1834 46 5 1783

Cuban

116 1 0 115

Chicano

59 0 0 59

Mexican

383 5 0 378

Mexican-American

734 15 1 718

Puerto Rican

328 7 1 320

Other Hispanic or Latino

118 7 0 111

Other Spanish

96 11 3 82

Total European

5281 5100 82 99

French

311 290 10 11

German

1395 1376 5 14

Greek

31 29 0 2

English

1561 1476 51 34

Irish

949 933 3 13

Italian

497 474 7 16

Polish

238 234 3 1

Portuguese

97 88 3 6

Russian

45 45 0 0

Scottish

122 120 0 2

Welsh

35 35 0 0

Total Asian

117 93 11 13

Asian Indian

22 20 2 0

Chinese

26 22 4 0

Filipino

43 33 4 6

Japanese

19 14 0 5

Korean

6 3 1 2

Vietnamese

1 1 0 0

Hawaiian/Pacific Islander

20 17 0 3

American Indian

622 585 17 20

Other

779 736 21 22

American

743 692 10 41

None Note 1.2

241 222 11 8

Note 1.1: R00102., 'Racial/Ethnic Origin with Which R Identifies Most Closely,' is used unless it was not answered; otherwise R00096., '1st or Only Ethnic Origin' is used. Those listing only one ethnic background did not answer R00102.

Note 1.2: Includes totals of 98 "don't know," 132 "none," 10 "invalid skips," and 1 "refusal."

Immigration

In 1990, NLSY79 respondents born outside the United States, its territories, or Puerto Rico were asked a series of questions on their immigration history and visa status. Dates of first and most recent entrance into the United States to live for six or more months and information on whether the respondent was the principal entrant/immigrant were collected. For respondents' or principal entrant/immigrants' first and most recent entry or change in visa/immigration status, details were gathered on:

  1. visa or immigration status at entry date
  2. form of temporary entry visa
  3. citizenship status (that is, citizen or permanent resident alien) and relationship of the sponsoring relative
  4. country of citizenship at entry date or date of change of status

Also recorded for the respondent was information on:

  1. current citizenship/residence/visa status in the United States
  2. residence inside/outside the United States
  3. expectations to return to the United States to live permanently or to return to his or her country of birth to live permanently
  4. the total number of years spent outside the United States since initial entry

Of related interest is the variable, 'Is R a Citizen of the U.S.' available from the 1984 and 1990 interviews.

Foreign language used or spoken

For each household member, information is available from the screener on presence of a Spanish surname and whether Spanish was the language spoken in the home when that individual was a child. The 1979 interview asked whether a foreign language (Spanish, French, German, other) was spoken at home during the respondent's childhood. In addition, interviews record for each survey whether English, Spanish, or another foreign language was used to administer the Household Interview Forms ('English or Foreign Language Used for Household Record') and questionnaire ('Int Remarks - Was Interview Conducted in English or Foreign Language').

Reference

NORC. 1978 Household Screener and Interviewer's Reference Manual. Chicago, IL: National Opinion Research Center - University of Chicago, 1978.

Comparison to Other NLS Surveys Race is available for all cohorts; ethnicity is available for all cohorts except the Older Men and Young Men. Users should be aware that coding categories for race and ethnicity have varied among cohorts and over time. For more precise details about the content of each survey, consult the appropriate cohort's User's Guide using the tabs above for more information.
Survey Instruments & Documentation

Race and ethnicity variables originating from the screener are located on the second page of the Household Screener. Questions concerning the ethnicity of the respondent are included in the "Family Background" section (Section 1) of the 1979 questionnaire. Interviewer remarks regarding race are located in the final section ("Interviewer's Remarks") of each questionnaire. Immigration questions are located in Section 13, "Immigration," of the 1990 questionnaire.

For further information on the coding of race and ethnicity in the Household Screener, see the 1978 Household Screener and Interviewer's Reference Manual (NORC 1978). Those needing additional information on coding procedures should request a copy of a NORC memo dated 10/4/78 available from NLS User Services.

Areas of Interest

'Birthplace (Country and State) of R's Mother/Father' and 'Birthplace (Country) of Father's Father' are available in "Geocode" area of interest in the restricted-use Geocode file. 'Current residence in US?' is in the "Key Variables" area of interest in the public file.

Race and ethnicity variables are included in the following areas of interest: 'Racial/Ethnic Cohort from Screener' is in the "1978 Screener" and "Household Record" areas of interest. Ethnicity variables originating from the 1979 interview as well as all immigration variables have been placed in the "Family Background" area of interest. The interviewer's remarks variables are located in "Interviewer Remarks." Race variables for household members originating from the 1978 household screening are located in "1978 Screener" and "Household Record."  Current residence related to immigration status "Family Background."

Household Composition

Created variables

TYPE OF RESIDENCE. These variables reflect the type of residence in which the R was living at each survey point (e.g. own dwelling unit, in parental household, jail, etc.). Although these variables exist for each year, they are only actually created or compiled from multiple versions of the Household Interview from 1979-1986. A single version of the Household Interview was used beginning in 1987.

Important information: Using household composition data

  • Some familiarity with the following survey instruments (see Survey Instruments section for descriptions of each of these instruments) which gather information on households is necessary:
    • the NLSY79 Household Interview Forms
    • the NLSY79 "Household Enumeration"
    • the NLSY79 Face Sheet
    • and the household screeners that were used to select respondents for the NLSY79 cohort
  • This section does not contain information pertaining to variables about the characteristics or experiences of household members, the presence of partners within the household, or geographical areas of residence. Any information collected specifically on household members will be in specific topics of interest, such as age, sex, educational status, and so forth. The availability of information on partners is discussed in the Marital Status, Marital Transitions & Attitudes section.
  • Income of partners is omitted from 'Total Net Family Income', family size, and family income variables. Inferring a monetary relationship between household members who do not have a legal relationship by their own design is more tenuous than inferring a monetary relationship between designated family members. Therefore, partners are excluded. You can easily add or subtract from the family size by designating your own qualifying relationships.
  • Spousal pairs are inconsistent for three respondents. In the created relationship codes for household members (R00001.51, R00001.53), respondents 9707, 8522, and 1414 are considered spouses of 9706, 8521, and 1413, respectively. However, 9706 is considered 9707's partner, 8521 is considered 8522's "other non-relative," and 1413 is considered 1414's husband or brother-in-law. These assigned relationships are reflective of respondents' own explanations of the relationships. Relationship codes linking respondents may be weak outside of immediate family relationships. 
  • Do not use the 1982 'Version of Household Record from Last Interview' as a substitute for the missing 1981 version because it may contain inaccuracies and because not all 1981 interviewees were interviewed in 1982.
  • This section describes variables related to household and family composition, household identification, linkages between members of multiple respondent households, and household residence.

Household members

The term "household" refers to all individuals sharing the respondent's primary residence at the time of the interview. For respondents living in temporary quarters (except temporary military quarters), the usual residence is defined as that person's permanent residence. For those living in their own dwelling unit or in military family housing, the usual residence is the person's dwelling unit. For example, if a male college student is living in a temporary residence, such as a fraternity, those who share his permanent residence, such as his parents' address, would be considered his household members. However, if that same college student were living in his own apartment, all those living in his apartment would be considered his household members.

Persons analyzing military households should note that household screener information was not collected for persons in the military sample. Thus, while military units are included in the total 8,770 unique households, military units cannot be multiple respondent households. Household specification for those respondents enlisted in the military is as follows:

  1. for those in the military who are married but living in military quarters other than military family housing, the household is the household of the respondent's spouse, and
  2. for those in the military who are unmarried, no household information is recorded.

During PAPI surveys, information about a respondent's household was gathered during a separately administered household interview. Three different Household Interview Forms "Household Interview Forms" were used prior to 1987: 

  • Version A was completed by a parent of those respondents living in a parental household
  • Version B was administered to youth not living at a permanent address
  • and Version C was answered by those respondents living in their own dwelling unit or independent living quarters

Table 1 details, by survey year, the relevant universes and residence types specific to each version; notes on variations in administration of the forms are included. A series of variables entitled 'Version of Household Record Used' is available for the 1979-80 and 1982-86 survey years. To determine the version of the household interview used in 1981, it is necessary to match information from the variable, 'Type of Residence R is Living In,' to residence information that was included on the three different forms. Beginning in 1987, only one version of the household interview was used, as all respondents were 22 or older and living predominantly on their own. Since the introduction of CAPI interviews in 1993, household information has been collected in the first section of the main questionnaire rather than in a separate instrument.

All members of the respondent's household are enumerated each survey year on the household record; in 1978, household members were listed on the household screener. The relationship generally listed for each household member on the household record is relative to the youth respondent, such as 'Household Record - Relationship to Youth Member # 1.' For variables from the screener and for one series of 1979 household record variables, the relationship of household members (only family members in the screener) is relative to the householder. Anyone who lives in the residence but is temporarily away is listed; anyone who is there only temporarily is removed from the listing. For the screener and for interviews in which the respondent lives in a new household, that is, living with new people rather than living at a new address, the householder generally is listed first, followed by a spouse; any children; any other relatives; and any roomers, boarders, hired help, or other usual unrelated residents.

Table 1. Guide to the household forms: NLSY79
Year Household Version Conducted with R's Residence
1979 Version A Parent of R only Parental home
Version B1 Youth Respondent Dorm, jail, hospital, temporary living quarters
Version B2 Youth Respondent Military sample member
Version C Youth Respondent Own dwelling unit, orphanage, religious institution, other living quarters
1980 Version A Parent of R only Parental home
Version B Youth Respondent Dorm, jail, hospital, mil/temp living quarters
Version C Youth Respondent Own dwelling unit, orphanage, religious institution, military/other living quarters Note 1.2
1981 Version A Parent of R only Parental home Note 1.1
Version B Youth Respondent Dorm, jail, hospital, mil/temp living quarters
Version C Youth Respondent Own dwelling unit, orphanage, religious institution, military/other living quarters Note 1.2
1982 Version A Parent of R only Parental home Note 1.1
Version B Youth Respondent Dorm, jail, hospital, mil/temp living quarters
Version C Youth Respondent Own dwelling unit, orphanage, religious institution, military/other living quarters Note 1.2
1983 Version A Parent of Respondent only Parental home Note 1.3
Version B Youth Respondent Dorm, jail, hospital, mil/temp living quarters 
Version C Youth Respondent Own dwelling unit, orphanage, religious institution, military/other living quarters Note 1.4
1984 Version A Parent of Respondent only Parental home Note 1.3
Version B Youth Respondent Dorm, jail, hospital, mil/temp living quarters
Version C Youth Respondent Own dwelling unit, orphanage, religious institution, military/other living quarters Note 1.5
1985 Version A Youth Respondent or Parent Parental home Note 1.6
Version B Youth Respondent Dorm, jail, hospital, mil/temp living quarters
Version C Youth Respondent Own dwelling unit, orphanage, religious institution, military/other living quarters Note 1.4
1986 Version A Youth Respondent or Parent Parental home Note 1.6
Version B Youth Respondent Dorm, jail, hospital, mil/temp living quarters
Version C Youth Respondent Own dwelling unit, orphanage, religious institution, military/other living quarters Note 1.4
1987-2022 One HH version only Youth respondent only Any residence

Note 1.1: Includes youth respondents under 18, living in other parent's or spouse's parents' home.

Note 1.2: Includes youth respondents over 18, living in other parent's or spouse's parents' home.

Note 1.3: referred version of household interview for youth respondents living in other parent's or spouse's parents' home.

Note 1.4: Permissible (though not preferred) version of household interview for youth respondents living in other parent's or spouse's parents' home.

Note 1.5: Included some youth respondents still in parental household (with explanation as to circumstances--code "17" added).

Note 1.6: Included youth respondents in other parent's or spouse's parents' home (codes "18" and "19" added to reflect whether household interview conducted with the youth respondent or the parent).

Family members

Within the listing of household members, family units are identified through family unit numbers and relationship codes. A family unit includes all those related by blood, marriage, or adoption. For each member of the household in every survey year, including the 1978 screener, the family unit number is listed on the "Household Enumeration" or the screener, such as 'Household Record - Family Unit # 1 Member # 1.' All family members in an interrelated group will share a family unit number, with number 1 assigned to the respondent's family. Each additional interrelated group or individual adult sharing the household but not related to another group or individual in the household constitutes an additional family unit. For example, if Mr. and Mrs. Brown are boarders in the same house with Mr. and Mrs. Smith, the Smiths are the first family unit and the Browns are a second family unit. The reliability of 1979-92 family unit numbers beyond those assigned to the respondent's family is questionable. Beginning in 1993, family units were assigned electronically; the definition of a family unit remains the same. Codes were added for partner's family/relations. All others are assigned a code of "9."

An enumeration of a respondent's children is also available. Several variables have been created as part of the Supplemental Fertility File (area of interest "Fertility and Relationship History/Created"), including variables such as '# of Own Children in Household,' 'Age of Youngest Child in Household,' and a variety of variables for each biological child listed, with some exceptions, in order of age. Unedited variables from the Children's Record Form (areas of interest "Child Record Form/Biological" and "Child Record Form/Nonbiological") are also available for both biological and nonbiological children. If there is a discrepancy between household rosters versus marriage event histories use the marriage event histories rather than the household roster data. See the "Fertility" section of this guide for more information about the collection of information on the respondent's children.

Additionally, information about household members includes, sex of the member, their relationship to the respondent, age, highest grade completed, and whether the member receives pay for work (age restrictions apply.)

Finally, information on whether the mother and father of each child (in 1991, new children only) live in the household is available for the 1987-2014 survey years. In all other years, information on whether the father of the child is present is available for children of female respondents.

Family size

Beginning with the 1990 release, a family size variable, comparable to the family size variable created for the computation of the 'Total Net Family Income' and 'Poverty Status' variables, was created for each year. The variable is constructed by simply cycling through the household record "relationship codes" and increasing the family size by one each time a qualifying relationship relative to the respondent is encountered. Qualifying relationships include all relations by blood, marriage, and adoption. Foster relationships, partners, boarders, guardians, and other individuals are not considered family members in the creation of this variable.

Program derivation

The SPSS program statements for a sample survey year (1979-92) FAMILY SIZE variable are as follows:

COUNT FAMSZXX=RELR1 TO RELR15 (0 THRU 32,37 THRU 44,47 THRU 49)

IF (WEIGHTXX EQ 0) FAMSZXX=-5

After 1993, the roster was expanded to accommodate up to 20 individuals. The SPSS program is the same but the number of relationships to check is five larger. Additionally, the respondent is not on the household roster after 1992, so FAMSZXX is initialized to "1."

Household identification and linkages

The NLSY79 screening procedure allowed more than one member of a household to be selected for interviewing. The original 12,686 respondents were members of 8,770 households; 6,742 respondents or 53 percent of the sample were members of households from which more than one respondent originated, while 5,944 respondents or 47 percent were members of single respondent households (Table 2). To establish linkage of respondents originating from the same household, variables identify other interviewed household members and their relationships as of the 1979 interview. The 1979 variable providing the unique household identification number of each household is R00001.49, 'Household Identification Number' (HHID). The same HHID is assigned to all respondents who originated from the same household in 1979. In multiple-respondent households, the HHID corresponds to the lowest respondent 'Identification Code' of all respondents interviewed in that household; in single-respondent households, the HHID corresponds to the respondent 'Identification Code.' The HHID variable was constructed using other created variables from the NLSY79 main data set and exists only for 1979.  Multiple respondent households can also be identified through variables that identify other respondents in the same household and their relationship to the first respondent. Reference numbers include R00001.50-R00001.61, for example, 'Identification Code of 1st Other Interviewed Youth in R's Household.'

Table 2. Distribution of respondents living within single and multiple respondent households: NLSY79

Household Type Note 2.1

Households Respondents Percent of Sample Note 2.2

Single Respondent Households

5944 5944 46.9

Multiple Respondent Households

2826 6742 53.1

2 Respondent Households

1985 3970 31.3

3 Respondent Households

634 1902 15.0

4 Respondent Households

170 680 5.4

5 Respondent Households

32 160 1.3

6 Respondent Households

5 30 0.2

Total

8770 12686 100

Note 2.1: Household types are based on information gathered during the 1978 household screening.

Note 2.2: Numbers have been rounded up to the nearest tenth.

Although these matches represent unique samples for a number of research topics, be aware that matches may be not be demographically representative due to the age restrictions applied to all members chosen from a household. The primary types of relationships that existed among respondents within multiple respondent households at the time the surveys began included brothers, sisters, husbands, and wives (Table 3). Other relationships included cousins, brothers- and sisters-in-law, step-brothers or -sisters, and other types of household members.

Table 3. Number of NLSY79 civilian respondent pairs interviewed in 1979 and 1992

Type of Pair

Respondent Members Households
1979 1992 1979 1992

Siblings

5863 4806 2448 2149

2 Siblings

3386 2744 1693 1572

3 Siblings

1725 1427 575 446

4 Siblings

604 519 151 116

5 Siblings

130 99 26 13

6 Siblings

18 17 3 2

Spouses Note 3.1

334 216 167 120

Note 3.1: Excludes three cases in which the relationship assigned to the respondent pair is "spouse" for only one member of the pair.

Household residence

Household residence refers to the type of dwelling or living situation of the respondent. Household residence information is available for the respondent at each survey point, for the respondent during his or her childhood, and for the respondent's children during recent surveys. The variable 'Type of Residence R is Living In' classifies the respondent's actual place of residence at the time of each survey. From 1979-86, it was created based upon responses to several questions asking about different types of dwelling units. In these years, several versions of the Household Interview Forms (the instrument completed before the main questionnaire and used to construct the household enumeration) existed. The universes for these different versions were dependent upon the type of dwelling unit in which the respondent lived (parental home, own dwelling unit, individual or group quarters), the sample type of the respondent (military or civilian), and who answered the household interview section questions (respondent or parent). The responses to questions designating type of residence from each of these versions were combined into one variable reflecting type of residence for the entire sample.

Beginning in 1987, the several versions of the Household Interview Forms were combined and all types of residences were coded in one question. Therefore, after 1986, this question is no longer considered a "created" variable. The 'Type of Residence R is Living In' variables include categories such as dorm, fraternity or sorority, hospital, jail or juvenile detention center, orphanage, religious institution, own dwelling unit, parents' household, and specific types of military quarters. The codes assigned to response categories for type of residence in 1979 differ significantly from those in other survey years. Also, in earlier years, respondents living in parental homes were treated as valid skips; in later years, these respondents were assigned a separate code that differs by year.

Retrospective information describing the respondent's childhood living arrangements was collected during 1988 in a three-part series of questions on the Childhood Residence Calendar. In Part 1, the respondent's identification of any type of parent with whom he or she lived for four or more months was recorded. Coding categories included biological, adoptive, or stepmother or father for each age from birth through 18 years, for example, 'Lived with Biological Mother at Birth,' 'Lived with Adoptive Father at Age 16.' Ages at which the respondent stopped living with a parent, the reason for ending shared living arrangements, and the frequency of visitation with the absent parent during the first year after coresidence ended were also collected. For those ages when the respondent reported not living with a parent, information was collected in part 2 of the Childhood Residence Calendar on:

  • coresidence with grandparent(s), other relative(s), foster parent(s), or friend(s)
  • residence in a children's home or orphanage, a group care home, a detention center/jail/prison, or another institution
  • use of another type of arrangement
  • for those ages ten and over, whether the respondent was left to be on his or her own

Variable titles for this series include 'Lived with Foster Parent(s) (Not Living With a Parent) at Age-7,' 'Lived in Children's Home/Orphanage (Not Living With a Parent) At Age-2,' 'Left to be on Own (Not Living With a Parent) at Age-15.' The number of foster or group care arrangements experienced by the individual is also recorded. Finally, in part 3 of this supplement for each age during which the respondent experienced more than one living arrangement when not living with a parent, the place at which s/he stayed the longest is identified. Data quality issues are discussed by Haurin (1991).

Information on the residence of respondents' children is available, for the most part, since the 1982 survey year. Note that edited variables based on the 1989 and 1991 raw data are not available until the subsequent year's release. These edited variables, cleaned and checked for consistency, include residences of each biological child in birth order (with some anomalies), such as 'Usual Residence of 7th Child,' and combine information collected for residence of children of male respondents with that of children of female respondents. Coding categories include in the respondent's household, with other parent, with other relatives, in foster care, with adoptive parents, in a long-term care institution, away at school, deceased, lives part-time with both parents, lives part-time with the respondent and another person, and other. The unedited variables upon which the edited variables are based can be found in the "Fertility" section of the main youth questionnaire and include residence of all biological children; residence of all children born by the time of the previous interview, collected annually since 1984; and residence of all children born since the last interview, collected since 1983. Unedited residence information for nonbiological children is available for 1985, 1986, 1988, 1990, 1992, and from 1994 forward. Coding categories for all unedited variables are the same as for edited variables.

References

Haurin, R. Jean. "Childhood Residence Patterns: Evidence from the National Longitudinal Surveys of Work Experience of Youth." Columbus, OH: CHRR, The Ohio State University, 1991.

Morgan, William R. "Sibling Influences on the Career Plans of Male and Female Youth." Columbus, OH: CHRR, The Ohio State University, 1983.

Comparison to Other NLS Surveys

All biological children of NLSY79 mothers are included in the NLSY79 Child data set. NLSY79 young adults, regardless of whether they are living with their mothers, complete a household interview almost identical to that in the main youth.

Information on the respondent's household is available for all other cohorts for most survey years. Data generally include the age, sex, relationship to the respondent, and educational attainment of all occupants; the enrollment status of those of school age; and the occupation and weeks worked of residents age 14 and older. In the pre-1980 surveys of the Original Cohorts, data were generally collected only for family members living in the respondent's household and not for unrelated household members. For more precise details about the content of each survey, consult the appropriate cohort's User's Guide using the tabs above for more information.

Survey Instruments & Documentation The 1988 childhood residence data were collected using questions in Section 16 "Childhood Residence" and the supplemental Childhood Residence Calendar. Information on residence of respondent's children is collected in the "Fertility" section of the questionnaire. Questions on distance of a respondent's child to the child's mother, father, or to the respondent also are located in the "Fertility" section. The questions on distance from the respondent's residence to that of his or her father or mother can be found in the "Family Background" section of the 1979 questionnaire. General information on the Supplemental Fertility File variables, such as the edited residence of children variables, can be found in Appendix 5: Supplemental Fertility and Relationship Variables of the NLSY79 Codebook Supplement. A technical appendix in Morgan (1983) presents details on respondent sibling matching procedures.
Areas of Interest
  • The family size and type of residence variables are included in the "Key Variables" area of interest. 
  • Edited residence of children variables have been placed in "Fertility and Relationship History/Created," while unedited residence of children variables have been placed in the "Birth Record" and "Birth Record xxxx" areas of interest.
  • Information from the household interview, which is transcribed onto the household enumeration, is included in "Household Record."
  • The distance from the respondent's residence to that of each child not living in the household, as well as the distance each child lives from his or her mother (for children of male respondents) or father (for children of female respondents), is available in the "Birth Record xxxx" area of interest for 1984-86, 1988, 1990, 1992, and 1994-2016. The distance from the respondent's residence to the residence(s) of the respondent's mother and father was collected during the 1979 interview.

Geographic Residence & Neighborhood Composition

Created variables

Public use variables:

  • Region of residence at each survey date (Northeast, North Central, South, or West)
  • Information on whether the current residence is in an urban or rural county
    • Through 1996, this series was based on the respondent's State and county of residence and the "% urban population" data from the County & City Data Book. From 1998-2002 this item was based on whether the respondent was living in an urbanized area or in area with a population greater than 2,500. Beginning in 2004, this item indicates whether the respondent resides within an urban cluster or urbanized area. For further information see the Geocode Codebook Supplement.
  • Information on whether the current residence is in a Metropolitan Statistical Area (MSA), the central city of an MSA, or outside of an MSA
    • Based upon zip code, State, and county matches with metropolitan statistical designations for place of residence, the location of the respondent is determined to be within or outside of a metropolitan statistical area.
  • Beginning in 1988, whether the current residence is in the United States.

Geocode file variables:

  • The specific county and State (both edited) of residence at the time of interview, coded with Federal Information Processing Standards (FIPS) codes
  • Similar information is provided for the respondent's residence at birth and at age 14
  • The specific metropolitan area of residence at the time of interview. As applicable, information may be included for the following types of metropolitan areas:
    • SMSA-Standard Metropolitan Statistical Area
    • MSA-Metropolitan Statistical Area
    • CMSA-Consolidated Metropolitan Statistical Area
    • PMSA-Primary Metropolitan Statistical Area
    • NECMA-New England County Metropolitan Area
    • CBSA-Core Based Statistical Area
  • Distance between respondent addresses at each interview round (see Appendix 22: Migration Distance Variables for Respondent Locations).
  • This supplements the data on state and county of residence and is available only on the geocode release
  • The distance between the respondent's addresses at each date of interview was created for all unique pairs of survey years
  • The data described here do not actually provide a location for the respondent's residence; these variables only provide distances between the various places the respondent lives
  • This pairwise matrix of variables enables various types of migration research by enabling users to consider the distance between residences and to identify return migration to an area where the respondent has lived in the past
  • Indicators of the quality of the geographic data:
    • May not have an address for the respondent
    • In such cases the respondent's address is geocoded to the centroid of the zipcode when we can determine the zipcode
    • To identify these cases, an indicator for the quality of this distance measure was created based on the quality of the matches in both years
  • Indicator for whether the respondent was located in the same zip code, was created for all pairs of years

Important information: Using restricted-use Geocode data

  • The level of detail available determines whether a variable is placed within the restricted release "Geocode" files. For example, general country level information, such as whether the respondent resided at various points in time within or outside of the United States, is available to all users with no restriction, while the specific county or SMSA in which he or she resided at a specific interview point is present only within the restricted-use Geocode data files.
  • Researchers interested in using restricted-use Geocode data must submit an application to BLS. These confidential files are available for use only at the BLS National Office in Washington, DC, and at Federal Statistical Research Data Centers (FSRDCs) on statistical research projects approved by BLS. Access to data is subject to the availability of space and resources. Information about applying to use the zip code and Census tract data is available on the BLS Restricted Data Access page.
  • The "Household Interview" areas of interest contain a set of variables titled 'Does R Live on a Farm or in a Rural Area?' The interviewer answers this question based on observation when at the respondent's permanent residence; if the interview takes place elsewhere, the interviewer asks the respondent about the place of residence. There are no consistent criteria for the definition of nonfarm property as rural. These variables should not be considered a replacement for the created KEY VARIABLE, 'Current Residence Urban/Rural?'
  • The coding of respondents' geographic location before 1993 required extensive hand-editing and may not be completely accurate. The most common error is the potential assignment of a respondent to an adjacent county of residence. Data on addresses, zip codes, and phone numbers are used to clean the geographic codes. The post-1988 use of telephone number information improved data quality. A brief discussion below provides more information on both the hand-edits performed each year and the created variable that indicates the extent of hand-editing required for each case; see Appendix 10 in the Geocode Codebook Supplement for more details.

Geographic data for NLSY79 respondents fall into two categories: information on the main public file and more detailed information released as restricted-use Geocode data. Table 1 lists NLSY79 geographic variables with their areas of interest and corresponding documentation found in the NLSY79 Geocode Codebook Supplement and the NLSY79 Codebook Supplement. Variables with a "Geocode" areas of interest are restricted-use data; all others are public use.

Table 1. Select residence variables by survey year and area of interest: NLSY79 main and Geocode files
Variables Survey Years Area of Interest Documentation
Residence at Birth Country - U.S. or Other Country 1979, 1983 Geocode  
Country - Actual Other Country 1979 Geocode Attachment 101
County 1979 Geocode Attachment 102
State 1979 Geocode Attachment 102
South/Non-South 1979 Family Background Attachment 100
Residence at Age 14 Country - U.S. or Other Country 1979 Geocode  
Country - Actual Other Country 1979 Geocode Attachment 101
County 1979 Geocode Attachment 102
State 1979 Geocode Attachment 102
South/Non-South 1979 Family Background Attachment 100
Area of Residence - Urban/Rural 1979 Family Background NLSY79 User's Guide and Appendix 6
Present Residence Lived in Since Birth 1979 Family Background  
Year of Move to 1979 Family Background  
Migration History Country/County/State Since Jan. 1978/Last Interview 1979-1980, 1982, 2000-2020 Geocode Attachment 101, 
Attachment 102
Month/Year of Move(s) 1979-1980, 1982, 2000-2020 Family Background  
Main Reason for Move 2018-2022 Family Background  
Months Spent at Alternate Residence 2018-2022 Family Background  
Current Residence Region 1979-2022 Key Variables Attachment 100
Urban/Rural 1979-2022 Key Variables NLSY79 User's Guide and Appendix 6
SMSA/Central City 1979-2022 Key Variables NLSY79 User's Guide and Appendix 6
In U.S. 1979-2022 Key Variables NLSY79 User's Guide
County 1979-2022 Geocode Attachment 102
State 1979-2022 Geocode Attachment 102
SMSA 1979-2022 Geocode Attachment 104
PMSA 1979-2022 Geocode Attachment 104
MSA 1979-2022 Geocode Attachment 104
CMSA 1979-2022 Geocode Attachment 104
MSA/CMSA/NECMA 1979-2022 Geocode Appendix 10
CBSA 1979-2022 Geocode Appendix 10
Main Reason for Moving Since Date of Last Interview 2018-2022 Family Background NLSY79 User's Guide

Geocode file variables

The Geocode files provide data on NLSY79 respondents' residence at the State, county, and metropolitan statistical area levels, merging information from Census reference files and data books, and includes additional variables such as local unemployment rates, job location, and college and military discharge locations where available.

  • Information on the State, county, and metropolitan statistical area of residence for each respondent (the current residence variables) are merged with information from several other data files, namely the City Reference File (Census 1973, 1982, 1983, 1987, 1992) and the County & City Data Book (Census 1972, 1977, 1983, 1988, 1994), to provide detailed information on the environmental characteristics of the State, county, and metropolitan statistical areas in which each NLSY79 respondent resides. Note: Users may attach additional county and metropolitan statistical area-level data from a variety of sources by simply merging information from the desired source with the Geocode data based upon the State, county, and metropolitan statistical area of residence codes in the Geocode file.
  • For select survey years, Geocode information is available on the location of respondents' jobs, the location of colleges attended, and the point of discharge from military service.
  • Unemployment rate of each respondent's labor market of current residence:
    • The source of the 'Unemployment Rate' variables is the May issue of the Bureau of Labor Statistics' Employment and Earnings for the year following the survey year. Figures from March of each survey year are used. This table supplies unemployment rates for each State and for selected metropolitan statistical areas. Respondents who reside within one of these metropolitan statistical areas are assigned the appropriate unemployment rate. For those residing outside of these areas, a "balance of State" unemployment figure is computed using State total figures for the size of the civilian labor force and the number employed and subtracting the population living in metropolitan statistical areas.
    • Additional information on these variables can be found in Appendix 7 in the NLSY79 Geocode Codebook Supplement.

Types of county or Metropolitan Statistical Area environmental characteristics in the NLSY79 restricted-use Geocode data

  • Population sizes
  • Percent of population that is:
    • urban
    • black
    • female
    • under 5 years old
    • 65+ years old
  • Birth/death/marriage/divorce rates
  • Physician and hospital bed rates
  • Crime rates
  • Poverty level data
  • Educational attainment levels 
  • Median family and per capita income
  • Recipients of and payments from:
    • AFDC
    • SSI
    • Social Security
  • Labor force statistics:
    • total labor force
    • civilian labor force
    • number of females in the civilian labor force
    • civilians unemployed versus employed
    • percent employed in various industries
  • Unemployment rate for labor market of residence

Geographic residence

Detailed geographic mobility information was collected during the 1979-80, 1982, and from 2000 forward; data were gathered on the country/county/State and timing of up to five residential moves since January 1978 or since the last interview. Beginning in 2000 only significant geographical moves were recorded.

Neighborhood quality

The neighborhood quality series (1992, and 1994-2000), is taken from the National Commission on Children Parent & Child Study, 1990 Parent Questionnaire. In this series of questions respondents rate how much of a neighborhood problem issues such as crime, lack of police protection, unsupervised children and joblessness are.

Other geographic variables

Users may obtain special permission to use zip code and Census tract data available at the BLS offices in Washington, DC.

Edited versus unedited versions of state/county of residence

For some years (1979-82, 1988-89, 1991-92), two versions of the State and county of residence variables have been included in the "Geocode" files. The set occurring at the beginning of each file is the edited version, while the variables found near the end of the files for these years are unedited. If the variable has an actual source question number/name, it is the original from NORC. If the source question name says created, it is the edited/created version. Note that the unedited variables are sometimes combined into a single variable, with the State and county code appended to each other. These raw variables are preceded by the word "GEOCODE" in the variable title. The edited residence variables contain the corrections made for erroneous address information and are the ones from which the Geocode files themselves are constructed. Users should be aware that the edited version of these variables does not contain data for those respondents who are in the active military forces or who are living abroad or in a U.S. territory.  Codes of "-4" appearing in the unedited versions of the State or county variables (because foreign country and U.S. territory codes are placed in one field or the other) should not appear in the edited versions of these residence variables.

Geocode procedures for assigning residence codes and hand-editing discrepant cases

During the 1988 hand-editing process, it became evident that the telephone numbers were very accurate, even in cases for which the address information contained discrepancies. Beginning in 1989, the area code and phone exchange were used to assign State and county of residence codes. The State assigned by the area code was then compared to the State assigned on the basis of zip code alone and the State contained in the original NORC respondent file. A "quality of match" variable was computed on the basis of how well these States match. For a more detailed discussion of these new assignment and matching procedures, refer to Appendix 10 in the Geocode Codebook Supplement. This process was used through the 1994 release.

The hand-editing procedure has also been streamlined. In 1989, the first year in which the phone assignment procedure was used, the residence codes assigned on the basis of the area code and exchange were compared to the raw residence variables received from NORC. Those with information that did not match were identified for individual examination. Ideally, the discrepancies requiring individual examination would be reduced to those cases which are "genuine movers" or which have zip codes covering multiple counties and would require some verification that the correct county was assigned based upon the phone information. The current process for identifying discrepancies and hand-editing is aimed more directly at achieving this objective. 

Beginning in 1990, the residence codes assigned based on phone information were compared to the 1989 CHRR-edited residence information to identify cases for individual examination. Because the previous year's edited variables incorporate the corrections that were made in the hand-editing process from earlier years, repeated editing of the same cases across years decreased. Through this process, the discrepancies in residential Geocode information were reduced. The number of cases requiring individual examination also decreased and was restricted more closely to the population of "genuine movers" and people with multiple-county zip codes and phone numbers that require verification of county of residence. 

The hand-editing process in previous years included not only these genuine movers and multi-county zip code dwellers, but also other cases for which elements of the address are simply in error or incompatible with each other. Some of these cases could potentially require editing for the same errors in more than one year, even if the respondent stayed in one location. Hand-editing procedures were further streamlined, and in some cases automated, to produce the 1992 data.

Beginning in 1996, a new procedure for verifying and assigning correct final Geocode information was instituted. This procedure is now performed using specialized address tracking Geocode software. The processes are described in Appendix 10. It is the belief of CHRR staff members not only that the current procedures are more efficient in identifying true discrepancies and streamlining the hand-editing process, but also that they result in more accurate and consistent assignment of State and county codes in general. 

Missing values, New England cases, and mobility

Missing values in location of residence variables and metropolitan statistical area codes are associated with respondents who are in the active military forces or who are living abroad or in a U.S. territory. Users should be aware that, because the New England County Metropolitan Area (NECMA) codes are not comparable to metropolitan statistical areas from the remainder of the country, New England cases are eliminated from some of the procedures used to construct the Geocode files.

The review and hand-editing process has been periodically revised to improve the accuracy of the data and the efficiency of data production. The potential implications for effects on mobility rates between some years due to these changes have been noted in Appendix 10. Users should read Appendix 10 carefully to gain a better understanding of the issues outlined above and their implications for specific research endeavors.

Comparison to Other NLS Surveys Data on the respondent's area of residence are available for all cohorts. Geographic residence information for those NLSY79 children who resided with their mother can be inferred from the residence data of their mothers. The NLSY97 main created variables indicate whether the respondent lives in an urban or rural area, whether the respondent lives in a Metropolitan Statistical Area, and in which Census region the respondent resides. More detailed information is available on the restricted-use Geocode data. Region of residence and geographic mobility of Original Cohort respondents are provided for most survey years. Geographic data for NLSY79 respondents fall into two categories: information on the main public file and more detailed information released as restricted-use Geocode data. Information about applying to use the zip code and Census tract data is available on the BLS Restricted Data Access page.
Survey Instruments & Documentation

Data on residence at birth and at age 14, as well as the 1979-82 present/most recent residence series, were collected using questions found within Section 1 ("Family Background" and "On Family") of the 1979, 1980, and 1982 questionnaires. All other variables are created from or determined by the geographic information provided by each NLSY79 respondent within the locator section of the questionnaire or from the interviewing Face Sheet or internal NORC locating files. Several attachments and appendices in the NLSY79 Codebook Supplement and/or the NLSY79 Geocode Codebook Supplement offer creation procedure information and coding systems for the geographic residence variables. The following are relevant to the Geocode:

Related Variables

Related NLSY79 main file variables discussed in the Household Composition and Family Background sections of this guide include:

  • Type of residence or dwelling unit at the time of interview (such as dorm, hospital, jail, orphanage, own home, and so forth).
  • Childhood living arrangements of NLSY79 respondents from birth to age 18, including not only information on persons with whom the respondent lived (such as biological versus adoptive and step-parents) but also on institutions such as children's homes, group care homes, or detention centers/jails/prisons in which he or she may have resided.
Areas of Interest Residence variables can be found within the "Family Background," "Key Variables," "Geocode," or "Household Interview" areas of interest; Table 1 above specifies the particular areas of interest for each variable. All environmental variables, including the 'Unemployment Rate for the Labor Market of Current Residence,' are present in the "Geocode" areas of interest in the restricted-use Geocode data.

Sex

Important information: Using sex data

During screening, sex was determined by observation and asked directly of respondents only if it was "not obvious" to the interviewer. On March 1, 1986, 'Sex of R' was changed for 45 cases as a result of inconsistencies generated from interviewer checks for respondent's sex in the fertility section of the 1982 survey instrument; three additional cases were changed shortly thereafter. Each of these cases was verified by NORC for accuracy. 'Sex of R' (R02148.) for the following identification codes (R00001.) was changed:

  • 166
  • 712
  • 1306
  • 1933
  • 2212
  • 2286
  • 2287
  • 2433
  • 3388
  • 3582
  • 3583
  • 3865
  • 3960
  • 4157
  • 4524
  • 4579
  • 4917
  • 5929
  • 6102
  • 6198
  • 6360
  • 6466
  • 6840
  • 7571
  • 7620
  • 7624
  • 7645
  • 7890
  • 8321
  • 8542
  • 8543
  • 8596
  • 8690
  • 8826
  • 9150
  • 9166
  • 9555
  • 9713
  • 10347
  • 10511
  • 11110
  • 11114
  • 12257
  • 12387
  • 12676

The variable series 'Int Remarks - Sex of R,' provides interviewers' observations of the sex of the respondent for the 1982-1986 and 1988-1998 survey years. These observations are subject to a small degree of error from erroneous interviewer observation or recoding and data entry error. When using this series of variables, a small number of respondents may appear to "change" sex across surveys.

Variables available within the NLSY79 provide information on the sex of each respondent, their children, and members of their household. Information on the sex of the respondent can be found in:

  • a single 1979 variable, 'Sex of R' (R02148)
  • a set of yearly interviewer remarks variables, 'Int Remarks - Sex of R' 

The 1979 'Sex of R' variable (R02148.) is derived from R01736., 'Sample Identification Code,' a variable which defines each respondent's membership in one of the subsamples that compose the NLSY79 (such as the "cross-sectional male, nonblack/non-Hispanic poor," "supplemental female black," and so forth). Subsample identification was based on information gathered during the 1978 household screening.

Comparison to Other NLS Surveys Sex for all biological children born to female members of the NLSY79 is available. Information on sex is also available for the NLSY97. Sex is implicit by membership in the Original Cohorts. For more precise details about the content of each survey, consult the appropriate cohort's User's Guide using the tabs above for more information.
Survey Instruments & Documentation A copy of the 1978 Household Screener used to collect information on sex of the respondent and other household members can be found in the Household Screener and Interviewer's Reference Manual (NORC 1978). Interviewer observations are recorded in the final section of each questionnaire, entitled "Interviewer's Remarks." Household members' sex is collected during the administration of the Household Interview Forms. A copy of the Information Sheet, containing sex of respondents' children, can be found near the beginning of the yearly Question by Question Specifications. The CRF is a separate child "inventory" referenced in the "Fertility" section of the questionnaire; sample copies can be found in the Question by Question Specifications. Finally, a general description of the derivation of the Supplemental Fertility File variables, such as sex of children, appears in Appendix 5: Supplemental Fertility and Relationship Variables in the NLSY79 Codebook Supplement.
Areas of Interest All sex variables discussed above are located on the main NLSY79 data set. 'Sex of R' (R02148.) and the 'Sample Identification Code' (R01736.) can be found in the "Common" area of interest, while the interviewer remarks variables are located in "Interviewer Remarks." The Supplemental Fertility File variables have been placed in "Fertility and Relationship History/Created." Children's sex, listed separately for biological and nonbiological children on the CRF, are in area of interest "Child Record Form/Biological" and "Child Record Form/Nonbiological," respectively. Variables collected during the household interview can be found in "Household Record." Variables from the Household Screener are housed in "1978 Screener" and "Household Record" areas of interest.

Age

Created variables

AGEATINT: These variables provide the respondent's age at each interview date.

Important information: Using age data

The eligibility for inclusion in this cohort was based on the 1979 age reports, as are weights. Birth date questions were asked again in the 1981 survey because:

  • a number of discrepancies between birth dates found on the military file and the NLSY79 files were discovered
  • a number of inconsistencies between age as recorded on the "Household Enumeration" and the main questionnaire were apparent

The 1981 birth dates should be used to determine age with the 1979 dates used only as a backup. Differences between 1979 and 1981 birth dates remained for approximately 200-250 respondents after the 1981 fielding; editing on a case-by-case basis was performed by CHRR staff on only the 1981 variable. Inconsistencies in age or birth date information may appear for a number of reasons: age and birth date information has been collected at multiple survey points, giving rise to respondent-reported inconsistencies; respondents' ages for sample selection were based on date of birth information reported at the time of the 1978 household screening by individuals who may not have been the respondent; and responses to interviewer check items, that is, the age reported to the interviewer that determines when age-specific questions should be asked, may not be the same age as that calculated from previously reported age or birth date information. For example, a respondent whose age was 16 as calculated from the birth date reported in 1981 may have answered questions which were specific to a 17 year old.

Date of birth information was collected from each NLSY79 respondent during the 1979 and 1981 interviews:

  • The variable 'Age of R,' gathered during the 1979-83 surveys, is the self-reported age of the respondent as of the interview date.
  • The NLSY79 main data files also contain a yearly created variable, 'Age of R at Interview Date.' These created variables are constructed using the 1981 date of birth information coupled with the 1979 birth date for the 491 respondents not interviewed in 1981. 

Table 1 depicts age distribution of the NLSY79 for the 1979 survey year. Table 2 depicts age distribution of the NLSY79 for the most recent survey year.

Table 1. Age of NLSY79 respondents as of date of interview for the 1979 survey year
Age Male Female Total
14 533 471 1004
15 804 766 1570
16 784 780 1564
17 756 756 1512
18 838 798 1636
19 812 871 1683
20 829 827 1656
21 832 850 1682
22 215 164 379
Total 6403 6283 12686

This table uses the created variable 'Age of R at Interview Date.'

Table 2. Age of NLSY79 respondents as of date of interview for the most recent survey year: 2022
Age Male Female Total
57 11 10 21
58 362 376 738
69 436 456 892
60 417 506 923
61 438 491 929
62 427 439 866
63 324 383 707
64 314 373 687
65 265 352 617
66 20 13 33
Total 3014 3399 6413

This table uses the created variable 'Age of R at Interview Date.' In the 2022 survey year, 3389 males and 2884 females were not interviewed.

Comparison to Other NLS Surveys Age data are available for all NLS cohorts. These variables include both the age of the respondents as of a fixed date during the initial survey year and as of the interview date in various years. Date of birth is also available for all cohorts. For more precise details about the content of each survey, consult the appropriate cohort's User's Guide using the tabs above for more information.

Retirement

Beginning with the 2006 survey year, questions about retirement preparation and expectations were introduced as respondents reached their fifties. Table 1 outlines the retirement preparation/expectation variables, along with the survey years in which they were included. These variables are available in the NLS Investigator by searching for the question name root RETIRE_EXP.

In 2018, the retirement section was revised and expanded to allow respondents, many of whom were beginning to retire, to provide more comprehensive details about their retirement funds. Information about types of retirement savings and income expectations can be found in the Pension Benefits & Pension Plans section and by searching the NLS Investigator using the RETIRE_FIN question name root.

Additionally, in the 2018-2022 survey years, respondents were asked if their employer had ever offered an early retirement window, one with a special financial incentive like a cash bonus or improved pension benefits. These variables are available in the NLS Investigator by searching for the question name QES-EARLYOUT.

Table 1. Retirement preparation and expectation questions asked of NLSY79 respondents: 2006-2022
Question 2006
Note 1.1
2008  2010 2012 2014  2016 2018 2020 2022

What is the probability that you will not be working for pay at age 67? age 65? age 62? Note 1.2

* * * * * * * * *

How would you define retirement for yourself? ("Select All" categories include stop work, receive pension, work for fun, etc.)

* * * * *        

Approximately at what age do you think you'll retire?

* * * * *        

Do you expect to work for pay… or at a family business for pay/profit in the future?

  * * * *        

Have you ever calculated how much retirement income you would need at retirement?

* * * * *        

Have you ever consulted a financial planner about how to plan your finances after retirement?

* * * * *        

Have you ever read any magazines or books on retirement spending?

* * * * *        

Have you ever used a computer program to help you plan your retirement?

* * * * *        

Have you ever attended any meetings on retirement or retirement planning?

* * * * *        

Were any of the meetings organized by your (or your spouse's/partner's) employer?

* * * * *        

Did you (or your spouse's/partner's) attend the meetings on a voluntary basis or was attendance required?

* * * * *        

What do you think the chances are that you will be working full-time after you reach age 62? age 65? age 67?

        * * * * *

What do you think the chances are your health will limit your work activity during the next 10 years?

        * * *   *

What do you think the chances are you (and spouse/partner) will leave inheritances totaling $10,000 or more? $100,00 or more?

        * *      

What do you think the chances are you will live to be 75 or more? (85 or more?)

        * * *   *

What do you think the chances are you will ever have to move to a nursing home?

        * * *    

What do you think the chances are you (and spouse/partner) will have major medical or long-term care expenses

                *

How much planning have you done for your retirement?

            *   *

Right now, would you like to leave work altogether, but plan to keep working because you need health insurance?

            *   *

Right now, would you like to leave work altogether, but plan to keep working because you need the money?

            *   *

Have you received a foreclosure notice in the past 12 months?

                *

Note 1.1: Only a random sample of 991 respondents were asked retirement expectations questions in 2006.

Note 1.2: Probability working at age 62 (asked in 2006-2022), at age 65 (asked in 2006-2022), and at age 67 (asked in 2006-2012, 2022).

Survey Instruments & Documentation Questions about retirement expectations can be found in the following sections: Retirement Expectations Part I (2006), Retirement Expectations Part II (2006), and Retirement Expectations (2008-2022).
Areas of Interest Retirement preparations, expectations, savings plans and retirement income are located in the "Retirement" areas of interest, while the early retirement window variable is in "Job Information."

Business Ownership

Created variables

BUSOWN_SOURCEYR. Survey year when business data was collected. Cross-round variable.

BUSOWN-UID-MATCH.01, 02, etc. Unique job ID in Employment History Roster of Business 01, 02, etc. originally. Cross-round variable.

BUSOWN-MATCH-QUALITY.01, 02, etc. Provides quality of the match between Business 01, 02, etc. ownership reported and past employers. Cross-round variable.

BUSOWN-11_TRUNC.01, 02, etc. Money respondent used to establish/acquire Business 01, 02, etc. (amount truncated). Cross-round variable.

BUSOWN-16_TRUNC.01, 02, etc.  Respondent's estimate of sales/revenue of Business 01, 02, etc., generated in a typical year (amount truncated). Cross-round variable.

Note: Users are encouraged to utilize these XRND (cross-round) variables, as they integrate information collected from 2010 onward.

Prior to 2010, only limited information was collected on business ownership in the NLSY79. In 2010 (round 24), NLSY79 respondents who were current or former business owners were asked a lengthy series of questions about each business owned since age 18. The business ownership questions were also asked in subsequent survey years for those not interviewed in 2010. 

Respondents who reported having owned a business gave the year each business was established and how ownership was acquired: whether the respondent established the business themselves or with partners, received ownership as a gift, purchased ownership, inherited ownership, or received an ownership stake through marriage. Respondents answered questions about working for a related type of business prior to starting their business, the source of the money used to establish or acquire the business, the number of employees and the number of physical locations of the business, the legal form of the business (sole proprietorship, corporation, etc.) the sales or revenue in a typical year, and whether the business was family owned.

In addition, all NLSY79 respondents interviewed in round 24 (regardless of "own business" status) were asked several questions on family ownership of businesses, their own patent-seeking activities, and whether they consider themselves to be entrepreneurs. These questions were repeated in subsequent rounds for those who missed the round 24 interview.

Areas of Interest Located in the "Business Ownership" areas of interest. These variables can also be located by using "BUSOWN" as the Question Name search criterion in the NLS Investigator. These variables are listed as cross-round (XRND) variables. To determine which survey year the data were collected, use the created variable "BUSOWN_SOURCEYR."
Subscribe to NLSY79