Created variables
CV_URBAN-RURAL. Indicates whether respondent lives in an urban or rural area
CV_MSA. Indicates whether or not the respondent lives in a Metropolitan Statistical Area
CV_CENSUS_REGION. Provides Census region where the respondent resides (Northeast, North Central, South, or West)
CV_MIGRATE.XX. Describes moves made by the respondent within a county, within a state to a different county, between states, and to and from a foreign country. Available for rounds 2 and up.
CV_DISTANCE_MOM_COL and CV_DISTANCE_DAD_COL. Provide the distance in miles between the respondent's address and the reported address of the respondent's mother and father. Available for rounds 7 and up. Note: Addresses that were not full street addresses were given the longitude and latitude of the centroid of their zipcodes. Data quality variables (CV_DISTANCE_MOM_QUALITY, for example) were created to indicate whether or not the addresses were zip centroided.
CVC_TTL_RESIDENCES. Provides total number of different residences since age 12.
The geographic indicator variables are created to describe the respondent's area of residence. A software package that assigns latitude and longitude information to respondent addresses is used. Latitude and longitude are then used to link respondent addresses to standard geographic information such as state, county, and metropolitan statistical area. See the NLSY97 Geocode Codebook Supplement introduction for more information about the use of programs used to assign latitude and longitude.
Census Division | Number of Respondents by Round | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
R1 | R2 | R3 | R4 | R5 | R6 | R7 | R8 | R9 | R10 | |
Region 1, Northeast (Connecticut, Maine, Massachusetts, New Hampshire, New Jersey, New York, Pennsylvania, Rhode Island, Vermont) |
1585 | 1451 | 1403 | 1391 | 1336 | 1333 | 1282 | 1219 | 1182 | 1187 |
Region 2, North Central (Illinois, Indiana, Iowa, Kansas, Michigan, Minnesota, Missouri, Nebraska, North Dakota, Ohio, South Dakota, Wisconsin) |
2050 | 1903 | 1835 | 1772 | 1734 | 1741 | 1727 | 1641 | 1600 | 1628 |
Region 3, South (Alabama, Arkansas, Delaware, District of Columbia, Florida, Georgia, Kentucky, Louisiana, Maryland, Mississippi, North Carolina, Oklahoma, South Carolina, Tennessee, Texas, Virginia, West Virginia) |
3359 | 3160 | 3116 | 3074 | 3004 | 3032 | 3004 | 2927 | 2883 | 3002 |
Region 4, West (Alaska, Arizona, California, Colorado, Hawaii, Idaho, Montana, Nevada, New Mexico, Oregon, Utah, Washington, Wyoming) |
1990 | 1846 | 1834 | 1808 | 1770 | 1736 | 1701 | 1676 | 1638 | 1698 |
Reside abroad | 22 | 18 | 14 | 17 | 27 | 23 | 38 | 35 | 39 | |
Data missing | 4 | 2 | 21 | 21 | 27 | 17 | 1 | 5 | ||
Table is based on CV_CENSUS_REGION. |
Census Division | Number of Respondents by Round | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
R11 | R12 | R13 | R14 | R15 | R16 | R17 | R18 | R19 | R20 | |
Region 1, Northeast (Connecticut, Maine, Massachusetts, New Hampshire, New Jersey, New York, Pennsylvania, Rhode Island, Vermont) |
1131 | 1162 | 1160 | 1157 | 1140 | 1079 | 1089 | 1023 | 1042 | 996 |
Region 2, North Central (Illinois, Indiana, Iowa, Kansas, Michigan, Minnesota, Missouri, Nebraska, North Dakota, Ohio, South Dakota, Wisconsin) |
1581 | 1564 | 1576 | 1546 | 1515 | 1487 | 1440 | 1368 | 1409 | 1362 |
Region 3, South (Alabama, Arkansas, Delaware, District of Columbia, Florida, Georgia, Kentucky, Louisiana, Maryland, Mississippi, North Carolina, Oklahoma, South Carolina, Tennessee, Texas, Virginia, West Virginia) |
2975 | 3019 | 3061 | 3035 | 3020 | 2913 | 2901 | 2770 | 2869 | 2815 |
Region 4, West (Alaska, Arizona, California, Colorado, Hawaii, Idaho, Montana, Nevada, New Mexico, Oregon, Utah, Washington, Wyoming) |
1676 | 1688 | 1713 | 1684 | 1682 | 1594 | 1603 | 1499 | 1553 | 1452 |
Reside abroad | 47 | 51 | 49 | 54 | 66 | 65 | 68 | 69 | 72 | 74 |
Data missing | 8 | 6 | 3 | 3 | 2 | 5 | 2 | 14 | ||
Table is based on CV_CENSUS_REGION. |
In addition, variables were created in round 1 to describe the respondent's residence as of his or her 12th birthday; another counts the number of residences in which the respondent has lived from his or her 12th birthday until the survey date (all rounds). See Youth Residential History for more details on these variables.
Note: Incarcerated respondents will have a -4 (valid skip) on CV_MIGRATE and a -3 on CV_DISTANCE_MOM/DAD while they are incarcerated as well as the interviews immediately before and after the incarceration spell.
Geocode Data File
The NLSY97 Geocode Data File provides a variety of statistics for the counties where respondents lived at each interview date. From rounds 1-5, most of these data are based on the 1994 edition of the U.S. Census Bureau's County and City Data Book (the most recent edition available). The first group of geocode variables lists basic demographic information for respondents' counties. These data include land area in square miles; population by race, age, and gender; and birth and death rates. Another variable reports the percent of persons in that county who lived in a different house and/or state in 1990 compared to their residence in 1985, providing information about migration rates for the respondent's area. Also included are the FIPS (Federal Information Processing Standards) codes for state and county (see Attachment 100 of the NLSY97 Geocode Codebook Supplement).
Factors that might influence the respondent's education and employment outcomes are the focus of several other geocode variables. These provide the number of serious crimes, households with children, female householders with no spouse present, persons with high school or college degrees, and families below the poverty level. A pair of variables summarizes medical availability for each county, reporting the number of active nonfederal physicians and community hospital beds.
Geocode variables associated with economic and labor force issues include the size of the county's civilian labor force, the percent employed in various industries, and the percent of workers age 16 and older with jobs outside their county of residence. Income variables include per capita money income for the respondent's county, per capita personal income, and median family money income. County and City Data Book information is not included in the geocode data set for rounds 6 and later. Information on obtaining the most recent County and City Data Book data is available at https://www.census.gov/library/publications/2010/compendia/databooks/ccdb07.html. The unemployment rates for the respondent's metropolitan areas or for the portions of the state not in an MSA (for non-MSA respondents) are calculated from other sources for all rounds.
Other geocode variables involve colleges attended by respondents. Survey staff use information from the Integrated Postsecondary Education Data Systems (IPEDS) to provide users with a code identifying each college attended by the respondent and its location. These codes can be used to associate the NLSY97 respondent's college with various characteristics of the institution contained in the IPEDS database. More information is located in Attachment 102 of the NLSY97 Geocode Codebook Supplement.
In addition, to support research on respondent mobility, survey staff created series of geocode variables for the distance between respondent addresses at each interview round. The data do not provide a location for the respondent's residence; rather, these variables provide distances between the various places the respondent has lived. Staff also created indicators of the quality of the geographic data. More information is located in Attachment 103 of the NLSY97 Geocode Codebook Supplement.
Due to the detailed nature of the information found on the Geocode Data File, use of this data set is restricted to those meeting confidentiality requirements. The geocode data set is released only to those who satisfactorily complete the Bureau of Labor Statistics geocode agreement procedure. The application for this geocode agreement is now available online at www.bls.gov/nls/geocodeapp.htm. Researchers interested in the geocode data can view a complete set of codebook pages in Attachment 104 of the NLSY97 Geocode Codebook Supplement. These codebook pages show frequency distributions and answer categories for the restricted-use geocode variables and may help users to determine whether the NLSY97 geocode data and sample sizes will be appropriate for their research interests.
Note: Incarcerated respondents will have a -4 (valid skip) or -3 (invalid skip) on many restricted geocode variables due to increased privacy concerns for these individuals. Details are provided in the NLSY97 Geocode Codebook Supplement. The majority of respondents still have childhood information available.
Comparison to Other NLS Surveys | Data on the respondent's area of residence are available for all cohorts. For the NLSY79, information on the respondent's region of residence and geographic mobility is available to all users on the main public data file; more detailed information (e.g., state and county of residence) is available on the restricted-use Geocode Data File. Data on NLSY79 Children are available through the mother's record. Region of residence and geographic mobility of Original Cohort respondents are provided for most survey years. For more complete information, refer to the appropriate cohort's User's Guide. |
---|---|
Main Area of Interest | Geographic Indicators |