Errata for NLSY97 Round 12 Release

Errata for NLSY97 Round 12 Release

Important Information

The Investigator contains the most recent release of each NLS cohort. Corrections have been made to items noted below in the errata for the round 12 release. These older, corrected error notices are retained for archival purposes, such as enabling replication of research performed before the errors were discovered. Please contact NLS User Services for additional information.

Users are cautioned that we have discovered an error in the NLSY urban/rural residence variable. The NLSY97 urban/rural variables for rounds 8-11 will be corrected on the next public data release, scheduled for July 2010.

The error stems from a change in the Census Bureau's definition of an urban area. The 1990 Census criteria used in creating the NLSY urban/rural residence variable used the population of a place to determine the correct classification. People who lived in urbanized areas or places with a population of 2,500 or more were considered urban; everyone else was rural. The 2000 Census criteria changed the method of determining whether a particular point was urban or rural to one that relied on population density within an area. Areas of higher population density are called Urbanized Areas (UA) and Urban Clusters (UC). Residence in either is now considered urban.

From 2003 (the first year the new definition could be implemented), the NLS geocoders used a hybrid approach that considered respondents living in either an Urbanized Area (but NOT an Urban Cluster) or a place with a population of 2,500 or more to be urban. Otherwise the code is rural. A preliminary estimate of the differences between using this hybrid code and the 2000 Census definition indicates that 6% to 7% of respondents may be affected.


1) Data for the variable R11930.00 - SIDCODE - need to be corrected for the following eight cases:

PUBID Question Name
Value
4921 New YOUTH_SIBID01 4921
4922 New YOUTH_SIBID01 7473
6803 New YOUTH_SIBID01 7475
6804 New YOUTH_SIBID01 7475
8818 New YOUTH_SIBID01 7477
8819 New YOUTH_SIBID01 7477
8820 New YOUTH_SIBID01 7477
8821 New YOUTH_SIBID01 7477

The changes to this variable will be included in the next release of the NLSY97 data, which is scheduled for the summer of 2010.

2) The following variables have been unintentionally omitted from previous NLSY97 data releases:

      S3608600 "CHK SAVE CHILDID DLI NR CHILD L1 2003"
      S3608700 "CHK SAVE CHILDID DLI NR CHILD L2 2003"
      S3608800 "CHK SAVE CHILDID DLI NR CHILD L3 2003"
      S3608900 "CHK SAVE CHILDID DLI NR CHILD L4 2003"
      S3609000 "CHK SAVE CHILDID DLI NR CHILD L1 2004"
      S3609100 "CHK SAVE CHILDID DLI NR CHILD L2 2004"
      S3609200 "CHK SAVE CHILDID DLI NR CHILD L3 2004"
      S3609300 "CHK SAVE CHILDID DLI NR CHILD L1 2006"
      S3609400 "CHK SAVE CHILDID DLI NR CHILD L2 2006"
      S3609500 "CHK SAVE CHILDID DLI NR CHILD L3 2006"
      S3609600 "CHK SAVE CHILDID DLI NR CHILD L4 2006"
      S3609700 "CHK SAVE CHILDID DLI NR CHILD L5 2006"
      S3609800 "CHK SAVE CHILDID DLI NR CHILD L6 2006"
      S3609900 "CHK SAVE CHILDID DLI NR CHILD L7 2006"
      S3610000 "CHK SAVE CHILDID DLI NR CHILD L8 2006"
      S3610100 "CHK SAVE CHILDID DLI NR CHILD L1 2008"
      S3610200 "CHK SAVE CHILDID DLI NR CHILD L2 2008"
      S3610300 "CHK SAVE CHILDID DLI NR CHILD L3 2008"
      S3610400 "CHK SAVE CHILDID DLI NR CHILD L4 2008"
      S3610500 "CHK SAVE CHILDID DLI NR CHILD L5 2008"
      S3610600 "CHK SAVE CHILDID DLI NR CHILD L6 2008"
      S3610700 "CHK SAVE CHILDID DLI NR CHILD L7 2008"
      S3610800 "CHK SAVE CHILDID DLI NR CHILD L8 2008"

These variables contain identification numbers that can be used to identify the child that is being asked about in the non-resident child series of questions inside the fertility section. These variables will be included in all future NLSY97 data releases.

3) Employer Start And Stop Date Problem

Due to a small number of updates made to the YEMP_START_DATE.xx  ~M / ~Y variables, updates were made to the created variables (CV_), the collapsed created variables (CVC_), and the event history variables (EMP_).

In addition, a coding error caused the EMP_START_WEEK_xxx.yy, EMP_START_YEAR_xxx.yy, EMP_END_WEEK_xxx.yy, and EMP_END_YEAR_xxx.yy variables to be incorrectly coded as a valid skip (-4) rather than the correct value.

4) Asset variables from round 7 (2003)

Ten asset variables from round 7 (2003) related to educational debt were unintentionally omitted from the current release. These variables, T14122.00 to T14131.00 will be included in the next release.

Significant changes made between the July 2009 round 11 and the July 2010 round 12 data files:

1) Event history variable renumber
Due to an issue concerning the number and spacing of r–numbers attached to the created Event History variables, all event history variables were re–r–numbered according to a new system. All created Event History r–numbers now begin with an "E" followed by a string of 7 numbers. A crosswalk between the QNAME and the new r–number (beginning with an E) can be found in the qname_rnumber_crosswalk.xls file. The newly assigned r–numbers will be maintained from Round 12 forward.

2) Created child residence status - Round 2 and Round 8
New data led to updated information on the children of respondents. The affected variables and respondent Ids are the following:

Respondent 1555 (Round 2)
CV_BIO_CHILD_HH
CV_BIO_CHILD_NR

Respondent 8610 (Round 8)
CV_BIO_CHILD_HH
CV_BIO_CHILD_NR

3) Created marriage and cohabitation variables - Round 2 through Round 11
Due to additional information gathered during the Round 12 marriage section, a number of updates were made to created marriage and cohabitation variables. The variables, Round and number of affected respondents are below:

Round 2:
CV_MARSTAT – 1

Round 3:
CV_MARRIAGES_TTL – 1
CV_MARSTAT – 1
CV_MARSTAT_COLLAPSED – 1

Round 4:
CV_COHAB_TTL – 2
CV_FIRST_COHAB_DATE~M – 4
CV_FIRST_COHAB_DATE~Y – 3
CV_FIRST_COHAB_MONTH – 4
CV_FIRST_MARRY_DATE~M/~Y – 1
CV_FIRST_MARRY_MONTH – 1
CV_MARRIAGES_TTL – 3
CV_MARSTAT – 3
CV_MARSTAT_COLLAPSED – 3

Round 5:
CV_FIRST_MARRY_DATE~M/~Y – 1
CV_FIRST_MARRY_MONTH – 1
CV_MARRIAGES_TTL – 3
CV_MARSTAT – 3
CV_MARSTAT_COLLAPSED – 3

Round 6: CV_COHAB_TTL – 1
CV_FIRST_COHAB_DATE~M/~Y – 1
CV_FIRST_COHAB_MONTH – 1
CV_FIRST_MARRY_DATE~M/~Y – 1
CV_FIRST_MARRY_MONTH – 1
CV_MARRIAGES_TTL – 6
CV_MARSTAT – 6
CV_MARSTAT_COLLAPSED – 6

Round 7:
CV_COHAB_TTL – 2
CV_FIRST_COHAB_DATE~M/~Y – 2
CV_FIRST_COHAB_MONTH – 2
CV_MARRIAGES_TTL – 8
CV_MARSTAT – 8
CV_MARSTAT_COLLAPSED – 8

Round 8:
CV_COHAB_TTL – 2
CV_FIRST_COHAB_DATE~M/~Y – 3
CV_FIRST_COHAB_MONTH – 2
CV_MARRIAGES_TTL – 11
CV_MARSTAT – 11
CV_MARSTAT_COLLAPSED – 11

Round 9:
CV_COHAB_TTL – 5
CV_FIRST_COHAB_DATE~M/~Y – 6
CV_FIRST_COHAB_MONTH – 5
CV_MARRIAGES_TTL – 42
CV_MARSTAT – 40
CV_MARSTAT_COLLAPSED – 40

Round 10:
CV_COHAB_TTL – 9
CV_FIRST_COHAB_DATE~M/~Y – 9
CV_FIRST_COHAB_MONTH – 9
CV_FIRST_MARRY_DATE~M/~Y – 2
CV_FIRST_MARRY_MONTH – 2
CV_MARRIAGES_TTL – 55
CV_MARSTAT – 52
CV_MARSTAT_COLLAPSED – 52

Round 11:
CV_COHAB_TTL – 10
CV_FIRST_COHAB_DATE~M/~Y – 9
CV_FIRST_COHAB_MONTH – 9
CV_FIRST_MARRY_DATE~M – 10
CV_FIRST_MARRY_DATE~Y – 11
CV_FIRST_MARRY_MONTH – 10
CV_MARRIAGES_TTL – 44
CV_MARSTAT – 42
CV_MARSTAT_COLLAPSED – 42

4) Created education variables - Round 2 through Round 11
We conducted large scale checks on the consistencies among the created education variables and across different survey years. When inconsistencies were found, we incorporated the information collected from all Rounds to arrive at the created variables. These checks resulted in the hand edits for the following created variables. The variables, Round, and number of affected respondents are below:

Round 1:
CV_ENROLLSTAT – 4
CV_GED – 2
CV_HIGHEST_DEGREE_EVER – 3
CV_HS_DIPLOMA – 3

Round 2:
CV_GED – 5
CV_HS_DIPLOMA – 9

Round 3:
CV_ENROLLSTAT_EDT – 1
CV_GED – 13
CV_HIGHEST_DEGREE_9900 – 2
CV_HIGHEST_DEGREE_EVER_EDT –
CV_HS_DIPLOMA – 26

Round 4:
CV_AA_DEGREE – 1
CV_ENROLLSTAT_EDT – 4
CV_GED – 27
CV_HIGHEST_DEGREE_0001 – 3
CV_HIGHEST_DEGREE_EVER_EDT – 4
CV_HS_DIPLOMA – 51

Round 5:
CV_AA_DEGREE – 2
CV_ENROLLSTAT_EDT – 10
CV_GED – 41
CV_HIGHEST_DEGREE_0102 – 7
CV_HIGHEST_DEGREE_EVER_EDT – 9
CV_HS_DIPLOMA – 91

Round 6:
CV_AA_DEGREE – 2
CV_ENROLLSTAT_EDT – 14
CV_GED – 47
CV_HIGHEST_DEGREE_0203 – 14
CV_HIGHEST_DEGREE_EVER_EDT – 13
CV_HS_DIPLOMA – 147
CV_SCH_ATTEND_EVER – 3
CV_SCH_ATTEND_YR – 2

Round 7:

CV_AA_DEGREE – 4
CV_BA_DEGREE – 4
CV_ENROLLSTAT_EDT – 26
CV_GED – 45
CV_HIGHEST_DEGREE_0304 – 19
CV_HIGHEST_DEGREE_EVER_EDT – 21
CV_HS_DIPLOMA – 171
CV_SCH_ATTEND_EVER – 4
CV_SCH_ATTEND_YR – 4

Round 8:
CV_AA_DEGREE – 11
CV_BA_DEGREE – 15
CV_ENROLLSTAT_EDT – 33
CV_GED – 39
CV_HIGHEST_DEGREE_0405 – 32
CV_HIGHEST_DEGREE_EVER_EDT – 34
CV_HS_DIPLOMA – 222
CV_SCH_ATTEND_EVER – 4
CV_SCH_ATTEND_YR – 4

Round 9:
CV_ENROLLSTAT – 44
CV_HIGHEST_DEGREE_0506 – 49
CV_HIGHEST_DEGREE_EVER_EDT – 53
CV_SCH_ATTEND_EVER – 5
CV_SCH_ATTEND_YR – 5

Round 10:
CV_BA_CREDITS.01 – 1
CV_ENROLLSTAT – 12
CV_HIGHEST_DEGREE_0607 – 20
CV_HIGHEST_DEGREE_EVER_EDT – 22
CV_SCH_ATTEND_EVER – 4
CV_SCH_ATTEND_YR – 4

Round 11:
CV_BA_CREDITS.01 – 1
CV_ENROLLSTAT – 17
CV_HIGHEST_DEGREE_0708 – 14
CV_HIGHEST_DEGREE_EVER_EDT – 14
CV_SCH_ATTEND_EVER – 4
CV_SCH_ATTEND_YR – 4

5) Created hours worked per week variables - Round 4 through Round 11
A small error was found in the hours per week program in which the variable was incorrectly calculated using the hours from a prior spell of the job rather than the start date of the current spell. The variables, Round, and number of affected respondents that were corrected are below:

Round 4:
CV_HRS_PER_WEEK.01 – 5
CV_HRS_PER_WEEK.02 – 5
CV_HRS_PER_WEEK.03 – 5
CV_HRS_PER_WEEK.05 – 1

Round 5:
CV_HRS_PER_WEEK.01 – 5
CV_HRS_PER_WEEK.02 – 2
CV_HRS_PER_WEEK.03 – 4
CV_HRS_PER_WEEK.05 – 1

Round 6:
CV_HRS_PER_WEEK.01 – 4
CV_HRS_PER_WEEK.02 – 7
CV_HRS_PER_WEEK.03 – 3
CV_HRS_PER_WEEK.05 – 1

Round 7:
CV_HRS_PER_WEEK.01 – 6
CV_HRS_PER_WEEK.02 – 8
CV_HRS_PER_WEEK.03 – 6
CV_HRS_PER_WEEK.04 – 1
CV_HRS_PER_WEEK.06 – 1

Round 8:v CV_HRS_PER_WEEK.02 – 4
CV_HRS_PER_WEEK.03 – 2

Round 9:
CV_HRS_PER_WEEK.02 – 7
CV_HRS_PER_WEEK.04 – 1
CV_HRS_PER_WEEK.05 – 1

Round 10:
CV_HRS_PER_WEEK.01 – 1
CV_HRS_PER_WEEK.02 – 2
CV_HRS_PER_WEEK.03 – 4

Round 11: CV_HRS_PER_WEEK.01 – 2
CV_HRS_PER_WEEK.02 – 8
CV_HRS_PER_WEEK.03 – 2
CV_HRS_PER_WEEK.04 – 2

6) Created training variables - Round 4 through Round 11
A change in the code frame for training programs caused a small number of training programs to be missed in the created variable program. The program has been changed to include this item. The variables, Round, and number of affected respondents that were corrected are below:

Round 4:
CV_TRN_CERT – 14
CV_TRN_CERT_DATE~M,~Y – 1

Round 5:
CV_TRN_CERT – 13
CV_TRN_CERT_DATE~Y – 1

Round 6:
CV_TRN_CERT – 11

Round 7:
CV_TRN_CERT – 26
CV_TRN_CERT_DATE~M – 6
CV_TRN_CERT_DATE~Y – 4

Round 8:
CV_TRN_CERT – 34
CV_TRN_CERT_DATE~M – 8
CV_TRN_CERT_DATE~Y – 4

Round 9:
CV_TRN_CERT – 35
CV_TRN_CERT_DATE~M – 9
CV_TRN_CERT_DATE~Y – 5

Round 10:
CV_TRN_CERT – 60
CV_TRN_CERT_DATE~M – 22
CV_TRN_CERT_DATE~Y – 19

Round 11:
CV_TRN_CERT – 59
CV_TRN_CERT_DATE~M – 18
CV_TRN_CERT_DATE~Y – 17

7) Created residence variable - Round 11
In R11, a programming error caused 815 cases to assign a -3 (invalid skip) to the created residence variable (CV_TTL_RESIDENCES_2007) rather than a -4 (valid skip). This has been corrected.

8) Created income and poverty variables - Round 1
In R1, a programming error caused 1456 cases to assign a -4 (valid skip) to the created income variable (CV_INCOME_GROSS_YR) and the created poverty variable (CV_HH_POV_RATIO) rather than a -3 (invalid skip). This has been corrected.

9) Created parent distance variables- Round 11
A small number of Round 11 parent distance variables were recoded due to a correction in the program that identifies the parent in the household. The created distance between the respondent and the father (CV_DISTANCE_DAD_COL) was recoded for 43 respondents and between the respondent and the mother was recoded for 38 respondents.

10) Created MSA variable - Round 11
The created MSA variable (CV_MSA) was recoded for 138 Round 11 respondents due to a change in the way that MSA was created.

11) SIDCODE variable - Round 1
Missing values on the household id variable for eight respondents were filled in with valid data.

12) YOUTH_SIBID01.01 variable - Round 1
The sibling id variable was corrected for eight respondents.