Chapter 2:  Sample Design and Attrition

Return to Table of Contents



2.1 Sample Design

The original NLS of Young Women sample was designed to represent the civilian noninstitutionalized population of the United States ages 14–24 as of December 31, 1967, at the time of the initial survey.  The cohort is represented by a multi-stage probability sample drawn by the Census Bureau from 1,900 primary sampling units (PSUs) that had originally been selected from the nation’s counties and cities for the experimental Monthly Labor Survey conducted between early 1964 and late 1966.  A primary sampling unit consists of Standard Metropolitan Statistical Areas (SMSAs), counties (or parishes in some states), parts of counties, and independent cities.  A total of 235 sample areas, comprising 485 counties and independent cities, were chosen to represent every state and the District of Columbia.  From the sample areas, 235 strata were created of one or more PSUs that were relatively homogeneous according to socioeconomic characteristics.  Within each stratum, a single PSU was selected to represent the stratum.  Finally, within each PSU, a probability sample of housing units was selected to represent the civilian noninstitutionalized population.  Because the addresses for the sample frame came from the 1960 Census, respondents are covered by Title 13 confidentiality restrictions.  Therefore, variables that link respondents to PSUs are not available to public users, making it impossible to identify respondents by city or state.

Return to top 


2.2 Screening Process

As dictated by the above requirements, the initial sample of about 42,000 housing units for all four NLS Original Cohorts was selected and screening interviews took place in March and April of 1966.  Of this number, about 7,500 units were found to be either vacant, occupied by persons whose usual residence was elsewhere, changed from residential use, or demolished.  On the other hand, about 900 additional units were found created within existing living space or changed from what had been nonresidential space.  A total of 35,360 housing units were available for interview, from which usable information was collected for 34,662 households, for a completion rate of 98.0 percent.

The original plan called for using the initial screening to select all four NLS Original Cohorts.  However, after the sample members for the Older Men were chosen, the sample was rescreened in September 1966 before the initial interview of the Young Men.  This decision was made because a seven-month delay between the screening and first interview seemed inordinate due to the mobility of Young Men in their late teens and early twenties.  To increase efficiency, it was decided to stratify the sample for the rescreening by the presence or absence of a 14- to 24-year-old male in the household.  The probability was high that a household that contained a 14- to 24-year-old in March would also have such a member in September.  However, to insure that the sample also represented persons who had moved into sample households in the intervening period, a sample of addresses that previously had no 14- to 24-year-old males was also included in the rescreening operation.  Since a telephone number had been recorded for most households at the time of the initial screening interview, every attempt was made to complete the short screening interview by telephone.  The sample of households from the initial screening, supplemented with information from the rescreening, was subsequently used to obtain the two samples of women ages 30–44 and 14–24 for the Mature Women and Young Women cohorts (Parnes et al. 1970; Shea et al. 1971). 

User Notes:  During the screening process a large number of multiple respondent households were designated for interview; more than half of respondents in the Mature Women, Young Women, and Young Men cohorts and one-third of respondents in the Older Men cohort originated from multiple respondent households (i.e., a household with at least one other NLS respondent).  For more information on multiple respondent households and on the types of relationships that existed between respondent pairs (e.g., spouse, sibling, etc.), see the “Household Composition” section of this guide.

Return to top 


2.3 Sampling Process

Following the initial household interview and rescreening operation, 5,533 women ages 14-24 as of December 31, 1967, were designated to be interviewed for the Young Women cohort.  The sample was designed to provide approximately 5,000 respondents—about 1,500 nonwhites and 3,500 whites.  The women were sampled differentially within four strata:  whites in predominantly white enumeration districts (EDs), non-whites in predominantly non-white EDs, whites in predominantly non-white EDs, and non-whites in predominantly white EDs.  An enumeration district is a geographical area considered to be an appropriate size for an interviewer to complete all necessary interviews within a prescribed time frame.  To provide separate reliable statistics for black respondents, the sample design called for oversampling of blacks at twice the expected rate in the total population.  The sampling rate of households in predominantly non-white EDs was between three and four times that for households in predominantly white EDs in order to meet this survey requirement.  During the first survey in 1968, 5,159 (93.2 percent) of the designated women were interviewed. 

Return to top 


2.4  Interview Schedule & Fielding Periods

In the initial survey plan, respondents from each of the four Original Cohorts were to be interviewed yearly over a five-year period.  However, due to cost considerations, it was decided after the second survey of the Older Men to survey the two older groups (Older Men and Mature Women) biennially rather than annually.  Due to their greater mobility, the Young Women and Young Men were interviewed annually.  A decision was made at the end of the first five-year period to continue the interviews for another five years because of the usefulness of these data and the relatively small sample attrition.  At this point, the interviewing pattern changed from a yearly personal interview to a 2-2-1 schedule; each respondent was contacted by phone approximately every two years, then again in person one year after the second phone interview.  The 2-2-1 schedule was continued through 1988, when the decision was made to conduct a personal interview every other year.  However, the implementation of the biennial schedule was interrupted by the 1990 decennial Census.  The scheduled 1990 Young Women survey was pushed back to 1991.  Table 2.4.1 depicts the years in which the cohort was surveyed, the fielding period, the percent of the cohort interviewed, and the type of interview utilized.  Users should note that, in years during which the survey was conducted in person, some interviews were administered by telephone when the field representative decided this alternative method was necessary.

Table 2.4.1  Sample Sizes, Retention Rates, and Fielding Periods

Year

Primary Type of interview 1

Fielding period

Total interviewed

Retention rate2

Retention rate, living sample members only3

1968

Personal

January-March

5159

100.0%

100.0%

1969

Personal

January-March

4930

95.6

95.6

1970

Personal

January-March

4766

92.4

92.5

1971

Personal

January-March

4714

91.4

91.6

1972

Personal

January-March

4625

89.6

90.0

1973

Personal

January-March

4424

85.8

86.1

1975

Telephone

January-March

4243

82.2

82.7

1977

Telephone

January-March

4108

79.6

80.1

1978

Personal

January-March

3902

75.6

76.1

1980

Telephone

January-March

3801

73.7

74.2

1982

Telephone

January-March

3650

70.8

71.4

1983

Personal

January-March

3547

68.8

69.4

1985

Telephone

January-March

3720

72.1

72.9

1987

Telephone

January-March

3639

70.5

71.5

1988

Personal

May-July

3508

68.0

69.0

1991

Personal

May-August

3400

65.9

67.1

1993

Personal

September-November

3187

61.8

63.1

1995

Personal

June-September

3019

58.5

60.0

1997

Personal

June-September

3049

59.1

61.1

1999

Personal

June-August

2900

56.2

58.4

2001 Personal June-August 2806 54.4 57.0
 
1  Even in years during which the survey was conducted in person, some interviews were administered by telephone when an interviewer determined that method to be the most appropriate for completing a case.
2  Retention rate is defined as the percent of base-year sample members who were interviewed in any given survey year.  Included in the calculations are deceased and institutionalized sample members, as well as those serving in the military.  The base-year number of sample members used in these calculations includes those who in later years died, became institutionalized, or joined the military. However, the number of sample members used in the calculations in each subsequent survey year does not include those who are deceased, institutionalized, or in the military.
3  This retention rate excludes sample members known to be deceased in each survey year.  This rate may be underestimated, as it is likely that some sample members classified as “refused” or “unable to locate” are actually deceased. The number of sample members used in these calculations does not include those who are deceased.

 

Table 2.4.2  Young Women Interview Schedule

Year

Personal

Telephone

Not available

Interviewed

Not interviewed

1975

887

20.9%

3,329

78.4%

27

0.6%

4,243

916

17.8%

1977

660

16.1

3,434

83.6

14

0.3

4,108

1,051

20.4

1980

333

8.8

3,468

91.2

0

3,801

1,358

26.3

1982

376

10.3

3,273

89.7

1

1

3,650

1,509

29.2

1985

369

9.9

3,343

89.9

8

0.2

3,720

1,439

27.9

1987

344

9.5

3,273

89.9

22

0.6

3,639

1,520

29.5

1991

2,502

73.6

860

25.3

38

1.1

3,400

1,759

34.1

1993

2,071

65.0

1,082

34.0

34

1.1

3,187

1,972

38.2

1995

2,144

71.0

844

28.0

31

1.0

3,019

2,140

41.5

1999

1,505

51.9

1,238

42.7

157

5.4

2,900

2,259

43.8

2001

1,388

49.5

1,319

47.0

99

3.5

2,806

2,353

45.6

1 Less than 0.05%.

 

User Notes: Although each of the personal interviews contains data of roughly the same degree of completeness, data gathered during the telephone interviews were not meant to update the longitudinal record of a respondent.  Rather, the telephone interviews were intended to obtain a brief update of information on each respondent and to maintain sufficient contact so that the lengthier personal interview could be completed.  The combination of fluctuating fielding periods and type of interview (i.e., personal or phone) may affect not only the probability of reinterview but also the reference periods of time-related questions. 

There is another source of inconsistency with respect to time references.  A given year’s survey instrument may use the previous calendar year as a reference period for some questions, while other questions will collect data for the year since last interview.  Income data, for example, may be collected for the calendar year, corresponding to the time frame for a respondent’s tax records; employment data are usually collected for the period since the last interview.

 


 

Go to next section of chapter    Return to top    Return to Table of Contents