|
|
Chapter 3 continued: Guide to the Mature Women Data Return to beginning of chapter
3.2 Types of VariablesFour types of variables are present in the Mature Women data files. The type of variable affects the title or variable description which names each variable and the physical placement of the variable within the codebook. Types of variables include:
Reference NumbersEvery variable within the main NLS data set has been assigned an identifying
number that determines its relative position within the data file and
documentation system. Persons contacting
NLS User Services should be prepared to discuss their question or problem in
relationship to the reference number(s) of the variable(s) in question. Reference numbers, once assigned, remain constant through subsequent revisions of the files. Reference numbers are assigned sequentially, with variables from the first survey year having a lower reference number than those variables specific to the second year, and so forth. Occasionally, variables are created sometime after the year in which the data were actually collected. These variables are frequently given a reference number that reflects the year in which the actual data were gathered rather than the year the created variable was constructed. Table 3.2.1 lists reference numbers for each survey year since 1967 for the Mature Women. Table 3.2.1 Mature Women Reference Numbers by Survey Year
Variable TitlesEvery variable within NLS main file data sets has been assigned an 80 character summary title that serves as the verbal representation of that variable throughout the hard copy and electronic documentation system. Variable titles are assigned by CHRR archivists who endeavor, within the limitations described below, to capture the core content of each variable and to incorporate within the title (1) common words that facilitate easy identification of comparable variables; (2) UNIVERSE IDENTIFIERS that specify the subset of respondents for which each variable is relevant; and (3) for some variables, REFERENCE PERIODS that indicate the period of time (e.g., survey year or calendar year) to which these data refer. Universe identifiers and reference periods are discussed below. Universe Identifiers: If two ostensibly identical variables differ only in that they refer to different universes, the variable title will include a reference to the applicable universe. Example 1: ‘Reason for Being OLF, 77 (Not Empld, Have Worked)’ ‘Reason for Being OLF, 77 (Not Empld, Not Worked)’ Reference Periods: Variable descriptions may include a phrase indicating the time period to which these data refer. The following general conventions apply: Survey Year: When the variable title includes either the phrase XX INT (82 INT) or the year (e.g., 67) without the year being preceded by the preposition “IN,” this indicates the survey year in which that variable was measured, not necessarily the year to which it applies. Example 2: ‘Move to Current Residence - Prior SMSA, 82 INT’ refers to a residential move occurring in the period before the 1982 interview. Example 3: ‘Number of Weeks Worked in Past Year, 67’ refers to the weeks worked in the 12-month period preceding the 1967 survey. Calendar Year: When a date follows a verbal description of a variable and is part of the prepositional phrase “in XX,” the date identifies the calendar year for which the relevant information was collected. The title in Example 4 refers to payments received in calendar year 1988 with data collected during the 1989 survey. Example 4: ‘Income from Social Security Payments Based on R’s Work Record in 88? 89.’
Flexibility in variable title assignment for raw data items is restricted by (1) the actual wording of the question as it appears within the survey instrument; (2) precedent, i.e., how that type of variable has been titled in previous survey years; and (3) the maximum allowable length for variable titles. An attempt is also made to include key phrases in variable titles so that large groups of variables with similar or related subject matter can be easily identified. Users should be careful not to assume that two variables with the same or similar titles necessarily have the same (1) universe of respondents or (2) coding categories or (3) time reference period. While the universe identifier and reference period conventions discussed above have been utilized, users are urged to consult the questionnaires for skip patterns and exact time periods for a given variable and to factor in the relevant fielding period(s). Variables with similar content (e.g., information on respondents’ labor force status) may have completely different titles, depending on the type of variable (raw versus created).
Finally, different archivists over a period of three decades have performed the task of assigning variable descriptions to data from the NLS cohorts. While every effort has been made to maintain consistency, users may find some differences in variable titles. Two primary sources of variation exist in Original Cohort variable title assignment. The first is systematic error in which identical questions may have the same question wording across the four Original Cohorts but slightly different variable titles. The rule before 1995 was to make title consistency within a cohort of highest priority. Starting in 1995, joint fielding forced the archivists to choose one title and cross-reference the other cohort’s title in the archivist notes. The second variation is attributed to random error due to spacing or punctuation errors. The sorting process that produces variable title listings usually places these variables near if not next to the series of interest. Identifying Mode of Interview
There are several different ways of identifying whether a survey is a personal or telephone interview. Users can (1) refer to Table 2.4.1 in the “Interview Schedule & Fielding Periods” section of Chapter 2, which depicts the type of interview by survey year, or (2) examine variable titles assigned to questions of similar content. Differences in what appear to be comparable variables reflect variations in the wording of the question or the fact that the reference period for an identically worded question may be different in a personal versus a telephone interview. Questions that refer to the last five years were usually found in a personal (or five-year) interview. This difference means that some questions were only asked in the five-year surveys and some were asked only in the telephone surveys. Users conducting longitudinal analysis need to change their variable creation procedures to account for the differences in data collection between the early years of uninterrupted personal interviews and subsequent survey years when telephone interviews were used. Starting with the 1989 survey, the collection pattern was altered again; a decision was made to conduct a personal interview every other year and collect data going back to the date of the last interview. However, the scheduled 1991 survey was delayed until 1992 due to the demands of the 1990 decennial census and the decision to interview the Young Women first, in 1991. The scheduled 1994 survey was then delayed until 1995 so that the women’s cohorts could be interviewed at the same time using the same CAPI/CATI instrument. Biennial surveys have been conducted in 1997, 1999, and 2001. When analyzing data, users should remember that not all surveys were conducted during the same season of each survey year. Responses to labor force status questions, for example, may differ significantly if fielding occurred during the summer versus winter months. See the discussion of fielding periods in chapter 2 of this guide for more information. Go
to next section of chapter Return
to top Return to beginning of chapter
|