Appendix A: Quick Reference Guide

This appendix provides helpful hints on how to most effectively use the NLSY79 User's Guide, access the NLSY79 data, and get additional help when necessary.

Return to Table of Contents


A.1 NLS Documentation

The NLSY79 User’s Guide provides in-depth information about the NLSY79 data. For more general information about the history, administration, or other cohorts of the NLS project, or more technical information about topics such as sampling, users should consult Table A.1.1 for the most appropriate NLS document.

Table A.1.1 List of NLS Documentation

NLS Internet Bibliography
http://www.nlsbibliography.org

This online searchable database provides citations for research using NLS data.

User's Guides

These cohort-specific guides help researchers understand NLS variables, survey instruments, documentation techniques, and other technical issues.

Questionnaires

A complete set of survey instruments from each survey year that allows researchers to view questions, supplemental information, and household interview forms. For links to documentation go to http://www.nlsinfo.org/web-investigator/docs.php.

Flowcharts

Schematic diagrams depict universe information and skip patterns for the survey instrument (1979-1992).

Codebook Supplements

Supplementary attachments and appendices contain variable creation, description, and coding information not present in the questionnaire.

Technical Sampling Report

This manual and addenda describe the sampling procedure used to select the initial survey participants, including information about weights and standard errors.

Compact Disc User's Guide

This guide provides installation and usage instructions for the CDs.

NLS Web Homepage
http://www.bls.gov/nls

This internet site offers an overview of the NLS programs.

NLS News
http://www.bls.gov/nls/nlsnews.htm

NLS News, a quarterly newsletter, contains informative articles about the NLS data, announces the release of new data files, and reports any errors discovered in the data or documentation. Past issues of the NLS News are available on the NLS internet site.

Contact information for NLS User Services is provided under "Additional Support" at the end of this chapter.

Return to top


A.2 NLS Investigator

NLSY79 variables can be accessed using NLS Investigator, which is available as a Web application or in a software package that resides on the user’s computer.  This section will focus on the Web application, called NLS Web Investigator.  Database Investigator, the software that accompanies data file downloads and data CDs, is similar in function although not identical in appearance.  Users who need more information on using the Database Investigator software are encouraged to consult the help files that accompany the software.

The main application of NLS Web Investigator is to access NLS variables for the purposes of identifying, selecting, extracting, and/or running frequencies or cross-tabulations.  The interface allows the researcher to connect to a database and perform variable extractions without installing any software on a local computer.  Through a personal online account, a researcher’s selected variable tag sets, frequencies, and extracts will be available for a specified period of time from any computer location with Web access.  Because there will be one central data source for all users, researchers will have the assurance that they are always working with the most up-to-date data, and that any necessary corrections will be immediate and universal.

Topics Covered

The NLSY79 survey collects information every year (or every two years, beginning with the 1994 survey) about family, income, work experience, education, marriage, lifestyle, military experience, and fertility. Occasionally, other topics are covered, including drug/alcohol use and sexual activity. Figure A.2.1 lists general topic areas covered by the data collection; more detailed discussions of specific topic areas are provided in chapter 4.

Figure A.2.1 General Topics of the NLSY79 Main Data Collection

Work & Non-Work Experiences

Attitudes & Aspirations

Training Investments

Geographic Residence Information

Schooling, School Records & Aptitude Information

Family Background & Demographic Characteristics

Military Experience

Household Composition

Income & Assets

Marital & Fertility Histories

Health Conditions, Injuries &Insurance Coverage

Child Care

Alcohol & Substance Use, Criminal Activities

 

Organization of Data

The NLSY79 is a panel data collection—specifically, the same persons are interviewed in multiple survey years. The NLSY79 often asks the same questions in each interview to gauge the change in behavior and life circumstances over time. Therefore, respondents have often answered the same or similar questions at numerous interview points since 1979. The database is organized by respondent. Users are provided with data extraction software on each CD to search this large database and extract the specific variables needed.

User Notes: Each respondent has a unique identification number (see variable R00001) ranging from 1 to 12686. This unique number was assigned to each respondent in 1979 and does not change with each survey round. It is good practice to include the identification code in every data extraction.

NLS Investigator Search

A basic NLS Web Investigator data search might include the following steps:

  1. select the desired NLS cohort
  2. choose filters to narrow the scope of that cohort's variables
  3. create a tag set of desired variables
  4. run extracts and/or frequency tables for tagged variables

In this case, the NLSY97 cohort was selected in step 1 (see Figure A.2.2).  After selecting a specific NLS cohort, users can peruse the data set and choose the variables they need through several search index options (described in the next section).  Utilizing drop-down menus, researchers can search indexes for Variable Title, Area of Interest, Survey Year, Reference Number, and Question Name.  If preferred, users can also search for any word in the variable title, question name, or question text.  In addition, the search indexes include a “not” option to exclude particular variables from a search.

Figure A.2.2  NLS Web Investigator Search Screen

During this search process, filters can be used to focus the search to specific variables of interest.  Researchers can use more that one search index at a time; for example, a user could search for variables in the Child Care area of interest for the 2002 survey year (round 6) only.  Filters can also be layered, so that more than one search term can be used in a single index; for example, a researcher could first use the Word in Title index to find variables containing the word “parent” and then fine-tune the variable list by saving the “parent” filter and searching for words like “contact” (see Figure A.2.3.) or “characteristics.”

Figure A.2.3  Using Filters in NLS Web Investigator

As users identify variables of interest, they can mark or save these variables in a “tag set,” essentially a saved list of variables of interest.  Users can save tag sets in their accounts on the central database server for up to 90 days and access the saved tag sets from any computer; tag sets can also be saved on the researcher’s local machine for as long as desired.

Once a tag set of variables is formed, researchers can run simple statistics such as frequencies and cross-tabulations (with or without round-specific weights) via the Web Investigator.  If more detailed data analysis is required, researchers can produce a data extract file with SAS or SPSS statements or a Stata dictionary for use on a local computer with the user’s own statistical software.  Results are received quickly, usually in less than a minute.  Users can save the frequency, table, and extract files to their local computer or access the files from their personal NLS Web Investigator accounts.  Extracts and tables are saved for 4 days.

NLSY79 Search Indexes

Several different search indexes are available within NLS Investigator for finding and selecting variables of interest.  These indexes are described in greater detail below.

Word in Title. All words, numbers, and symbols found in any variable title form an index in the data set.  The “Word in Title” search function allows the user to search this index and select NLSY79 variables whose titles contain any single word or combination of words. 

User Notes: Word in Title searches for NLSY79 variables are limited by the choice of variable titles.  Flexibility in variable title assignment for raw data items is restricted by the wording of the question as it appears in the survey instrument and the maximum allowable length for variable titles.  Users may need to try several different abbreviations.

Area of interest. NLSY79 data files are organized so that variables sharing a common factor are stored in unique groupings called “areas of interest.”  Users can browse through a given area and examine the variables associated with that topic. 

Survey year. This index lists variables by survey round. Because each round includes thousands of variables, this index is best used in combination with another type of search.

Reference number. Researchers can use this index to look up a specific variable by reference number.  In general, this is most useful when the researcher needs to double check a known variable; simply browsing by reference number is not recommended.

Question name. This index lists all variables using the question name (that is, question number) from the questionnaire.  This will facilitate finding the codebook page for an item of interest seen in the questionnaire or finding the same item across survey rounds.  Note that, whenever possible, survey staff keep question names constant when the same item appears in multiple rounds, but occasionally changes must be made.  If an item is not found by using the Question Name index, users may wish to double check using a different search function rather than assuming the item does not exist.

Additional Functions

NLS Web Investigator offers two key additional features to help researchers better understand and use NLSY79 data.

Custom weights.  The custom weighting program option that helps users to create a custom set of survey weights, which improves a researchers’ ability to accurately calculate summary statistics from multiple years of data.

Supplementary documentation.  In addition to a general user’s manual, NLS Web Investigator supplies user’s guides, questionnaires and other documentation for each cohort.  Items available online for the NLSY79 include an HTML version of this user’s guide, questionnaires from each round of the survey, main file and geocode codebook supplements for each round, and the technical sampling report, which provides statistical information about the selection of the initial NLSY79 sample.  Questionnaires are provided in various formats, depending on the technology used to produce them; for the first several rounds the questionnaires are in Word or PDF format, while more recent rounds have fully linked HTML questionnaires that permit researchers to follow various question paths through the instrument.  Similarly, codebook supplements from the first few survey rounds are PDF files, while more recent rounds are in HTML format.

Return to top


A.3 How to Get Help

Sometimes users have questions about the NLSY79 data, database retrieval software, or documentation. The following are strategies for finding answers to these questions.

On-line and Paper Documentation Help: CHRR provides both online and paper documentation help sources for users. The online help is present in the database retrieval software program. Press the F1 function key at any time in the NLS Web Investigator to receive online help.

Paper documentation includes the NLSY79 User's Guide, the NLS Handbook, the Compact Disc User's Guide, and assorted supplemental hard copy documentation (see Table A.1.1). The NLSY79 paper documentation contains the answers to most questions. Researchers should note that Appendix E in this guide defines a number of commonly used NLSY79 terms. If the problem is confusion over the use of a specific term, this appendix is the best source of information. The User's Guides and other publications are available online at http://www.bls.gov/nls/.

Frequently Asked Questions: Table A.3.1 provides answers to commonly asked questions about the NLSY79 data and accessing the NLSY79 data.

Additional Support: If questions arise which are not answered in the documentation, contact NLS User Services:  usersvc@chrr.osu.edu


Table A.3.1 Frequently Asked Questions

Question

Answer

There are many variables that indicate the respondent's race and sex. Which are the correct ones to use?

For respondent's race and sex use R02147. and R02148. respectively. For more information, see the "Race, Ethnicity & Nationality" and "Gender" sections of this guide.

How does one calculate the respondent's age at interview?

Respondent's age at interview has already been calculated for each year and is in area of interest "Key Variables." Area of interest "Key Variables" has many created variables that the researcher will find useful.

Within the data, how are the respondents identified?

See variable R00001., Identification Code. Each respondent has a unique identification number ranging from 1 to 12686. It is good practice to include the identification code in every data extraction.

The highest grade completed variables have too many valid skips to be useful. Is there something wrong?

Questions on highest grade completed are asked only of respondents who have been enrolled since the last interview. This explains the large number of valid skips. However, CHRR provides created cumulative highest grade completed variables correct for all interviewed respondents. See, for example, 'Highest Grade Completed as of May Survey Year (Revised)' in area of interest "Key Variables."

Is there an income measure for the respondent's family?

Yes, see variables with the title 'Total Net Family Income' in area of interest "Key Variables." Consult the "Income" section of this guide for additional information on the creation of this variable.

What are areas of interest?

All variables have been assigned to at least one of several areas of interest or groups of variables sharing a common factor. For example, income variables are in area of interest "Income," and marital history is in "Marriage." One way to search the data is by area of interest.

There seem to be several variables in the same year that have the same or nearly the same title but the frequencies are different.

This would mean that the same question has been asked to different universes of respondents depending on the skip patterns of the questions. The questionnaires provide the skip patterns to tell you the universe of respondents who answered each question.

The codebook for the industry codes says to "see attachment 3." What is attachment 3?

Attachment 3 lists the individual industry or occupation codes that are too numerous to list in the codebook. Attachment 3 is part of the Codebook Supplement, which includes most of the attachments and appendices that are listed in the codebook.

What's the difference between reference numbers and question numbers?

Reference numbers are unique variable names that have been assigned sequentially beginning with R00001. through to the end. Question numbers are unique only to each particular year, and they are used to find the question within each year's questionnaire.


Return to top Return to Table of Contents