Skip Navigation
small header image
The Condition of Education Indicator List Site Map Back to Home
User's Guide


- Data Sources and
Estimates

Standard Errors

Data Analysis and Interpretation

Variation in Populations

Rounding and Other Considerations
Technical Guide

Data Sources and Estimates

The data in this report were obtained from many different sources—including students and teachers, state education agencies, local schools, and colleges and universities— using surveys and compilations of administrative records. Users of The Condition of Education should be cautious when comparing data from different sources. Differences in aspects such as procedures, timing, question phrasing, and interviewer training can affect the comparability of results across data sources.

Most indicators in The Condition of Education summarize data from surveys conducted by NCES or by the Census Bureau with support from NCES. Brief explanations of the major NCES surveys used in this edition of The Condition of Education can be found in supplemental notes 3 and 4 of this volume. More detailed explanations can be obtained on the NCES website under "Surveys and Programs." Information about the Current Population Survey (CPS), another frequent source of survey data used in The Condition of Education, can be found in supplemental note 2 as well as at http://www.census.gov/cps/.

Data for indicators reported in this volume are obtained primarily from two types of surveys: universe surveys and sample surveys. Some indicators report data taken from entire populations (universe surveys), such as indicator 36 (Public School Expenditures by District Poverty). With this type of survey, information is collected from every member of the population. For example, data for indicator 36 were obtained for each school district in the United States. When data from an entire population are available, estimates of the total population or a subpopulation are made by simply summing the units in the population or subpopulation. A universe survey is usually expensive and time consuming, so researchers often opt to collect data from a sample of the population of interest (sample survey). Other indicators report data from sample surveys, such as indicator 12 (Reading Performance and Achievement Gaps). Indicator 12 reports information from the National Assessment of Educational Progress (NAEP), which assesses a representative sample of students rather than the entire population of students. When a sample survey is used, statistical uncertainty is introduced because data come from only a portion of the entire population. This statistical uncertainty must be considered when reporting estimates and making comparisons.

Various types of estimates derived from universe and sample surveys are reported in The Condition of Education. Many indicators report the size of a population or a subpopulation, and often the size of a subpopulation is expressed as a percentage of the total population. In addition, the average (or mean) values of some characteristic of the population or subpopulation may be reported. The average is obtained by summing the values for all members of the population and dividing the sum by the size of the population. An example is the annual average salaries of full-time instructional faculty at degree-granting institutions (indicator 43). Another population measure that is sometimes used is the median. The median is the value of a population characteristic at or above which 50 percent of the population is estimated to fall and at or below which 50 percent of the population is estimated to fall. An example is the median annual earnings of young adults who are full-time, full-year wage and salary workers (indicator 17).

Estimates based on universe and sample survey data may be affected by a wide range of potential data collection errors, such as coverage errors, response errors, data coding errors, and data entry errors. Estimates of the size of these types of errors are typically not available.

Using estimates calculated from data based on a sample of the population requires consideration of several factors before the estimates become meaningful. However conscientious an organization may be in collecting data from a sample of a population, some margin of error will always be present in estimations of the size of the actual total population or subpopulation because the data are available from only a portion of the total population. Consequently, data from samples can provide only an approximation of the true or actual value. The margin of error, or the range, of an estimate depends on several factors, including the amount of variation in the responses, the size and representativeness of the sample, and the size of the subgroup for which the estimate is computed. The magnitude of this margin of error is measured by what statisticians call the "standard error" of an estimate.

1990 K Street, NW
Washington, DC 20006, USA
Phone: (202) 502-7300 (map)