A graphic display of a univariate distribution.
The most commonly used statistical measure of variation.
Documentation for a data file that usually contains the question wording and responses codes for each variable.
The numerical difference between an observed value and the value predicted by the regression line.
Shows whether the association in a contingency table is statistically significant.
The middle value in a distribution.
Detecting and resolving errors in coding and data entry.
Examples are Cramer’s phi and the correlation coefficient.
Graphic depiction of a bivariate distribution.
The value or category in a distribution with the highest frequency.
Indicates how much the dependent variable changes for every one-unit increase in the independent variable.
A cleaning technique that can be programmed for automatic detection in computer-assisted interviewing.