1/39
Vocabulary flashcards for key statistics terms from Pages 1–2 notes.
Name | Mastery | Learn | Test | Matching | Spaced |
---|
No study sessions yet.
Statistics
The science and art of collecting, analyzing, and drawing conclusions from data.
Individual
An object described in a data set. (People, Animals, and Things)
Variable
An attribute that can take different values for different individuals.
Categorical variable
Assigns labels that place each individual into a particular group, called a category.
Quantitative variable
Takes number values that are quantities—counts or measurements.
Discrete variable
A quantitative variable that takes a fixed set of possible values with gaps between them.
Continuous variable
A quantitative variable that can take any value in an interval on the number line.
Distribution
Of a variable, tells us what values the variable takes and how often it takes those values.
Frequency table
Shows the number of individuals having each value.
Relative frequency table
Shows the proportion or percent of individuals having each value.
Bar graph
Shows each category as a bar. The heights of the bars show the category frequencies or relative frequencies.
Pie chart
Shows each category as a slice of the 'pie.' The areas of the slices are proportional to the category frequencies or relative frequencies.
Two-way table
A table of counts that summarizes data on the relationship between two categorical variables for some group of individuals.
Marginal relative frequency
Gives the percent or proportion of individuals that have a specific value for one categorical variable.
Joint relative frequency
Gives the percent or proportion of individuals that have a specific value for one categorical variable and a specific value for another categorical variable.
Conditional relative frequency
Gives the percent or proportion of individuals that have a specific value for one categorical variable among individuals who share the same value of another categorical variable (the condition).
Side-by-side bar graph
Displays the distribution of a categorical variable for each value of another categorical variable. The bars are grouped together based on the values of one of the categorical variables and placed side by side.
Segmented bar graph
A segmented bar graph shows the distribution of a categorical variable as segments of a rectangle, with the area of each segment proportional to the percent of individuals in the corresponding category.
Mosaic plot
A mosaic plot is a modified segmented bar graph in which the width of each rectangle is proportional to the number of individuals in the corresponding category.
Association
There is an association between two variables if knowing the value of one variable helps us predict the value of the other. If knowing the value of one variable does not help us predict the value of the other, then there is no association between the variables.
Dotplot
Shows each data value as a dot above its location on a number line.
Symmetric distribution
A distribution is roughly symmetric if the right side of the graph (containing the half of observations with the largest values) is approximately a mirror image of the left side.
Skewed distribution
A distribution is skewed to the right if the right side of the graph is much longer than the left side. A distribution is skewed to the left if the left side of the graph is much longer than the right side.
Stemplot
Shows each data value separated into two parts: a stem, which consists of all but the final digit, and a leaf, the final digit. The stems are ordered from lowest to highest and arranged in a vertical column. The leaves are arranged in increasing order out from the appropriate stems.
Histogram
Shows each interval of values as a bar. The heights of the bars show the frequencies or relative frequencies of values in each interval.
Mean
The mean of a distribution of quantitative data is the average of all the individual data values. To find the mean, add all the values and divide by the total number of data values.
Statistic
A number that describes some characteristic of a sample.
Parameter
A number that describes some characteristic of a population.
Resistant
A statistical measure is resistant if it isn't sensitive to extreme values.
Median
The midpoint of a distribution, the number such that about half the observations are smaller and about half are larger.
Range
The range of a distribution is the distance between the minimum value and the maximum value. That is, Range = Maximum - Minimum.
Standard deviation
Measures the typical distance of the values in a distribution from the mean. It is calculated by finding an average of the squared deviations and then taking the square root.
Variance
The average squared deviation.
Quartiles
The quartiles of a distribution divide the ordered data set into four groups having roughly the same number of values. To find the quartiles, arrange the data values from smallest to largest and find the median.
First quartile
The first quartile Q1 is the median of the data values that are to the left of the median in the ordered list.
Third quartile
The third quartile Q3 is the median of the data values that are to the right of the median in the ordered list.
Interquartile range
The distance between the first and third quartiles of a distribution. In symbols: IQR = Q3 - Q1.
Five-number summary
The five-number summary of a distribution of quantitative data consists of the minimum, the first quartile Q1, the median, the third quartile Q3, and the maximum.
Boxplot
A visual representation of the five-number summary.