AP Statistics Comprehensive Review

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/35

Earn XP

Description and Tags

Vocabulary flashcards covering Units 1 through 9 of the Statistics curriculum, including data distributions, regression, sampling, probability, and inference.

Last updated 2:50 PM on 4/30/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

36 Terms

New cards

Standard deviation

The measure from center used when a distribution is symmetric; it is non-resistant and greatly affected by outliers.

New cards

Joint relative frequency

Calculated as the value in a specific cell divided by the total.

New cards

Conditional relative frequency

Calculated as the value in a specific cell divided by the row total.

New cards

Marginal relative frequency

Calculated as the row total divided by the grand total.

New cards

Bimodal

A characteristic of a graph that has two separate peaks.

New cards

Median ( $Q2$ )

The middle number of a data set when numbers are lined up from least to greatest.

New cards

Five number summary

The first step to create a box plot; includes the minimum, $q1$ , median, $q3$ , and maximum.

New cards

Interquartile range ( $IQR$ )

Shows where 50% of the data set lies, calculated as $q3 - q1$ .

New cards

SOCCS

An acronym for describing distributions: Shape, Outliers, Center, Context, and Spread.

New cards

Outlier rule (Skewed)

Any value less than $q1 - 1.5(IQR)$ or greater than $q3 + 1.5(IQR)$ ; identified as a star on a dot plot.

New cards

Outlier rule (Symmetric)

Any value more than 2 standard deviations away from the mean.

New cards

Response variable

Also known as the dependent variable or the $y$ value on a scatter plot; it represents the result.

New cards

Explanatory variable

Also known as the independent variable, factor, treatment, or the $x$ value on a scatter plot.

New cards

Percentile

The $P$ percentile is the value that has $p\%$ of the data less than or equal to it.

New cards

Z-score

A measures of a data value's distance from the mean in standard deviations, calculated as $z = \frac{x - \mu}{\sigma}$ .

New cards

Empirical rule (68-95-99.7 rule)

In a normal distribution, $68\%$ of data is within $1\sigma$ , $95\%$ is within $2\sigma$ , and $99.7\%$ is within $3\sigma$ .

New cards

Correlation coefficient ( $r$ )

A value between $1$ and $-1$ that describes the strength and direction of the relationship between two variables.

New cards

CDUFS

Acronym for describing scatter plots: Context, Direction (positive/negative/neutral), Unusual features (outliers/clusters), Form (linear/non-linear), and Strength.

New cards

Linear regression line ( $LSRL$ )

Also known as the line of best fit; it is the line that most closely matches the linear relationship and represents the average slope of the data.

New cards

Coefficient of determination ( $r^2$ )

The variation in $y$ explained by the linear relationship of $x$ ; indicates the percentage of data explained by the linear line.

New cards

Residuals

The difference between the actual value and the predicted value, calculated as $\text{actual } y - \text{predicted } y$ .

New cards

Extrapolation

Predicting a data point that is far away from the rest of the data, making the model less reliable.

New cards

High leverage points

Points with very large or very small $x$ values compared to the rest of the data.

New cards

Influential points

Points that, if removed, significantly change the slope or y-intercept of the regression line.

New cards

Bias

An over or underestimation of a population characteristic.

New cards

Simple Random Sample ( $SRS$ )

A sampling method where every individual has an equal chance of being chosen.

New cards

Confounding variable

A variable not accounted for that can influence the response variable and is related to the explanatory variable.

New cards

Law of large numbers

States that simulated probabilities tend to get closer to the true probability as the number of trials increases.

New cards

Statistically significant

A result that is unlikely to occur by chance alone, typically defined as having a probability of less than $5\%$ .

New cards

Mutually exclusive

Also known as disjoint events; two events where the outcome of one does not affect the outcome of the other, and they cannot occur simultaneously.

New cards

Central limit Theorem

As sample size grows, the sampling distribution of the mean becomes more normal regardless of the population's shape.

New cards

Confidence intervals ( $CI$ )

A range of believable values where the true parameter lies, found by $\text{point estimate} \pm \text{margin of error}$ .

New cards

Type 1 error ( $\alpha$ )

A false positive; rejecting the null hypothesis ( $H_0$ ) when it is actually true.

New cards

Type 2 error ( $\beta$ )

A false negative; failing to reject the null hypothesis ( $H_0$ ) when it is actually false.

New cards

Power ( $1 - \beta$ )

The probability of correctly rejecting a false null hypothesis in favor of a specific alternative.

New cards

Chi-square statistics ( $\chi^2$ )

A type of statistic that measures the difference between observed and expected frequencies in categorical data.