AP Statistics Comprehensive Review

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/35

flashcard set

Earn XP

Description and Tags

Vocabulary flashcards covering Units 1 through 9 of the Statistics curriculum, including data distributions, regression, sampling, probability, and inference.

Last updated 2:50 PM on 4/30/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

36 Terms

1
New cards

Standard deviation

The measure from center used when a distribution is symmetric; it is non-resistant and greatly affected by outliers.

2
New cards

Joint relative frequency

Calculated as the value in a specific cell divided by the total.

3
New cards

Conditional relative frequency

Calculated as the value in a specific cell divided by the row total.

4
New cards

Marginal relative frequency

Calculated as the row total divided by the grand total.

5
New cards

Bimodal

A characteristic of a graph that has two separate peaks.

6
New cards

Median (Q2Q2)

The middle number of a data set when numbers are lined up from least to greatest.

7
New cards

Five number summary

The first step to create a box plot; includes the minimum, q1q1, median, q3q3, and maximum.

8
New cards

Interquartile range (IQRIQR)

Shows where 50% of the data set lies, calculated as q3q1q3 - q1.

9
New cards

SOCCS

An acronym for describing distributions: Shape, Outliers, Center, Context, and Spread.

10
New cards

Outlier rule (Skewed)

Any value less than q11.5(IQR)q1 - 1.5(IQR) or greater than q3+1.5(IQR)q3 + 1.5(IQR); identified as a star on a dot plot.

11
New cards

Outlier rule (Symmetric)

Any value more than 2 standard deviations away from the mean.

12
New cards

Response variable

Also known as the dependent variable or the yy value on a scatter plot; it represents the result.

13
New cards

Explanatory variable

Also known as the independent variable, factor, treatment, or the xx value on a scatter plot.

14
New cards

Percentile

The PP percentile is the value that has p%p\% of the data less than or equal to it.

15
New cards

Z-score

A measures of a data value's distance from the mean in standard deviations, calculated as z=xμσz = \frac{x - \mu}{\sigma}.

16
New cards

Empirical rule (68-95-99.7 rule)

In a normal distribution, 68%68\% of data is within 1σ1\sigma, 95%95\% is within 2σ2\sigma, and 99.7%99.7\% is within 3σ3\sigma.

17
New cards

Correlation coefficient (rr)

A value between 11 and 1-1 that describes the strength and direction of the relationship between two variables.

18
New cards

CDUFS

Acronym for describing scatter plots: Context, Direction (positive/negative/neutral), Unusual features (outliers/clusters), Form (linear/non-linear), and Strength.

19
New cards

Linear regression line (LSRLLSRL)

Also known as the line of best fit; it is the line that most closely matches the linear relationship and represents the average slope of the data.

20
New cards

Coefficient of determination (r2r^2)

The variation in yy explained by the linear relationship of xx; indicates the percentage of data explained by the linear line.

21
New cards

Residuals

The difference between the actual value and the predicted value, calculated as actual ypredicted y\text{actual } y - \text{predicted } y.

22
New cards

Extrapolation

Predicting a data point that is far away from the rest of the data, making the model less reliable.

23
New cards

High leverage points

Points with very large or very small xx values compared to the rest of the data.

24
New cards

Influential points

Points that, if removed, significantly change the slope or y-intercept of the regression line.

25
New cards

Bias

An over or underestimation of a population characteristic.

26
New cards

Simple Random Sample (SRSSRS)

A sampling method where every individual has an equal chance of being chosen.

27
New cards

Confounding variable

A variable not accounted for that can influence the response variable and is related to the explanatory variable.

28
New cards

Law of large numbers

States that simulated probabilities tend to get closer to the true probability as the number of trials increases.

29
New cards

Statistically significant

A result that is unlikely to occur by chance alone, typically defined as having a probability of less than 5%5\%.

30
New cards

Mutually exclusive

Also known as disjoint events; two events where the outcome of one does not affect the outcome of the other, and they cannot occur simultaneously.

31
New cards

Central limit Theorem

As sample size grows, the sampling distribution of the mean becomes more normal regardless of the population's shape.

32
New cards

Confidence intervals (CICI)

A range of believable values where the true parameter lies, found by point estimate±margin of error\text{point estimate} \pm \text{margin of error}.

33
New cards

Type 1 error (α\alpha)

A false positive; rejecting the null hypothesis (H0H_0) when it is actually true.

34
New cards

Type 2 error (β\beta)

A false negative; failing to reject the null hypothesis (H0H_0) when it is actually false.

35
New cards

Power (1β1 - \beta)

The probability of correctly rejecting a false null hypothesis in favor of a specific alternative.

36
New cards

Chi-square statistics (χ2\chi^2)

A type of statistic that measures the difference between observed and expected frequencies in categorical data.