AP Statistics Vocabulary

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/99

flashcard set

Earn XP

Description and Tags

A complete sequence of AP Statistics vocabulary terms, providing definitions and relevant statistical notation for each concept found in the lecture notes.

Last updated 7:09 PM on 5/22/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

100 Terms

1
New cards

Alternative Hypothesis

States that a treatment has had an effect or caused a change in the population

2
New cards

Bias

Describes a study which systematically favors certain outcomes

3
New cards

Binomial Distribution

The distribution of the probabilities of XX successes out of nn trials, calculated using pp as the probability of any single success – B(n,p)B(n, p)

4
New cards

Blind

Describes an experiment in which the subjects do not know which treatment they are getting

5
New cards

Blocking

A statistical design which creates groups that are similar in some way, and then randomizes the treatments within each block

6
New cards

Central Limit Theorem

States that when an SRS is drawn from a population with mean μ\mu and standard deviation σ\sigma, the sampling distribution for the sample mean will be approximately normally distributed, and have a mean μ\mu and a standard deviation σn\frac{\sigma}{\sqrt{n}}

7
New cards

Chi-Square Distributions

A family of skewed-right distributions which take on only positive values and are defined by their degrees of freedom – the specific shape of the Chi-Square Distribution changes as the sample size changes

8
New cards

Chi-Square Goodness-of-Fit Test

Used to determine if a population has a certain hypothesized distribution

9
New cards

Chi-Square Test for Homogeneity

Used to determine if every category in the population has the same population

10
New cards

Chi-Square Test for Independence

Used to determine if there is a relationship between two categorical variables – also known as Chi-Square Test for Association

11
New cards

Coefficient of Determination

Tells what percent of the change in the response variable can be attributed to the change in the explanatory variable – symbolized as r2r^2

12
New cards

Complement of an Event

The set of all outcomes not defined as successful outcomes for any event

13
New cards

Conditional Probability

The probability of an event occurring if it is known that another specific event has already occurred

14
New cards

Confidence Interval

An interval estimate of a parameter calculated using a sample from that population

15
New cards

Confidence Level

The probability that the desired parameter will fall into a confidence interval if many intervals were calculated from samples of the same size

16
New cards

Confounding Variable

A variable which could affect the result of a statistical test but has not been controlled for

17
New cards

Continuous Random Variable

A random variable which takes on all values in an interval of numbers

18
New cards

Control Group

Any group of subjects who receive either a placebo or no treatment at all during an experiment

19
New cards

Correlation

Measures the direction and strength of the linear relationship between two quantitative variables – symbolized as rr

20
New cards

Critical Value

A value (z-score, t-score, or χ2\chi^2 value) used in a hypothesis test to help determine if the null hypothesis should be rejected

21
New cards

Cumulative Distribution Function

A function which calculates the sum of the probabilities for each possible value for any random variable XX

22
New cards

Degrees of Freedom

A value used to help determine significance for a t-test or a Chi-Square test – measured as n1n-1 in most cases, or (r1)(c1)(r-1)(c-1) when dealing with two-way tables

23
New cards

Dependent Trials

Trials whose probability is affected by the outcome of previous trials

24
New cards

Density Curve

A curve used to represent a distribution; always on or above the horizontal axis and has a total area of exactly 11 underneath it

25
New cards

Discrete Random Variable

A random variable with countable outcomes

26
New cards

Mutually Exclusive Events

Events which cannot occur at the same time

27
New cards

Distribution

A list of what values a variable takes on and how often it takes on each one of those values

28
New cards

Double Blind

Describes an experiment in which neither the subjects nor the researcher know which treatment each subject is getting

29
New cards

Empirical Rule

Also known as the 689599.768-95-99.7 rule – is used as an approximation for what percent of the data falls within 11, 22, or 33 standard deviations of the mean in any normal distribution

30
New cards

Expected Value/ Mean

The 'average' of a data set

31
New cards

Experimental Units/Subjects

The individuals on which an experiment is conducted

32
New cards

Explanatory/Independent Variable

Attempts to explain the observed outcomes in a statistical study

33
New cards

Exploratory Data Analysis

Uses graphs and numerical summaries to describe the variables in a data set and the relationships among them

34
New cards

Factor

Any explanatory variable in an experiment

35
New cards

Five Number Summary

A method to describe a data set using the minimum, first quartile, median, third quartile, and maximum points in the data set

36
New cards

Geometric Distribution

A distribution of probabilities of when the first successful outcome occurs in a probability experiment

37
New cards

Hypothesis/Significance Test

A type of inference used to determine the feasibility of an assumed population parameter

38
New cards

Independent Trials

Trails whose probabilities are not affected by the outcome of previous trials

39
New cards

Individuals

People or objects described by a set of data

40
New cards

Inference

The statistical process of drawing conclusions about a population by examining data from a sample

41
New cards

Influential Point

A point which, if removed from the data set, would markedly change the regression equation for that data set

42
New cards

Interquartile Range (IQR)

The difference between the third and first quartiles of a data set

43
New cards

Law of Large Numbers

States that as increased numbers of observations are drawn from any population, the mean of the observations eventually approaches the mean of the population as closely as we would like to estimate it, and remains that close or closer

44
New cards

Least Squares Regression Line

A regression line which makes the sum of the squares of the vertical distances from the data points to the line as small as possible

45
New cards

Level

A numerical value of a factor of an experiment

46
New cards

Matched Pairs

A statistical design which compares two treatments – this is usually done with one sample receiving each treatment over a different time period

47
New cards

Median

The point at which 50%50\% of the data is above and 50%50\% of the data is below

48
New cards

Nonresponse

A type of bias that occurs when an individual chosen for a sample cannot be contacted or chooses not to participate

49
New cards

Normal Distribution

A symmetric, bell-shaped distribution in which approximately 68%68\% of the data lies within one standard deviation of the mean, 95%95\% lies within two standard deviations, and 99.7%99.7\% lies within three standard deviations – defined by mean and standard deviation

50
New cards

Null Hypothesis

States that either a treatment has had no effect on a population, or that the population has not changed

51
New cards

Observation

Any single point from a data set

52
New cards

Outlier

An individual observation that falls outside the pattern of the data set – often defined as any number that is 1.5(IQR)1.5(IQR) outside of Q1Q1 or Q3Q3

53
New cards

P-value

The probability that the observed outcome would take on a value as extreme or more extreme than observed if the null hypothesis were true

54
New cards

Parameter

A number that describes a population

55
New cards

Percentile

Tells what percent of a data set falls below the given observation

56
New cards

Placebo

A false treatment which should have no effect on an experiment

57
New cards

Pooled Procedures

Occurs when separate samples are combined into a single sample for analysis – done only if population variances are equal

58
New cards

Population

The entire group of individuals that we want information about

59
New cards

Power of a Hypothesis Test

The probability that the test will reject the null hypothesis when the null hypothesis is false – equal to 1P(Type II error)1 - P(\text{Type II error})

60
New cards

Probability

The proportion of times an outcome would occur over a large number of trials

61
New cards

Probability Distribution Function

A function which assigns a probability for each possible value for any discrete random variable XX

62
New cards

Proportion

Tells what percent of a data set falls into a given category

63
New cards

Qualitative Variable

A variable which takes on a non-numeric description

64
New cards

Quantitative Variable

A variable which takes on a numeric value

65
New cards

Quartiles

Observations which fall at the 25th25\text{th}, 50th50\text{th}, and 75th75\text{th} percentiles of a data set

66
New cards

Range

The difference between the maximum and minimum values of a data set

67
New cards

Random

When individual outcomes are uncertain, but there is a pattern to the distribution of the outcomes over time

68
New cards

Random Variable

A variable whose value is a numeric outcome of a random phenomenon

69
New cards

Randomization

Using the laws of probability to select members for a sample or assign treatments to samples in experiments

70
New cards

Regression Line

A straight line that describes how a response variable changes as the explanatory variable changes

71
New cards

Residual

The difference between and observed value of a response variable and its predicted value from a regression equation

72
New cards

Response/Dependent Variable

Measures the outcome of a statistical study

73
New cards

Robustness

A measure of how much the P-value of a test is affected if the conditions of the hypothesis test are not met

74
New cards

Sample

A part of the population used to gather information about the entire population

75
New cards

Sample Space

A list of all possible outcomes for a random event

76
New cards

Sampling Distribution

A distribution of values taken by a statistic in all possible samples of the same size from the same population

77
New cards

Sampling Frame

A list from which a sample is chosen – ideally consists of the entire population

78
New cards

Significance Level

The point at which it will be determined that a result is statistically significant

79
New cards

Simple Random Sample (SRS)

A sample in which every member and every group of size nn has the same probability to be chosen

80
New cards

Simulation

A method for collecting data which uses the laws of probability to represent all possible outcomes of an experiment

81
New cards

Skewed

Describes a distribution whose histogram extends much farther to one side of the mean than the other in the direction of the 'tail'

82
New cards

Standard Deviation

Square root of the variance – used as a common measure of spread for a data set

83
New cards

Standard Error

The standard deviation of a sampling distribution – measures the amount of expected error per standard deviation from the mean

84
New cards

Standard Normal Distribution

A normal distribution with a mean of zero and a standard deviation of one

85
New cards

Statistic

A number that describes a sample

86
New cards

Statistically Significant

An observed effect so far removed from the mean that it would be unlikely to occur by chance alone

87
New cards

Stratified Random Sample

A sample chosen by splitting the population into several well-defined groups, then taking an SRS from each group

88
New cards

Symmetric

Describes a distribution whose histogram has its left and right sides as mirror images of each other

89
New cards

t-Distributions

A family of symmetric, bell-shaped distributions with a standard deviation larger than that of the standard normal distribution – defined by degrees of freedom

90
New cards

Treatment

A specific experimental condition applied to an experimental unit or subject

91
New cards

Treatment Group

A group of subjects who receive an actual treatment during an experiment

92
New cards

Type I Error

When the null hypothesis is rejected but it is in fact true

93
New cards

Type II Error

When the null hypothesis is not rejected but it is in fact false

94
New cards

Unbiased Statistic

A statistic from a sampling distribution whose mean must be equal to the mean of the population

95
New cards

Undercoverage

A type of bias that occurs when some groups of a population are left out of the selection process for the sample

96
New cards

Variability

Describes the spread of a data set

97
New cards

Variable

Any characteristic of an individual

98
New cards

Variance

The average of the squares of the deviations of the observation from their mean – used as a measure of spread

99
New cards

Voluntary Response Sample

Consists only of people who choose to participate – a poor method for collecting meaningful data

100
New cards

z-Score

A measure used to tell how many standard deviations above or below the mean an observation lies – also known as a Standardized Score