AP Statistics Study Guide Flashcards

0.0(0)

Studied by 0 people

View linked note

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/97

Earn XP

Description and Tags

Flashcards covering key vocabulary and concepts from an AP Statistics course, based on provided study guide notes. These flashcards are optimized for vocabulary review.

Last updated 11:28 PM on 5/6/25

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

98 Terms

New cards

Statistics

The science and art of collecting, analyzing, and drawing conclusions from data.

New cards

Individual

Object described in a set of data.

New cards

Variable

Aspect that can take different values for different individuals.

New cards

Distribution

Pattern of variation of a variable.

New cards

Descriptive Statistics

Analyzing data.

New cards

Inferential Statistics

Making inferences / drawing conclusions from data.

New cards

Nominal Variable

No certain order.

New cards

Ordinal Variable

No order *could be numbers if they don’t measure anything (eg. cell phone digits)

New cards

Discrete Variable

Fixed set of possible values with gaps between them, whole numbers or defined intervals, countable or countably infinite.

New cards

Continuous Variable

Infinite possibilities, decimals / fractions, any value in an interval on the number line.

New cards

Basic Statistic Vocab

Also known as cases / observational units.

New cards

Frequency Table

Shows what values the variable takes & how often it takes them types of statistics.

New cards

Two-way table

Summarizes data on relationship between two categorical variables for a group of individuals.

New cards

Side-by-side bar graph

Bars showing the distribution of a categorical variable for each value of another categorical variable (grouped side-by-side).

New cards

Segmented bar graph

Distribution of a categorical variable as segments of a whole (bars stacked on top of each other & proportional to relative frequencies).

New cards

Mosaic plot

The width of the bars proportional to number of individuals in that category.

New cards

Association

Knowing the value of one variable allows you to predict value of the other.

New cards

Back-to-back stemplot

Quantitative data that’s split into two groups.

New cards

Mean

Average.

New cards

Median

Middle value.

New cards

Mode

Most common value.

New cards

IQR

Interquartile range (middle 50% of values).

New cards

Standard deviation

Typical distance from mean.

New cards

Resistant Measure

Not sensitive to skewness / outliers.

New cards

Statistic

A value that describes a characteristic of a sample.

New cards

Parameter

A value that describes a characteristic of a population.

New cards

Percentile

pth percentile is value with p% observations less than or equal to it.

New cards

Cumulative relative frequency graphs / ogives

Plots points corresponding to the percentile of a value in the distribution & points connected with line segments to create the graph.

New cards

Standardized scores (z-scores)

How many standard deviations from the mean a value is (& what direction).

New cards

Density curve

Simplified model of a distribution of a quantitative variable, always on or above horizontal axis, has an area of exactly 1 underneath it.

New cards

Normal distributions

Bell shaped & symmetric & unimodal distribution approximated with a normal curve (density curve).

New cards

Extrapolation

Using a regression line to make predictions way outside of the interval of x-values used to generate the line (beyond the scope of your data).

New cards

Least-squares regression line

Line that minimizes sum of squared residuals.

New cards

Residuals

Actual value – predicted value (based on line).

New cards

Residual plots

Scatterplot that plots residuals against explanatory variable, determines whether a linear model is appropriate (check for random scatter & no leftover curved pattern).

New cards

Standard deviation of residuals (s)

Measures typical residual (distance between predicted & actual).

New cards

Coefficient of determination (r2)

Square of correlation r when finding r from r2, make sure to consider direction of correlation!

New cards

Influential points

Points that, if removed, substantially change the slope, y-int, r, r2 , or s *these are very often influential (but not automatically guaranteed to be).

New cards

Transforming to achieve linearity

Applying a function to a quantitative variable (changes the scale of measurement) in order to make the scatterplot more approximately linear (in order to use linear regression methods).

New cards

Sampling

Selecting a random group of people out of a whole population (that’s representative of the population).

New cards

Sampling frame

The group of members from the population from which we select our sample.

New cards

Sampling survey

Collects data from the individuals in the sample (to learn about the population).

New cards

SRS (Simple Random Sample)

Every group of n individuals has an equal chance of being selected.

New cards

Stratified Sample

SRS selected from each strata. Strata: group w similar characteristics assumed to be associated with the variables being measured.

New cards

Clustered Sample

Randomly selecting entire clusters, Clusters: diff responses between (hopefully representative of population).

New cards

Systematic Sample

Randomly select starting point & select every kth individual after.

New cards

Convenience Sampling

Individuals who are easy to reach.

New cards

Voluntary Response Sampling

Allows individuals to choose to be in sample.

New cards

Bias

Likely to systematically overestimate or underestimate the value.

New cards

Undercoverage

Certain individuals less likely / cannot be chosen in a sample.

New cards

Nonresponse

Individual chosen for sample can’t be contacted / doesn’t participate.

New cards

Response Bias

Systematic pattern of inaccurate answers to a survey question.

New cards

Observational Studies

Observes individuals & measures variables of interest (does not influence responses).

New cards

Experiments

Imposes a treatment on individuals & measures their responses.

New cards

Placebo

No active ingredient.

New cards

Treatment

Condition imposed on individuals.

New cards

Experimental unit

Individual to which treatment applied, subject: human experimental unit.

New cards

Factor

Explanatory var that’s manipulated (may cause change in response var).

New cards

Levels

Diff possible values of a factor.

New cards

Confounding

When variables are associated so that their effects on a response variable can’t be distinguished from one another.

New cards

Control group

Provides a baseline for comparison.

New cards

Replication

Use enough subjects (diff in effects can be distinguished from chance variation).

New cards

Double blind

Neither subjects nor the ppl measuring know the treatment.

New cards

Single-blind

Only one of the groups (above) knows.

New cards

Completely randomized design

Experimental units assigned to treatments completely at random.

New cards

Randomized block design

Random assignment within each block. Block: group of experimental units known to be similar in some way that could affect their response to the treatments.

New cards

Matched pairs design

A type of RBD where blocks are pairs.

New cards

Statistical significance

Observed diff is larger than can be attributed to chance alone.

New cards

Statistical inference

Generalizing results to population, assuming sample is representative of population (ensured by random sample).

New cards

Sampling variability

Diff random samples (same size, same population) produce diff estimates.

New cards

Random process

Generates outcomes purely by chance.

New cards

Probability

Likelihood of an event to happen.

New cards

Law of large numbers

More trials means proportion approaches true probability (more accurate).

New cards

Simulation

Imitates random process such that simulated outcomes are consistent with real-world outcomes.

New cards

Probability model

Description of a random process that includes a list of all possible outcomes & the probability for each outcome.

New cards

Sample space

List of all outcomes.

New cards

Event

Any collection of outcomes from a random process.

New cards

Complement

The probability that an event does not occur.

New cards

Intersection

P(A and B) = A ∩ B (both A and B must be true).

New cards

Union

P(A or B) = A ⋃ B (at least one–either A or B, or both–must be true).

New cards

Mutually exclusive events

Cannot occur simultaneously (no outcomes in common) (also known as disjoint).

New cards

Non-mutually exclusive events

Can occur simultaneously.

New cards

Conditional probability

Probability that an event happens given that another event is known to have happened: P(A | B).

New cards

Independent events

Knowing whether or not one event has occurred does not change the probability that the other event will happen P(A | B) = P(A | BC) = P(A).

New cards

Random variable

Takes numerical values that describe the outcomes of a random process.

New cards

Discrete Random Variable

Fixed set of values with gaps between them can be described using probability distributions & histograms (each bar a value).

New cards

Continuous Random Variable

Any value in an interval on the number line probability distribution: density curve.

New cards

Binomial Random Variable

Use acronym BINS to check for binomial setting.

New cards

Geometric random variable

Number of trials it takes to get a success in a geometric setting.

New cards

Sampling distribution

The distribution of a statistic in all possible samples of the same size from the population.

New cards

Unbiased Estimator

Mean of sampling distribution of a statistic equal to true value of parameter same as accuracy check center (unbiased estimator).

New cards

Biased Estimator

Statistics consistently do not match parameters same as precision check variability choose an estimator with low bias & low variability.

New cards

Confidence interval

An interval of plausible values for an unknown population parameter based on sample data.

New cards

Confidence Level

Success rate / capture rate of the method that produces the interval accounts for sampling variability & increases confidence that our parameter value is correct.

New cards

Power of a test

Probability that a test will find convincing evidence for Ha when a specific alternative value of the parameter is true (probability that you avoid a type II error).

New cards

Chi square tests for goodness of fit

To check whether a hypothesized distribution seems valid.

New cards

Chi-square test for homogeneity

Compares distributions of a single cat var over multiple populations / treatments (multiple independent samples).

New cards

Chi-square test for independence & association

Compares distributions of two cat var (association) in a single population (one sample).