AP Stats must-know term

Description and Tags

Statistics

AP Statistics

AP Stats

10th

Last updated 9:16 PM on 1/14/25

106 Terms

Alternative Hypothesis

states that a treatment has had an effect or caused a change in the population

Bias

describes a study which systematically favors certain outcomes

Binomial Distribution

the distribution of the probabilities of X successes out of n trials, calculated using p as the probability of any single success - B(n, p)
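
As a minimal sketch of the B(n, p) definition above, the probability of exactly k successes can be computed with the standard binomial formula. The function name `binomial_pmf` and the coin-flip numbers are illustrative assumptions, not part of the original card, and the formula assumes independent trials.

```python
from math import comb

def binomial_pmf(k: int, n: int, p: float) -> float:
    """P(X = k) for X ~ B(n, p): exactly k successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

# Hypothetical example: exactly 3 heads in 5 flips of a fair coin.
prob = binomial_pmf(3, 5, 0.5)
print(prob)  # 0.3125
```

Summing `binomial_pmf(k, n, p)` over every k from 0 to n should give 1, which is a quick sanity check on any binomial calculation.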

Blind

describes an experiment in which the subjects do not know which treatment they are getting

Blocking

a statistical design which creates groups that are similar in some way, and then randomizes the treatments within each block

Central Limit Theorem

states that when an SRS of size n is drawn from any population with mean µ and standard deviation σ, and n is sufficiently large, the sampling distribution of the sample mean will be approximately normally distributed, with mean µ and standard deviation σ/√n
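
The theorem can be checked with a small simulation. This is only a sketch under assumptions that are not in the original card: the population is modeled as an exponential distribution with µ = 1 and σ = 1 (clearly skewed, not normal), each sample has n = 40 observations, and 5000 samples are drawn.

```python
import random
import statistics

random.seed(0)  # fixed seed so the sketch is repeatable

n = 40              # size of each sample (assumed)
num_samples = 5000  # how many samples to draw (assumed)

# Each sample mean comes from n draws of an exponential population (mean 1, SD 1).
sample_means = [
    statistics.fmean(random.expovariate(1.0) for _ in range(n))
    for _ in range(num_samples)
]

mean_of_means = statistics.fmean(sample_means)
sd_of_means = statistics.stdev(sample_means)
predicted_sd = 1.0 / n ** 0.5  # sigma / sqrt(n)

print(mean_of_means, sd_of_means, predicted_sd)
```

The mean of the sample means should land near µ = 1 and their standard deviation near σ/√n ≈ 0.158, even though the population itself is skewed.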

Chi-Square Distributions

a family of skewed-right distributions which take on only non-negative values and are defined by their degrees of freedom - the specific shape of the Chi-Square Distribution changes as the degrees of freedom change

Chi-Square Goodness-of-Fit Test

used to determine if a population has a certain hypothesized distribution
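
As a rough illustration, the goodness-of-fit statistic is the sum of (observed - expected)² / expected over all categories. The die-roll counts below are made up for the example, and the sketch stops at the statistic and its degrees of freedom rather than looking up a P-value.

```python
# Hypothetical data: 120 rolls of a die, testing whether the die is fair.
observed = [18, 22, 20, 25, 15, 20]
expected = [sum(observed) / len(observed)] * len(observed)  # fair die: 20 per face

# Chi-square statistic: sum of (observed - expected)^2 / expected.
chi_square = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
df = len(observed) - 1  # number of categories minus 1

print(chi_square, df)
```

Here the statistic works out to about 2.9 with 5 degrees of freedom. It would then be compared against a chi-square critical value for that many degrees of freedom.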

Chi-Square Test for Homogeneity

used to determine if the distribution of a categorical variable is the same in each of several populations

Chi-Square Test for Independence/Chi-Square Test for Association

used to determine if there is a relationship between two categorical variables

Coefficient of Determination

tells what proportion of the variation in the response variable is explained by its linear relationship with the explanatory variable - symbolized as r²

Complement of an Event

the set of all outcomes in the sample space that are not part of the event

Conditional Probability

the probability of an event occurring if it is known that another specific event has already occurred
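
A small worked example may help: P(A | B) = P(A and B) / P(B). The die roll and the two events below are hypothetical, chosen only to make the arithmetic easy to follow.

```python
from fractions import Fraction

# Hypothetical experiment: roll one fair six-sided die.
sample_space = {1, 2, 3, 4, 5, 6}
event_a = {2, 4, 6}  # the roll is even
event_b = {4, 5, 6}  # the roll is greater than 3

def prob(event):
    """Probability of an event when every outcome is equally likely."""
    return Fraction(len(event & sample_space), len(sample_space))

# P(A | B) = P(A and B) / P(B)
p_a_given_b = prob(event_a & event_b) / prob(event_b)
print(p_a_given_b)  # 2/3
```

Knowing that B occurred shrinks the sample space to {4, 5, 6}, and two of those three outcomes are even.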

Confidence Interval

an interval estimate of a parameter calculated using a sample from that population
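
One common case is the one-sample z-interval for a proportion, sketched below. The helper name `proportion_z_interval` and the survey counts are assumptions made for illustration, and the sketch assumes the usual large-sample conditions hold.

```python
from statistics import NormalDist

def proportion_z_interval(successes: int, n: int, confidence: float = 0.95):
    """One-sample z-interval for a population proportion."""
    p_hat = successes / n
    # Critical value z*: leaves (1 - confidence) / 2 in each tail.
    z_star = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z_star * (p_hat * (1 - p_hat) / n) ** 0.5
    return p_hat - margin, p_hat + margin

# Hypothetical survey: 120 of 200 respondents answer "yes".
low, high = proportion_z_interval(successes=120, n=200)
print(round(low, 3), round(high, 3))
```

For these made-up counts the 95% interval runs from roughly 0.532 to 0.668, centered on the sample proportion 0.6.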

Confidence Level

the percent of confidence intervals that would capture the true parameter if many intervals were calculated from samples of the same size

Confounding Variable

a variable which could affect the result of a statistical test but has not been controlled for

Continuous Random Variable

a random variable which takes on all values in an interval of numbers

Control Group

any group of subjects who receive either a placebo or no treatment at all during an experiment

Correlation

measures the direction and strength of the linear relationship between two quantitative variables - symbolized as r
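
As a sketch, r can be computed as the sum of the products of the standardized x and y values, divided by n - 1. The function name `correlation` and the hours/scores data are hypothetical.

```python
from statistics import fmean, stdev

def correlation(xs, ys):
    """Correlation r: average product of z-scores, using n - 1 as the divisor."""
    n = len(xs)
    x_bar, y_bar = fmean(xs), fmean(ys)
    s_x, s_y = stdev(xs), stdev(ys)
    return sum(
        ((x - x_bar) / s_x) * ((y - y_bar) / s_y) for x, y in zip(xs, ys)
    ) / (n - 1)

# Hypothetical data: hours studied and test scores for five students.
hours = [1, 2, 3, 4, 5]
scores = [52, 60, 61, 70, 77]

r = correlation(hours, scores)
print(round(r, 3))
```

For this data r is about 0.981, a strong positive linear relationship. Squaring it gives the coefficient of determination, about 0.963.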

Critical Value

a value (z-score, t-score, or χ² value) used in a hypothesis test to help determine if the null hypothesis should be rejected

Cumulative Distribution Function

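a function which gives the probability that a random variable X takes a value less than or equal to x - for a discrete random variable, this is the sum of the probabilities of every possible value up to and including x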

Degrees of Freedom

a value used to help determine significance for a t-test or a Chi-Square test - measured as n-1 in most cases, or (r-1)(c-1) when dealing with two-way tables

Dependent Trials

trials whose probability is affected by the outcome of previous trials

Density Curve

a curve used to represent a distribution - a density curve is always on or above the horizontal axis and has a total area of exactly 1 underneath it

Discrete Random Variable

a random variable with countable outcomes

Disjoint Events (Mutually Exclusive Events)

events which cannot occur at the same time

Distribution

a list of what values a variable takes on and how often it takes on each one of those values

Double Blind

describes an experiment in which neither the subjects nor the researcher know which treatment each subject is getting

Empirical Rule (the 68-95-99.7 rule)

is used as an approximation for what percent of the data falls within 1, 2, or 3 standard deviations of the mean in any normal distribution
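
The three percentages can be checked against the standard normal curve. This is a quick sketch using Python's `statistics.NormalDist`; the variable names are illustrative.

```python
from statistics import NormalDist

standard_normal = NormalDist()  # mean 0, standard deviation 1

# Area within k standard deviations of the mean, for k = 1, 2, 3.
within = {
    k: standard_normal.cdf(k) - standard_normal.cdf(-k) for k in (1, 2, 3)
}

for k, area in within.items():
    print(f"within {k} SD: {area:.4f}")
```

The areas come out to roughly 0.6827, 0.9545, and 0.9973, which is where the rounded 68-95-99.7 figures come from.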

Experimental Units

the individuals on which an experiment is conducted - if the test is being conducted on humans, the units are called Subjects

Explanatory Variable/Independent Variable

attempts to explain the observed outcomes in a statistical study

Exploratory Data Analysis

uses graphs and numerical summaries to describe the variables in a data set and the relationships among them

Factor

any explanatory variable in an experiment

Five Number Summary

a method to describe a data set using the minimum, first quartile, median, third quartile, and maximum points in the data set

Geometric Distribution

the distribution of the number of independent trials needed to get the first success, where every trial has the same probability p of success
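
As a minimal sketch, the probability that the first success happens on trial k is (1 - p)^(k - 1) · p: k - 1 failures followed by one success. The function name `geometric_pmf` and the die example are assumptions for illustration, and the formula assumes independent trials with a constant p.

```python
def geometric_pmf(k: int, p: float) -> float:
    """P(first success occurs on trial k), with success probability p per trial."""
    return (1 - p) ** (k - 1) * p

# Hypothetical example: rolling a fair die until the first six appears.
# P(first six on the 3rd roll) = (5/6)^2 * (1/6) = 25/216
prob_third_roll = geometric_pmf(3, 1 / 6)
print(round(prob_third_roll, 4))
```

For the die example this is about 0.1157. The probabilities shrink as k grows, which is why the geometric distribution is always skewed right.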

Hypothesis Test/Significance Test

a type of inference used to judge the plausibility of an assumed value of a population parameter

Independent Trials

trials whose probabilities are not affected by the outcome of previous trials

Individuals

people or objects described by a set of data

Inference

the statistical process of drawing conclusions about a population by examining data from a sample

Influential Point

a point which, if removed from the data set, would markedly change the regression equation for that data set

Interquartile Range (IQR)

the difference between the third and first quartiles of a data set

Law of Large Numbers

states that as increased numbers of observations are drawn from any population, the mean of the observations eventually approaches the mean of the population as closely as we would like to estimate it, and remains that close or closer

Least Squares Regression Line

a regression line which makes the sum of the squares of the vertical distances from the data points to the line as small as possible
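
The line can be sketched with the standard least-squares formulas: the slope is the sum of (x - x̄)(y - ȳ) divided by the sum of (x - x̄)², and the intercept is ȳ minus slope times x̄. The function name and the hours/scores data are hypothetical.

```python
from statistics import fmean

def least_squares_line(xs, ys):
    """Return (intercept, slope) of the least squares regression line."""
    x_bar, y_bar = fmean(xs), fmean(ys)
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sum(
        (x - x_bar) ** 2 for x in xs
    )
    intercept = y_bar - slope * x_bar  # the line passes through (x_bar, y_bar)
    return intercept, slope

# Hypothetical data: hours studied and test scores for five students.
hours = [1, 2, 3, 4, 5]
scores = [52, 60, 61, 70, 77]

intercept, slope = least_squares_line(hours, scores)

# Residual = observed value - predicted value, one per data point.
residuals = [y - (intercept + slope * x) for x, y in zip(hours, scores)]
print(intercept, slope, residuals)
```

For this data the line is predicted score = 46 + 6(hours), and the residuals sum to zero, as they do for any least squares line with an intercept.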

Level

a specific value or setting of a factor in an experiment

Matched Pairs

a statistical design which compares two treatments using pairs of similar subjects, or using the same subjects receiving each treatment over a different time period

Mean (expected value)

the "average" of a data set

Median

the point at which 50% of the data is above and 50% of the data is below

Mutually Exclusive Events

see Disjoint Events

Nonresponse

a type of bias that occurs when an individual chosen for a sample cannot be contacted or chooses not to participate

Normal Distribution

a symmetric, bell-shaped distribution in which approximately 68% of the data lies within one standard deviation of the mean, 95% lies within two standard deviations of the mean, and 99.7% lies within three standard deviations of the mean - all normal distributions can be defined by their mean and standard deviation

Null Hypothesis

states that either a treatment has had no effect on a population, or that the population has not changed

Observation

any single point from a data set

Outlier

an individual observation that falls outside the pattern of the data set - often defined as any number that is more than 1.5(IQR) below Q1 or more than 1.5(IQR) above Q3
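
The 1.5(IQR) rule can be sketched as follows. The data set is made up, and the quartiles are taken as the medians of the lower and upper halves of the sorted data (excluding the middle value when n is odd). That is one common convention; other software may compute quartiles slightly differently.

```python
from statistics import median

def five_number_summary(data):
    """Return (min, Q1, median, Q3, max) using medians of the two halves."""
    xs = sorted(data)
    half = len(xs) // 2
    lower, upper = xs[:half], xs[-half:]  # middle value excluded when n is odd
    return xs[0], median(lower), median(xs), median(upper), xs[-1]

data = [2, 5, 7, 8, 9, 10, 12, 13, 30]  # hypothetical data set

minimum, q1, med, q3, maximum = five_number_summary(data)
iqr = q3 - q1
low_fence = q1 - 1.5 * iqr
high_fence = q3 + 1.5 * iqr

outliers = [x for x in data if x < low_fence or x > high_fence]
print(q1, q3, iqr, outliers)
```

Here Q1 = 6 and Q3 = 12.5, so the IQR is 6.5 and the fences sit at -3.75 and 22.25. Only the value 30 falls outside them.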

P-value

the probability, calculated assuming the null hypothesis is true, of getting a result as extreme as or more extreme than the one actually observed
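
As an illustration, here is a P-value for a two-sided one-sample z-test of a proportion. The counts and the null value p0 = 0.5 are hypothetical, and the sketch assumes the usual large-sample conditions for a z-test are met.

```python
from math import sqrt
from statistics import NormalDist

# Hypothetical data: 60 successes in 100 trials, testing H0: p = 0.5 vs Ha: p != 0.5.
p0, n, successes = 0.5, 100, 60

p_hat = successes / n
z = (p_hat - p0) / sqrt(p0 * (1 - p0) / n)    # test statistic, computed assuming H0
p_value = 2 * (1 - NormalDist().cdf(abs(z)))  # two-sided: both tails count as extreme

print(round(z, 2), round(p_value, 4))
```

The test statistic is about 2 and the P-value about 0.0455, so this made-up result would be statistically significant at a 0.05 significance level but not at 0.01.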

Parameter

a number that describes a population

Percentile

tells what percent of a data set falls below the given observation

Placebo

a false treatment which should have no effect on an experiment - placebos should appear to be the same as the actual treatment

Pooled Procedures

occurs when separate samples are combined into a single sample for analysis - this should only be done if it is known that the variances of the two populations are equal

Population

the entire group of individuals that we want information about

Power of a Hypothesis Test

the probability that the test will reject the null hypothesis when the null hypothesis is false - the power is equal to 1 minus (probability of a Type II error for the given alternative)

Probability

the proportion of times an outcome would occur over a large number of trials

Probability Distribution Function

a function which assigns a probability for each possible value for any discrete random variable X

Proportion

tells what percent of a data set falls into a given category

Qualitative Variable

a variable which takes on a non-numeric description

Quantitative Variable

a variable which takes on a numeric value

Quartiles

observations which fall at the 25th, 50th, and 75th percentiles of a data set

Percentiles

Values that divide a data set into 100 equal parts - the 25th, 50th, and 75th percentiles are the quartiles.

Range

The difference between the maximum and minimum values of a data set.

Random

When individual outcomes are uncertain, but there is a pattern to the distribution of the outcomes over time.

Random Variable

A variable whose value is a numeric outcome of a random phenomenon.

Randomization

Using chance (the laws of probability) to make selections - this is done to select members for a sample and also to assign treatments to experimental units in experiments.

Regression Line

A straight line that describes how a response variable changes as the explanatory variable changes.

Residual

The difference between an observed value of a response variable and its predicted value from a regression equation.

Response Variable/Dependent Variable

Measures the outcome of a statistical study

Robustness

A measure of how much the P-value of a test is affected if the conditions of the hypothesis test are not met.

Sample

A part of the population used to gather information about the entire population.

Sample Space

A list of all possible outcomes for a random event.

Sampling Distribution

A distribution of values taken by a statistic in all possible samples of the same size from the same population.

Sampling Frame

A list from which a sample is chosen - ideally the sampling frame consists of the entire population.

Significance Level

The threshold (α) that a P-value is compared against - a result is statistically significant when its P-value is less than α.

Simple Random Sample (SRS)

A sample in which every member of the population has the same probability to be chosen, and every group of size n has the same probability to be chosen.

Simulation

A method for collecting data which uses the laws of probability to represent all possible outcomes of an experiment.

Skewed

Describes a distribution whose histogram extends much farther to one side of the mean than the other - the distribution is said to be skewed in the direction of this 'tail'.

Standard Deviation

Square root of the variance - used as a common measure of spread for a data set.
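
A short worked example of the calculation, using a made-up sample. It uses the sample variance, which divides by n - 1. A population variance would divide by n instead.

```python
from math import sqrt

data = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical sample

mean = sum(data) / len(data)
squared_deviations = [(x - mean) ** 2 for x in data]
variance = sum(squared_deviations) / (len(data) - 1)  # sample variance: divide by n - 1
standard_deviation = sqrt(variance)

print(mean, round(variance, 3), round(standard_deviation, 3))
```

For this sample the mean is 5, the squared deviations add up to 32, the variance is 32/7 (about 4.571), and the standard deviation is about 2.138.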

Standard Error

The estimated standard deviation of a statistic's sampling distribution, calculated from sample data - measures how far the statistic typically falls from the parameter it estimates.

Standard Normal Distribution

A normal distribution with a mean of zero and a standard deviation of one.

Statistic

A number that describes a sample.

Statistically Significant

An observed effect so large that it would be unlikely to occur by chance alone if the null hypothesis were true.

Stratified Random Sample

A sample chosen by splitting the population into several well-defined groups, then taking an SRS from each group.

Symmetric

Describes a distribution whose histogram has its left and right sides as mirror images of each other.

t-Distributions

A family of symmetric, bell-shaped distributions with a standard deviation larger than that of the standard normal distribution - the specific shape of the t-distribution changes as the sample size changes - this distribution is defined by its degrees of freedom.

Treatment

A specific experimental condition applied to an experimental unit or subject.

Treatment Group

A group of subjects who receive an actual treatment during an experiment.

Type I Error

When the null hypothesis is rejected but it is in fact true - the probability of a Type I Error is the significance level (α) for that test.

Type II Error

When the null hypothesis is not rejected but it is in fact false - the probability of a Type II Error must be calculated for a specific alternative test value.

Unbiased Statistic

A statistic whose sampling distribution has a mean equal to the true value of the parameter being estimated.

Undercoverage

A type of bias that occurs when some groups of a population are left out of the selection process for the sample.

Variability

Describes the spread of a data set.

Variable