AP Stat Last Minute Cram

0.0(0)

Studied by 0 people

Knowt Play

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/124

Earn XP

Description and Tags

Statistics

AP Statistics

cram

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

125 Terms

New cards

What to describe when asked “Describe the Distribution” or “Compare the distribution”

SOCV+ Context—(Shape, Outliers, Center, Variability)

Compare Center and Variability when asked to compare different distributions or data sets.

New cards

Describe Shape in Context

The distribution of (context) is (shape) with a peak at (highest point) and gaps between (gap)

Ex. The distribution of the exam scores is roughly symmetrical with a peak at 75 and gaps between 50 and 60.

New cards

Describe Outliers in Context

There seems to be outliers at (values)

Ex. There seem to be outliers at 95 and 20 in the exam scores distribution.

New cards

Describe Center in Context

The (mean/median) of the distribution is (mean/median + units).
if symmetric—use mean
if skewed—use median

Ex.The mean of the distribution is 75 points. (If skewed, use median instead.)

If asked to compare: Compare which is greater to that which is lesser

New cards

Describe Variability in Context

The distribution of (context) has a (SD/IQR/Range + units).

Ex.The distribution of the exam scores has a standard deviation of 10 points.

If asked to compare: Compare which distribution varies more

New cards

Interpret SD

“The (context) typically varies by about (SD + unit) from the mean of (mean+unit)”

New cards

Parameter

A number(or statement) that describes a population

New cards

Statistic

A number(or statement) that describes a sample

New cards

5 Number Summary

Minimum

Q1(25th Percentile)

Median

Q3(75th Percentile)

Maximum

New cards

2 Ways to Describe Location

Percentiles or Standardized Scores(z-scores)

New cards

Interpret a Z-Score

“(context) is (z-score) standard deviations (above(+)/below(-)) the mean of (μ+units)

New cards

Addition/Subtraction of Data

Shape- No change

Center/Location-±a

Variability- No Change

New cards

Multiplication/Division of Data

Shape- No change

Center/Location-x/÷ b

Variability-x/÷ b

New cards

Density Curves

models the distribution variable with a curve that:

is always above the horizontal axis
has exactly an area of 1 under it

Mean of a Density curve- Point at which the curve would balance if made of a solid material

Median of a Density curve- is the point that divides the area under the curve in half

New cards

Approximately Normal/Normal Curve

Roughly symmetric, single-peaked, bell-shaped density curve

Normal Dist. specified by 2 parameters: mean & SD

New cards

Empirical Rule

68-95-99.7 Rule

New cards

How to describe a scatterplot

Direction:(Positive/Negative/None)

Unusual Feature:(Outlier)

Form:(Linear/Nonlinear)

S:(Weak/Moderate/Strong)

*Describe Direction+Strength using correlation( r ), if given

New cards

Interpret a Scatterplot

“There is a (strength), (correlation), (form) relationship between (explanatory variable) and (response variable). There does/doesn’t seem to be unusual features in this relationship.(If yes, describe)”

Ex. There is a strong, positive, linear relationship between studying time and test scores. There doesn’t seem to be unusual features in this relationship.

New cards

Interpret Correlation( r )

“The correlation of r= ( r ) confirms that the linear association between (explanatory variable) and (response variable) is (positive/negative) and (weak/moderate/strong).”

New cards

Interpret Residuals—(Actual-Predicted)(y-ŷ)

“The actual (y-context) was (residual value) (above/below) the predicted value for x=(# in context)”

New cards

Interpret Slope(b)

“For every increase in (x-context) the predicted (y-context) (increases/decreases) by (slope unit of y).”

New cards

Interpret y-int(a)

“When (x-context) is 0, the predicted (y-context) is (y-int).”

New cards

Interpret Standard Deviation(In terms of LSRL)

“The actual (y-context) is typically about (s+unit) away from the number predicted by the LSRL with x=(context)

New cards

Context of Coefficient of Determination(r²)

“About (r²)% of the variability in (y-context) is accounted for by the LSRL at (x-context)

New cards

LSRL(Least Squares Regression Line) Equation

ŷ=a+bx

ŷ=predicted y

a=y-int

b=slope

x=explanatory variable

To find LSRL Eq. on Calc- Stat>Calc>8:Lin Reg(a+bx)

New cards

Residual Plot

Identifies if a Linear Model is appropriate

Appropriate if no leftover curved pattern

New cards

To find a & b

b=r * S_y/S_x a=ȳ-bx̄

New cards

Extrapolation

Explanatory Variables that are outside of the range of data which the LSRL was calculated

New cards

Influential Points

Can greatly affect correlation and regression calculaltions

New cards

Outliers

Out of pattern(large residuals)

New cards

High Leverage

Very large x-values

New cards

To tell if the Power Model is the best fit

Option 1: Raise the values of the explanatory variable by an integer, p

Option 2: Take the pth root of the response variable

New cards

To tell if the Exponential or Logarithmic Models are the best fit

Take the logarithm(log or ln) of one or both variables

New cards

Computer Generated Values

New cards

How to choose an SRS

Label, Randomize, Select

Must Be Without Replacement

New cards

Stratified Random Samplling

Taking a random sample from each strata(group)

More Precise Estimate

New cards

Cluster Sampling

Randomly select entire clusters- all individuals in the selected clusters are part of the sample

Saves time and money

New cards

Systematic Random Sample

Choose a k value, randomly select a starting value from 1 to k, choose every kth individual from the starting individual

New cards

Convenience Sampling

Choose individuals that are easiest to reach-BIAS

New cards

Voluntary Sampling

Individuals choose to be a part of the study b/c of open invitation-BIAS

New cards

Undercoverage

When some members of the population are less likely to be chosen or cannot be chosen in a sample

New cards

Nonresponse

When an individual chose for the sample can’t be contacted

New cards

Response Bias

When there is a systematic pattern of inaccurate answers to a survey question

New cards

Completely Randomized Design-Experiments

New cards

Randomized Block Design

New cards

Observational Study

Observes individuals and measures variables of interest but observes without influencing the response

New cards

Experimental Study

Deliberately imposes treatments on individuals to measure their responses

New cards

Matched Pairs Design

Uses blocks of size 2; twins especially

New cards

Statistically Significant

The observed results of a study are too unusual to be explained by chance alone

New cards

Probability for Statistical Significance

%<= 5% means something is statistically significant

New cards

Process of Identifying the Percentage(p-value)

Identify the difference in mean
Make a simulation and dotplot
Identify how many dots are greater of equal to the difference in mean from step 1
Calculate the percentage of how many dots are greater than or equal to the mean difference
Compare to the 5% rule and state if the study is statistically significant or not in the context of the problem

New cards

Scope of Inference

Random Selection of individuals allow inference about the population from which the individuals were chosen

Random Assignment of individuals to groups allows inference about cause and effect

New cards

P(A)

number of outcomes in event A/total number of outcomes in sample space

New cards

Complement Rule

P(A^c)=1-P(A)

New cards

Addition Rule for Mutually Exclusive Events

P(A U B)= P(A) + P(B)

New cards

General Addition Rule

P(A U B)= P(A) + P(B) - P(A ∩ B)

New cards

Conditional Probabilities(“given that”)

P(A|B) = P(A ∩ B) / P(B) = P(both events occur)/P(given event occurs)

New cards

Independent Events

P(A) = P(A|B^c)=P(A|B)

New cards

General Multiplication Rule

P(A ∩ B) = P(A) * P(B|A)

New cards

Multiplication Rule for Independent Events

P(A ∩ B) = P(A) * P(B)

New cards

“At least one” Probability Rule

P(at least one)=1-P(none)

New cards

Law of Large Numbers

If we observe more and more trials of any random process, the proportion approaches the true probability

New cards

Mutually Exclusive

No event can happen at the same time

New cards

Simulation

Imitates a random process in such a way that simulated outcomes are consistent with real-world outcomes

Simulation process:

1) Describe how you will simulate one trial(one repetition)

2) Perform many trials(repetitions)

3) Use the result to answer the question

New cards

Conditional Probability

Probability that one event happens given that another event is known to have happened

New cards

Independent Events

If knowing whether or not one event has occurred does not change the probability that the other event will happen

New cards

Mean/Expected Value

μX=E(X)=ΣxiP(xi)

New cards

Discrete Random Variable

Uses summation to calculate probabilities and means

New cards

Continuous Random Variable

Probabilities are areas under a density curve

New cards

Height of Density Curve

1/X₂-X₁

New cards

Probability of C

New cards

Combining Random Variables

New cards

Independent Random Variables

When x cannot hep or predict the value of y

Knowing the value of one variable does not change the probability of the other variable

New cards

Variance

σ²

New cards

Binomial Random Variables

When you have a fixed number of independent trials with the same probability of success

New cards

Conditions for Binomial Setting

BINS

Binary-”success” or “fail”

Independent-Knowing the outcome of one trial does not or tell us anything about the outcome of other trials. Or 10% Cond.

Number- fixed n number of trials

Same Probability- same probability p for every trial

New cards

10% Condition

If a binomial setting is not independent, we can use the 10% condition to treat each individual as independent

sample<=.1(population)

n<=10%N

New cards

Large Counts Condition

Helps us identify that the probability distribution of X is approximately Normal

n(p)>=10

n(1-p)>=10

New cards

Binomial Probability

P(X=x)=(nCx) (p)^x(1-p)^n-x

New cards

Mean(Expected Value)-Binomial

E(x)=np

New cards

SD-Binomial

\sqrt{np\left(1-p\right)} =σ_x

New cards

Geometric Random Variable

When you're counting the number of trials until the first success

New cards

Probability-Geometric

P(X=x)=(1−p)x−1p

New cards

Mean-Geometric

μ=p/1

New cards

SD-Geometric

\frac{\sqrt{1-p}}{p} = σ

New cards

Shape of Geometric Distribution

Always right-skewed when small sample size

-shape is right-skewed, p<.5

-shape is left-skewed, p>.5

-shape is approximately normal, p=.5

New cards

Interpretation of Probability

“There is a (probability/percentage) chance/probability of (context)”

New cards

Interpretation of Mean

“If many, many (unit) were randomly selected, the average (context) is about (μ + unit)”

New cards

Interpretation of SD

“If many, many, (unit) were randomly selected, the (context) typically varies by about (σ + unit) from the mean of (μ + unit)”

New cards

Describing Random Variable(Discrete, Continuous, Binomial, or Geometric)

Describe the Shape, Center, & Variability

New cards

Population Distribution

Values of ALL individuals in a sample

New cards

Sampling Distribution

Values of ALL POSSIBLE samples of the same size from the same population

New cards

Unbiased Estimator

If the center(μp̂ or μx̄) in equal to the true value of the parameter(p or μ)

New cards

Central Limit Theorem

If the population distribution is not Normal, but the sample size is large enough(n>=30), the sampling distribution is approx. Normal by CLT

New cards

Point Estimate

A chosen statistic(p-hat, x-bar, S_x) that will provide a reasonable estimate about the parameter

A+B/2

New cards

Margin of Error Strength

Confidence Level +; ME +(wider intervals)

Sample Size +; ME -(narrower intervals)

New cards

How to make a Confidence Interval(One Sample)

Choose: One-Sample z interval for p

Conditions: Random, 10%, Large Counts

Calculate: Stat>Tests>1-PropZInt
x:n(p-hat)
n:sample size
c-level:c%

Conclude: Interpret

New cards

How to make a Confidence Interval(Two Sample)

Choose: Two-Sample z interval for p₁-p₂

Conditions: Random, 10%, Large Counts (For Both Samples)

Calculate: Stat>Tests>2-PropZInt
x:n(p-hat)
n:sample size
c-level:c%

Conclude: Interpret

New cards

Convincing Evidence in Confidence Intervals

(+,+)-1st proportion is greater

(-,-)-2nd proportion is greater

(+,-)- No convincing evidence of a difference b/c interval contains 0

100

New cards

Interpretation of a Confidence Interval

We are (c%) confident that the interval from A to B captures the p=true proportion of [parameter in context].