AP Statistics

0.0(0)

Studied by 3 people

Knowt Play

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/106

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

107 Terms

New cards

Quantitative Variable

Numerical Values

New cards

Categorical Values

Names or group labels

New cards

2 graphs for categorical variables

Bar graphs
Mosaic plot

New cards

Quantitative Variable

Discrete: whole numbers
Continuous: infinite numbers

New cards

Describe Distribution SOCV

Shape
Outliers
Center
Variation

New cards

How to describe the standard deviation

“The context varies by SD from the mean of x”

New cards

Mean and SD and greatly (blank) while the median is (blank)

affected by outliers
not

New cards

For symmetric use (blank) and skewed and outliers use (blank)

mean, SD
median

New cards

IQR method

low outlier < Q1 - 1.5IQR
high outlier > Q3 + 1.5IQR

New cards

The (blank) percentile is the value that p% of the data (blank)

pth
less than or equal to it

New cards

Q1 Percentile

Median Percentile

Q3 Percentile

25th
50th
75th

New cards

Standardizing a distribution explanation/z score

“context is when the z score standard deviations above/below mean”

New cards

What happens to the shape if you add/subtract/multiply/divide by a value?

it stays the same.

New cards

When happens to the center when you add/subtract/multiply/divide

it changes according to the value

New cards

What happens to the variability when you multiply/divide

it changes according to the value

New cards

Standardizing a distribution means the mean and standard deviation is

0 and 1

New cards

Skewed right

symmetric

Skewed left

mean > median
mean = median
mean < median

New cards

Empirical Rule

68, 95, 99.7

New cards

How to find proportion

NormalCdf

New cards

When given proportion

InvNorm

New cards

Explanatory

Response

x variable
y variable

New cards

Describe a relationship DUFS

Direction
Unusual behavior
Form
Shape

+Context

New cards

How to describe correlation

“The linear relationship between x and y is (strength) and (direction)”

New cards

Coefficient of Determination r² context

“The percent of the variation in y explained by the linear relationship with x”

New cards

Z-Score formula

value-mean/SD

New cards

Residual formula

Actual-Predicted

New cards

Residual Context

“The actual context was residual above/below the predicted value for x = #”

New cards

Interpretations

“when x=0 context, the predicted y context is y-int.”

“for each additional x-context, the predicted y context increases/decreases by slope.”

New cards

What is good residual plot and what is bad one?

Good = no pattern
Bad = pattern

New cards

High Leverage

Influential

Very large x values
if removed, the slope changes, y intercept and r

New cards

What is wrong with convenience sampling and voluntary sampling?

Leads to bias

New cards

Simple Random procedure

Label Individuals
Randomize (number generator, names in hat)
Select

New cards

Stratified Random Sampling

Splits the population into groups with like-characteristics (strata)
Chooses randomly from each Strata

+low bias and low variability

New cards

Cluster Random Sampling

a sample from some of all the groups

New cards

Different types of Bias

Undercoverage
Nonresponse
Response Bias

New cards

A confounding variable affects the (blank)

response variable
also related to the explanatory variable

New cards

Experimental Units

What/who the treatment is used on

New cards

Treatments

What is done or not done to the experimental units

New cards

How to make a well-designed experiment

Comparison
Random Assignment
Replication
Control

New cards

How to make a block design

Separate subjects into blocks and then randomly assign treatments

New cards

Matched Pair Design

Subjects are paired and then randomly assigned to a treatment
each subject receives two treatments (order of treatment is randomized)

New cards

What is Statistically significant

When results of an experiences is unlikely (less than 5%) to happen purely by chance

if significant we evidence that the treatment caused the difference

New cards

A random sample allows us to (blank) our conclusions to the population from which we sampled

generalize

New cards

Random Assignment allows us to conclude (blank) in the response variable

a treatment causes change

New cards

Long run relative frequency

always between 1 and 0
short run unpredictable
long run is predictable

New cards

Law of Large Numbers

Simulated probabilities tend to get closer to the true probability as the number of trials increase

New cards

Simulation

a way to model random events, such that simulated outcomes closely match real world outcomes

New cards

Evidence for a claim

Assuming a claim is true, find the probability of getting the observed result or more extreme
<5% statistically significant evidence against the claim

New cards

P(E) List all possible outcomes

number of outcomes in E/total outcomes in sample space

New cards

Complement rule

P(A^c) = 1 - P(A)
probability of the event not happening

New cards

P(A and B) / P(A∩B)

both events will occur

New cards

P(A or B) / P(A∪B)

one or the other or both

New cards

Addition rule when P(A or B)

P(A) + P(B) - P(A and B)

New cards

P(A/B) given probability

P(A and B) / P(B)

New cards

Independent events

One event does not change the probability for another
P(A) = P(A/B) = P(A/B^c)

New cards

General Multiplication Rule

P(A and B) = P(A) x P(B/A)

New cards

General Multiplication Rule when variables are independent

P(B/A) = P(B) so P(A and B) = P(A) x P(B)

New cards

at least 1 probability

P(at least 1) = 1 - P(none)

New cards

Combining Random variables

mean + mean
mean - mean
√SD²+SD²

New cards

Binomial Random Variable requirements

Binary
Independent
Number of trials
Same Probability

New cards

P(x=k)

binompdf

New cards

P(x<k)

binomcdf

New cards

Mean and Standard deviation for binomials

M = np

SD = √np(1-p)

New cards

10% condition

For a random sample without replacement the size of the population has to be n<0.10

New cards

Geometric distribution requirements

Binary
Independent
Trials until success
Same probability of success

New cards

Mean and Standard Deviation and Shape of Geometric distribution

M = 1/p
SD √1-p / p (ONLY TOP PART)
skewed right

New cards

A statistic is used to (blank)

estimate a parameter

New cards

Sampling Distribution Definition

The distribution of values for a statistic for all possible samples of a given size from a given population

New cards

Biased Estimator

overestimates or underestimates the true population parameter

New cards

Unbiased Estimator

mean of the sampling distribution is equal to the population parameter

New cards

A good statistic has a (blank) and a (blank)

low bias
low variability

New cards

Steps to check the Sampling Distribution of p hat

Z score as well

Shape Normal: np>10 and n(1-p)>10

Center: M=p

Variability: √p(1-p)/n

P hat - p / √p(1-p)/n

New cards

How to check sampling distribution x hat

Normal if distribution is normal

M = M

SD = SD/√n

New cards

Central Limit Theory

The sampling distribution of x hat is approximately normal when the sample size is large enough (n>30)

New cards

Confidence Interval

point estimate +- margin of error

interval (A,B)

P.E A+B/2

M.E B-A/2

New cards

How to interpret the Confidence interval

“We are % confident that the interval from A to B captures the true context.”

All values from A to B are plausible

New cards

How to interpret the Confidence level

“If we take many, many samples and calculate a confidence interval for each, about % of them will capture the true context.”

New cards

When you increase C.I and M.E

wider interval

New cards

increase trials and lower M.E

narrower interval

New cards

Conditions for C.I for proportion

Random Sample
10% condition n < 10
Large Counts np>10 and n(1-p)>10

New cards

Specific Formula for C.I for proportion

New cards

The 4 Cs for Proportion inference

Choose procedure, parameter, confidence level
Check Conditions
Calculate
Conclude interpret

New cards

Choosing a Sample Size

also when p is unknown, use 0.5 and if n is a decimal you round up

New cards

How to evaluate a claim

(+,+) convincing evidence 1st proportion is greater

(-,-) convincing evidence that 1st proportion is less

(-,+) no convincing evidence of a difference

New cards

Null Hypothesis and Alternative

H_0:p = null value

H_a: p < null value, p > null value, p ≠ null value

New cards

How interpret p-value

Assuming the null hypothesis is true, there is a p value probability of getting p hat of (blank) or more extreme purely by chance
Because p-value is < 0.05 we reject H_o and we do have convincing evidence for H_acontext
Because p-value is > 0.05 we reject H_aand we do not have convincing evidence for H_acontext

New cards

Calculate test statistic for Parameter

test statistic = statistic - parameter / SD

New cards

p value for a 2 sided parameter

p value = area x 2

New cards

Type I error

Null is true but we reject it

New cards

Type II error

Alternate is true but fail to reject Null

New cards

P(type I error)

P(type II error)

Type I = 0.05

Type II = 1 - power

New cards

Power Equation

P(reject Null/Accept alternative)

New cards

Interpret Power

“If the alternate is true (specific value in context) there is a power probability of finding

New cards

Conditions for constructing C.I for mean

Random Sample
10% condition
Normal Sample n>30 also if the distribution just looks normla

New cards

Degrees of Freedom

n-1

New cards

If null Hypothesis is in the interval

Fail to reject Null Hypothesis

New cards

If a mean sample is paired that is just (blank)

a one sample test, not two sample

New cards

x² is the (blank)

goodness of fit

New cards

Null Hypothesis and alternate for chi square

The claimed distribution of categorical variable is true
The claimed distribution of categorical variable is not true

100

New cards

Test statistic and p-value for chi squared

(O-E)²/E

Expected = np

df = # of catergories - 1

x²cdf(x²,9999,df)