AP Statistics

0.0(0)
studied byStudied by 2 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/106

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

107 Terms

1
New cards

Quantitative Variable

Numerical Values

2
New cards

Categorical Values

Names or group labels

3
New cards

2 graphs for categorical variables

  • Bar graphs

  • Mosaic plot

4
New cards

Quantitative Variable

  • Discrete: whole numbers

  • Continuous: infinite numbers

5
New cards

Describe Distribution SOCV

  • Shape

  • Outliers

  • Center

  • Variation

6
New cards

How to describe the standard deviation

“The context varies by SD from the mean of x”

7
New cards

Mean and SD and greatly (blank) while the median is (blank)

  • affected by outliers

  • not

8
New cards

For symmetric use (blank) and skewed and outliers use (blank)

  • mean, SD

  • median

9
New cards

IQR method

  • low outlier < Q1 - 1.5IQR

  • high outlier > Q3 + 1.5IQR

10
New cards

The (blank) percentile is the value that p% of the data (blank)

  • pth

  • less than or equal to it

11
New cards

Q1 Percentile

Median Percentile

Q3 Percentile

  • 25th

  • 50th

  • 75th

12
New cards

Standardizing a distribution explanation/z score

“context is when the z score standard deviations above/below mean”

13
New cards

What happens to the shape if you add/subtract/multiply/divide by a value?

it stays the same.

14
New cards

When happens to the center when you add/subtract/multiply/divide

it changes according to the value

15
New cards

What happens to the variability when you multiply/divide

it changes according to the value

16
New cards

Standardizing a distribution means the mean and standard deviation is

0 and 1

17
New cards

Skewed right

symmetric

Skewed left

  • mean > median

  • mean = median

  • mean < median

18
New cards

Empirical Rule

68, 95, 99.7

19
New cards

How to find proportion

NormalCdf

20
New cards

When given proportion

InvNorm

21
New cards

Explanatory

Response

  • x variable

  • y variable

22
New cards

Describe a relationship DUFS

  • Direction

  • Unusual behavior

  • Form

  • Shape

+Context

23
New cards

How to describe correlation

“The linear relationship between x and y is (strength) and (direction)”

24
New cards

Coefficient of Determination r2 context

“The percent of the variation in y explained by the linear relationship with x”

25
New cards

Z-Score formula

value-mean/SD

26
New cards

Residual formula

Actual-Predicted

27
New cards

Residual Context

“The actual context was residual above/below the predicted value for x = #”

28
New cards

Interpretations

“when x=0 context, the predicted y context is y-int.”

“for each additional x-context, the predicted y context increases/decreases by slope.”

29
New cards

What is good residual plot and what is bad one?

  • Good = no pattern

  • Bad = pattern

30
New cards

High Leverage

Influential

  • Very large x values

  • if removed, the slope changes, y intercept and r

31
New cards

What is wrong with convenience sampling and voluntary sampling?

Leads to bias

32
New cards

Simple Random procedure

  1. Label Individuals

  2. Randomize (number generator, names in hat)

  3. Select

33
New cards

Stratified Random Sampling

  • Splits the population into groups with like-characteristics (strata)

  • Chooses randomly from each Strata

+low bias and low variability

34
New cards

Cluster Random Sampling

a sample from some of all the groups

35
New cards

Different types of Bias

  • Undercoverage

  • Nonresponse

  • Response Bias

36
New cards

A confounding variable affects the (blank)

  • response variable

  • also related to the explanatory variable

37
New cards

Experimental Units

What/who the treatment is used on

38
New cards

Treatments

What is done or not done to the experimental units

39
New cards

How to make a well-designed experiment

  1. Comparison

  2. Random Assignment

  3. Replication

  4. Control

40
New cards

How to make a block design

Separate subjects into blocks and then randomly assign treatments

41
New cards

Matched Pair Design

  1. Subjects are paired and then randomly assigned to a treatment

  2. each subject receives two treatments (order of treatment is randomized)

42
New cards

What is Statistically significant

When results of an experiences is unlikely (less than 5%) to happen purely by chance

if significant we evidence that the treatment caused the difference

43
New cards

A random sample allows us to (blank) our conclusions to the population from which we sampled

generalize

44
New cards

Random Assignment allows us to conclude (blank) in the response variable

a treatment causes change

45
New cards

Long run relative frequency

  • always between 1 and 0

  • short run unpredictable

  • long run is predictable

46
New cards

Law of Large Numbers

Simulated probabilities tend to get closer to the true probability as the number of trials increase

47
New cards

Simulation

a way to model random events, such that simulated outcomes closely match real world outcomes

48
New cards

Evidence for a claim

  • Assuming a claim is true, find the probability of getting the observed result or more extreme

  • <5% statistically significant evidence against the claim

49
New cards

P(E) List all possible outcomes

number of outcomes in E/total outcomes in sample space

50
New cards

Complement rule

  • P(Ac) = 1 - P(A)

  • probability of the event not happening

51
New cards

P(A and B) / P(A∩B)

both events will occur

52
New cards

P(A or B) / P(A∪B)

one or the other or both

53
New cards

Addition rule when P(A or B)

P(A) + P(B) - P(A and B)

54
New cards

P(A/B) given probability

P(A and B) / P(B)

55
New cards

Independent events

  • One event does not change the probability for another

  • P(A) = P(A/B) = P(A/Bc)

56
New cards

General Multiplication Rule

P(A and B) = P(A) x P(B/A)

57
New cards

General Multiplication Rule when variables are independent

P(B/A) = P(B) so P(A and B) = P(A) x P(B)

58
New cards

at least 1 probability

P(at least 1) = 1 - P(none)

59
New cards

Combining Random variables

  • mean + mean

  • mean - mean

  • √SD2+SD2

60
New cards

Binomial Random Variable requirements

  • Binary

  • Independent

  • Number of trials

  • Same Probability

61
New cards

P(x=k)

binompdf

62
New cards

P(x<k)

binomcdf

63
New cards

Mean and Standard deviation for binomials

M = np

SD = √np(1-p)

64
New cards

10% condition

For a random sample without replacement the size of the population has to be n<0.10

65
New cards

Geometric distribution requirements

  • Binary

  • Independent

  • Trials until success

  • Same probability of success

66
New cards

Mean and Standard Deviation and Shape of Geometric distribution

  • M = 1/p

  • SD √1-p / p (ONLY TOP PART)

  • skewed right

67
New cards

A statistic is used to (blank)

estimate a parameter

68
New cards

Sampling Distribution Definition

The distribution of values for a statistic for all possible samples of a given size from a given population

69
New cards

Biased Estimator

overestimates or underestimates the true population parameter

70
New cards

Unbiased Estimator

mean of the sampling distribution is equal to the population parameter

71
New cards

A good statistic has a (blank) and a (blank)

  • low bias

  • low variability

72
New cards

Steps to check the Sampling Distribution of p hat

  • Z score as well

Shape Normal: np>10 and n(1-p)>10

Center: M=p

Variability: √p(1-p)/n

P hat - p / √p(1-p)/n

73
New cards

How to check sampling distribution x hat

Normal if distribution is normal

M = M

SD = SD/√n

74
New cards

Central Limit Theory

The sampling distribution of x hat is approximately normal when the sample size is large enough (n>30)

75
New cards

Confidence Interval

point estimate +- margin of error

interval (A,B)

P.E A+B/2

M.E B-A/2

76
New cards

How to interpret the Confidence interval

“We are % confident that the interval from A to B captures the true context.”

All values from A to B are plausible

77
New cards

How to interpret the Confidence level

“If we take many, many samples and calculate a confidence interval for each, about % of them will capture the true context.”

78
New cards

When you increase C.I and M.E

wider interval

79
New cards

increase trials and lower M.E

narrower interval

80
New cards

Conditions for C.I for proportion

  1. Random Sample

  2. 10% condition n < 10

  3. Large Counts np>10 and n(1-p)>10

81
New cards

Specific Formula for C.I for proportion

knowt flashcard image
82
New cards

The 4 Cs for Proportion inference

  1. Choose procedure, parameter, confidence level

  2. Check Conditions

  3. Calculate

  4. Conclude interpret

83
New cards

Choosing a Sample Size

also when p is unknown, use 0.5 and if n is a decimal you round up

<p>also when p is unknown, use 0.5 and if n is a decimal you round up</p>
84
New cards

How to evaluate a claim

(+,+) convincing evidence 1st proportion is greater

(-,-) convincing evidence that 1st proportion is less

(-,+) no convincing evidence of a difference

85
New cards

Null Hypothesis and Alternative

H0: p = null value

Ha: p < null value, p > null value, p ≠ null value

86
New cards

How interpret p-value

  • Assuming the null hypothesis is true, there is a p value probability of getting p hat of (blank) or more extreme purely by chance

  • Because p-value is < 0.05 we reject Ho and we do have convincing evidence for Ha context

  • Because p-value is > 0.05 we reject Ha and we do not have convincing evidence for Ha context

87
New cards

Calculate test statistic for Parameter

test statistic = statistic - parameter / SD

88
New cards

p value for a 2 sided parameter

p value = area x 2

89
New cards

Type I error

Null is true but we reject it

90
New cards

Type II error

Alternate is true but fail to reject Null

91
New cards

P(type I error)

P(type II error)

Type I = 0.05

Type II = 1 - power

92
New cards

Power Equation

P(reject Null/Accept alternative)

93
New cards

Interpret Power

“If the alternate is true (specific value in context) there is a power probability of finding

94
New cards

Conditions for constructing C.I for mean

  1. Random Sample

  2. 10% condition

  3. Normal Sample n>30 also if the distribution just looks normla

95
New cards

Degrees of Freedom

n-1

96
New cards

If null Hypothesis is in the interval

Fail to reject Null Hypothesis

97
New cards

If a mean sample is paired that is just (blank)

a one sample test, not two sample

98
New cards

x2 is the (blank)

goodness of fit

99
New cards

Null Hypothesis and alternate for chi square

  • The claimed distribution of categorical variable is true

  • The claimed distribution of categorical variable is not true

100
New cards

Test statistic and p-value for chi squared

(O-E)2/E

Expected = np

df = # of catergories - 1

x2cdf(x2,9999,df)