AP Statistics - Last Minute AP Review

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/79

flashcard set

Earn XP

Description and Tags

guess who prioritized another AP class and forgot to study for stats lol gl to you all too!

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

80 Terms

1
New cards

variable

characteristics about each case

2
New cards

case

individual we collect data from

3
New cards

statistic

value calculated from a sample

4
New cards

parameter

value calculated from a population

5
New cards

variance

a measure of spread, standard deviation squared

6
New cards

when data is unimodal symmetric

use mean and standard deviation

7
New cards

when data is skewed

use median and iqr

8
New cards

outlier rule

data values below q1 - 1.5(iqr) and above q3 - 1.5(iqr) are outliers

9
New cards

cumulative frequency graph

y-axis is always 0 - 100%, slope is never negative

10
New cards

z-score

[x - (x-bar)] / s

11
New cards

shifting data

(adding or subtracting) affects measures of center only

12
New cards

rescaling data

(multiplying or dividing) affects measures of center and spread

13
New cards

describing distributions

the distribution of [context] is [shape], centered at a mean/median of [center] and spread out with a standard deviation/iqr of [spread]. [outliers]

14
New cards

explanatory

x, independent

values being changed in an experiment (factors)

15
New cards

response

y, dependent

values being measured in an experiment

16
New cards

describing associations

there is a [direction], [strength], [form] association between [variable 1] and [variable 2]

17
New cards

correlation constant (r)

affected by outliers, unaffected by shifting and rescaling

measures strength of the linear association

18
New cards

slope interpretation

for each additional [explanatory], the predicted [response] increases/decreases by [slope]

19
New cards

Intercept interpretation

the model predicts that a(n) [explanatory] of 0 [x-units] will be [y-intercept]

this is/is not significant…

20
New cards

R^2 interpretation

[R^2] of the variability in [response] is accounted for by differences in the linear model using [explanatory]

21
New cards

residual

y - (y-hat)

“actual minus predicted”

22
New cards

leverage

away from the other points horizontally that has great influence on the linear model and r

23
New cards

influential point

changes the slope significantly

24
New cards

bowl of soup

the size of the sample doesn't matter compared to the quality of the sample (bigger does not equal better)

25
New cards

census

sampling the whole population

26
New cards

sampling frame

the list you draw your sample from

27
New cards

simple random sample (srs)

every possible group of n from a population has an equal chance of being sampled

28
New cards

stratified sample

divide the population into groups of individuals, called strata, that are similar in some way and do an srs in each to form a full sample

29
New cards

cluster sample

divides the population into clusters and randomly select to sample all individuals within them

30
New cards

systematic sample

sample every nth person, randomly determining where to start

31
New cards

multistage sample

a combination of two or more sampling methods

32
New cards

voluntary response bias

people choose whether or not to respond, typically those with stronger opinions

33
New cards

response bias

leads people to respond a certain way

34
New cards

undercoverage

some groups of the population are left out of the sample

35
New cards

nonresponse bias

an individual chosen for the sample cannot be contacted or does not cooperate

36
New cards

convenience sample

type of bias where you choose individuals who are easiest to reach

37
New cards

retrospective observational study

past data

38
New cards

prospective observational study

looking at data as it happens

39
New cards

treatment

made up of a combination of factors, assigned randomly

40
New cards

statically significant

when the observed difference is so great that it couldn't have been due to chance/randomization

41
New cards

blocking

reduces variability so that differences we see can be attributed to the treatments imposed

42
New cards

control group

provides a baseline (“basis for comparison”)

43
New cards

placebo effect

subjects can respond to something that doesn't exist

44
New cards

blinding

single: subjects don't know

double: subjects and researchers don’t know

45
New cards

confounding variable

another variable other than the factors that affects the response variable

46
New cards

lurking variables

affect both the explanatory and response variables

47
New cards

matched pairs design

one subject gets both treatments or use naturally paired subjects

48
New cards

replication

there needs to be more than one subject in each treatment group, if not replicate the experiment again

49
New cards

designing an experiment: introduction

the factors/response variable/treatments are…

[treatment 1] is the control group, which provides a baseline for comparison so we can see if there is an actual difference in [response variable] for subjects who [explanatory]

50
New cards

designing an experiment: randomization

we will randomly assign the n subjects to the treatments

assign them a number, #1-n and use a random number generator

the first [# of subjects] unique numbers are group 1

the next [# of subjects] unique numbers are group 2/3/4..

the remaining [# of subjects] are the last group

repeat this process for the [other] block

51
New cards

designing an experiment: blocking

we block this experiment by [blocking variable] because we want to reduce variability so that we can attribute the differences we see to the treatments being imposed

52
New cards

designing an experiment: blinding

we will (double) blind this experiment

the subjects won’t know what treatment they get [context] {and/or} the researcher won’t know who got what treatment

we blind the subjects to nullify the placebo effect, because if the subjects knew what treatment they were getting, the result could be from knowing instead of [explanatory] {and/or} we blind the researchers to keep them honest

53
New cards

sample space

set of all possible outcomes (S = {1, 2, 3…})

54
New cards

mutually exclusive/disjoint

events don’t share any outcomes in common

55
New cards

addition rule

for mutually exclusive: P(AuB) = P(A) + P(B)in general: P(AuB) = P(A) + P(B) - P(AnB)

56
New cards

union (u)

either event A, B, or both occurring

57
New cards

intersection (n)

event A and B happening

58
New cards

multiplication rule

if there are multiple things happening, multiply their probabilities

59
New cards

Independent events

events A and B are independent if P(B) = [P(B|A)]

60
New cards

conditional probability

P(A|B) = P(AnB) / P(B)

61
New cards

at least one probability

P(not 0) = 1 - P(0)

62
New cards

law of averages

is a scam, do not believe in it or its lies

probabilities do not change the more times one event occurs, nor do they do so to balance things out :c

63
New cards

law of large numbers

as we repeat a random process over and over again, the true probability will emerge

64
New cards

random variable (X)

a number based on a random event

65
New cards

expected value of a random variable

E(X) = Σ(x * P(x))

sum of value of that outcome times the probability it will occur for every outcome

66
New cards

standard deviation of a random variable

SD(X) = sqrt {Σ[(x - (x-bar))^2 * P(X)]}

the square root of the sum of [the value of the outcome minus E(X)] squared times the probability of that event occurring, for every outcome

67
New cards

adding two random variables

E(X + Y) = E(X) + E(Y)

SD(X + Y) = sqrt [SD(X)^2 + SD(Y)^2]

68
New cards

subtracting two random variables

E(X - Y) = E(X) - E(Y)

SD(X - Y) = sqrt [SD(X)^2 + SD(Y)^2]

69
New cards

summing a series

E(X1, X2…Xn) = n * E(X)

SD (X1, X2…Xn) = sqrt [n * SD(X)^2]

70
New cards

Bernoulli trials

only two possible outcomes, probability of success (p) is the same for every trial, trials are independent of each other

71
New cards

geometric probability model

about getting to the first success

P(X=x) = (1-p)^(x-1) * p

72
New cards

probability that the first success is the nth trial

P(X=n) = (1-p)^(n-1) * p

geometric pdf (p, n)

73
New cards

probability that the first success within n trials

P(X=1) + P(X=2)…+ P(x=n)

geometric cdf (p, n)

74
New cards

binomial probability model

fixed number of trials

P(X=x) = (nx) * p^x * (1 - p)^(nx)

n = number of trials

75
New cards

probability of x successes within n amount of trials

P(X=x) = (nx) * p^x * (1 - p)^(nx)

binom cdf (n, p, x)

76
New cards

probability of fewer than x successes within n amount of trials

binom cdf (n, p, [x-1])

x is not included

77
New cards

probability of more than x successes within n amount of trials

1 - binom cdf (n, p, x)

x is not included

78
New cards

probability of at least x successes within n amount of trials

1 - binom cdf (n, p, [x-1])

x is included

79
New cards

probability of at most x successes within n amount of trials

binom cdf (n, p, x)

x is included

80
New cards

probability of between x1 and x2 successes in n amount of trials

binom cdf (n, p, x2) - binom cdf (n, p, (x1-1)