AP Statistics

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/134

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

135 Terms

1
New cards

categorical variable

variable that cannot be quantified (e.g. M&M color, Skittle flavor)

2
New cards

quantitative variable

variable that can be expressed as a number

3
New cards

relative frequency

the number of times an event occurs divided by the total number of events occurring (e.g. the team won 70% (7/10) of their games)

4
New cards

marginal relative frequency

proportion of the whole (e.g. % of all students who chose music class)

5
New cards

joint relative frequency

combined proportions of the whole (e.g. % of all students who chose art as their favorite elective and math as their favorite core class)

6
New cards

conditional relative frequency

proportion of a part; this given that (e.g. % of the students who chose technology as their favorite elective given they chose math as their favorite core class)

7
New cards

segmented bar graph

bars are stacked to make up 100%

8
New cards

mosaic plot

segmented bar graph where width of the bars is proportional to the size of the group

9
New cards

association

when knowing the value of one variable helps us predict the other variable

10
New cards

discrete variable

variable with a countable number of values

11
New cards

continuous variable

variable with an infinite number of possible values

12
New cards

SOCS

Shape, Outliers, Center, Spread

13
New cards

unimodal

when a graph has only one peak

14
New cards

bimodal

when a graph has two peaks

15
New cards

non-resistant

measures of a distribution that are affected by outliers (mean, standard deviation)

16
New cards

IQR (interquartile range)

represents the middle 50% of a dataset, spanning from the 25th to 75th percentile

17
New cards

interpretation of standard deviation

“The context typically varies by standard deviation from the mean of .

18
New cards

1.5IQR method for outliers

low outlier < Q1 - 1.5IQR

high outlier > Q3 + 1.5IQR

19
New cards

2SD method for outliers

low outlier < mean - 2SD

high outlier > mean + 2SD

20
New cards

five-number summary

minimum, Q1, median, Q3, maximum

21
New cards

examples of comparative language

greater than, less than, similar to

22
New cards

adverbs for describing a distribution

strongly, roughly, approximately, moderately

23
New cards

percentile

The Pth percentile is the value that has P% of the data less than or equal to it

24
New cards

z-score

number of standard deviations a value is from the mean; shows position relative to other values in the distribution

25
New cards

interpretation for z-score

Context is z-score standard deviations above/below the mean of .”

26
New cards

linear transformation of data

add/subtract: only mean changes

multiply/divide: both mean and SD change

27
New cards

standardizing a distribution

mean = 0, SD = 1, shape is the same

28
New cards

empirical rule

68% of the data is contained within 1 SD of the mean, 95% is contained within 2, and 99.7% is contained within 3

29
New cards

DUFS

Direction, Unusual features, Form, Strength

30
New cards

unusual features

outliers or clusters

31
New cards

form

linear or nonlinear

32
New cards

strength

how close the data is to the form

33
New cards

r for correlation

direction: +/-

form: linear

strength: between -1 and 1

34
New cards

interpretation for r

“The linear relationship between x and y is strength and direction.”

35
New cards

interpretation for r2 (coefficient of determination)

r2% of the variation in y i accounted for by the linear relationship with x.”

36
New cards

extrapolation

the process of estimating unknown values by extending or projecting from known values

37
New cards

residual

actual-predicted

38
New cards

interpretation for residual

“The actual context was residual above/below the predicted value for x = #.”

39
New cards

interpretation for y-intercept

“When x = 0 in context, the predicted y context is y-intercept.”

40
New cards

interpretation for slope

“For each additional x context, the predicted y context increases/decreases by slope.”

41
New cards

least squares regression line (LSRL)

minimizes the sum of the residuals

42
New cards

residual plot

If there is no clear pattern in the residuals, it is appropriate to use a linear model

43
New cards

interpretation for standard deviation of the residuals (s)

“The actual y context is typically about s away from the number predicted by the LSRL.”

44
New cards

interpretation for r2 of LSRL

“About r2% of the variability in y context is accounted for by the LSRL.”

45
New cards

LSRL outliers

points with high residuals

46
New cards

LSRL formula

ŷ = a + bx

47
New cards

LSRL slope formula

b = r(Sy/Sx)

48
New cards

LSRL constant formula

a = ȳ - bx̄

49
New cards

convenience sample

participants are chosen based on access and availability; non-random

50
New cards

stratified random sample

splits population into homogeneous groups based on shared characteristics (Taylor Swift concert rows)

51
New cards

cluster sample

split population into heterogeneous groups, select a number of groups, and sample everyone in each group

52
New cards

systematic random sample

choose random starting point, select participants at equal intervals (e.g. every 5th person)

53
New cards

standards for good sampling methods

low bias, low variability

54
New cards

undercoverage

some people are less likely to be chosen (e.g. people without cell phones)

55
New cards

non-response

people can’t be reached or refuse to answer

56
New cards

response bias

problems in data gathering instrument/process (question wording, people don’t tell the truth, etc.)

57
New cards

experiment

procedure where treatment is imposed on experimental units

58
New cards

observational study

no treatment is imposed on participants

59
New cards

experimental units

who/what the treatment is imposed on

60
New cards
61
New cards

criteria for a well-design experiment

comparison of 2+ treatments, random assignment, replication (more than one subject in each treatment group), control

62
New cards

benefit of random assignment

shows causation

63
New cards

benefit of random sampling

results can be generalized to the population

64
New cards

matched pairs design

subjects are paired based on similarity and then randomly assigned to a treatment (e.g. similar SAT scores) OR each subject receives both treatments (e.g. two sides of a leaf)

65
New cards

randomized block design

subjects are separated into blocks based on similarities and randomly assigned treatments within each block (e.g. testing student performance in different grades)

<p>subjects are separated into blocks based on similarities and randomly assigned treatments within each block (e.g. testing student performance in different grades)</p>
66
New cards

statistical significance

when results of an experiment are unlikely (<5%) to have occurred by chance

67
New cards

law of large numbers

simulated probabilities tend to get closer to the true probability as the number of trials increases

68
New cards

long run relative frequency

more predictable than short run relative frequency

69
New cards

complement rule

probability of an event NOT occurring; P(Ac) = 1 - P(A)

70
New cards

addition rule

P(AB) = P(A) + P(B) - P(AB)

*if events A and B are mutually exclusive, they cannot occur together and P(A∩B) = 0

71
New cards

mutually exclusive

when events cannot occur at the same time (e.g. heads and tails)

72
New cards

independent

the outcome of one event does not affect the outcome of the other

73
New cards

superscript c

denotes a complement; the probability or trials where something does not happen

74
New cards

when combining random variables…

add/subtract means, add variances

75
New cards

binomial distribution

a distribution with a fixed number of trials and two possible outcomes

76
New cards

formula for binomial probability

px = (nCx)pxqn-x

n = number of trials

x = number of successes

p = probability of success

q = probability of failure

77
New cards

BINS

Binary (success or failure), Independent trials, Number of trials is fixed, Same probability of success (p)

78
New cards

calculator function for P(x = r)

binompdf(n, p, x)

79
New cards

calculator function for P(x ≤ r)

binomcdf(n, p, x)

80
New cards

calculator function for P(x ≥ r)

1 - binomcdf (n, p, n-x)

81
New cards

mean for binomial distribution

µ = np

82
New cards

standard deviation for binomial distibution

σ = √np(1-p)

83
New cards

interpretation for mean of binomial distribution

“After many groups of n trials, the average number of successes is µ.”

84
New cards

interpretation for the standard deviation of binomial distribution

“The number of successes typically varies by σ from the mean of µ.”

85
New cards

large counts condition

np ≥ 10 and n(1-p) ≥ 10; allows for the use of a normal distribution and its calculations

86
New cards

10% condition

n ≤ .10N; allows for sampling without replacement

87
New cards

BITS

Binary (success or failure), Independent trials, Trials until success, Same probability of success

88
New cards

formula for geometric probability

p(x = k) = (1-p)k-1p

p = probability of success

k = number of trials

89
New cards

shape of geometric distribution

skewed right

90
New cards

shape of chi-squared distribution

skewed right

91
New cards

shape of t distribution

normal with greater area in the tails

92
New cards

mean of geometric distribution

µ = 1/p

93
New cards

standard deviation of geometric distribution

σ = √(1-p)/p

94
New cards

sampling distribution

distribution of values of a statistic for all possible samples of a given size from a given population

95
New cards

as n increases…

variability decreases

96
New cards

biased estimator

consistently over/underestimates the true population parameter

97
New cards

unbiased estimator

results in a sample mean that is equal to the population mean

98
New cards

mean of sampling distribution for p̂

µ = p

99
New cards

standard deviation of sampling distribution for p̂

σ = √p(1-p)/n

100
New cards

mean of sampling distribution for x̄

µ = µ