AP stats

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/90

flashcard set

Earn XP

Description and Tags

holy cram

Last updated 7:47 PM on 5/6/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

91 Terms

1
New cards

Quantitative variable

takes numerical values for a measured of counted quantity

2
New cards

categorical variable

takes on values that are category names or group labels

  • bar groups show frequency (how many) or relative frequency (percent)

3
New cards

misleading graphs

  • vertical axis must start at 0

  • beware of using images for bar graphs

4
New cards

segmented bar graph

stack up bars to make 100%

5
New cards

mosaic plot

segmented bar graph where the width of bars is proportional to the group size

6
New cards

association

if knowing the value of one variable helps us predict the other variable

  • segmented bar graphs are different

7
New cards

2 different quantitative data

discrete and continuous

8
New cards

discrete quantitative data

countable number of values

9
New cards

continuous quantitative data

infinite values

10
New cards

SOCV

shape, outliers, center , variability with context]

11
New cards

interpret standard deviation

“The context typically varies by SD from the mean of

12
New cards

how to find variance?

(SD)2

13
New cards

Greated affected by outliers (nonresistant)

mean and standard deviation

14
New cards

resistant to change

median

15
New cards

1.5 x IQR method

low outlier < Q1 - 1.5(IQR)

high outlier > Q3 + 1.5(IQR)

16
New cards

SD Method for outliers

low outlier < mean - 2(SD)

high outlier > mean + 2(SD)

17
New cards

calculate IQR

MAX - MIN

18
New cards

Boxplots

5 number summary:

min, Q1 , med, Q3, max

19
New cards

Interpret percentile

“The pth percentile is the value that has the p% of the data less than or equal to it.”

20
New cards

Q1 percentile

25th percentile (0.25)

21
New cards

What percentile is the Median?

50th percentile (0.50).

22
New cards

Q3 percentile

75th percentile (0.75)

23
New cards

interpret Z-score

context” is z-score standard deviations above/below the mean of

24
New cards

what do z-scores show?

They show position relative to other values in the distribution

25
New cards

Empirical rule (68-95-99.7)

26
New cards

how to find z-score for a given proportion

use Table A, or TI84: InvNorm

27
New cards

Describe a relationship (DUFS + Context0

Direction (positive/negative, none)

Unusual features (outliers, clusters)

Form (linear or nonlinear)

Strength

28
New cards

Interpret Correlation ( r )

“The linear relationship between x and y is strength and direction .”

29
New cards

interpret coefficient of determination r2

“the percent of the variation in y explained by the linear relationship with x”

30
New cards

does correlation equal causation?

NO

31
New cards

how to calculate residual

residual = Actual - Predicted

(r=a-p)I

32
New cards

Interpret residual

“the actual context was residual value above/below the predicted value for x=#

33
New cards

interpret y-intercept

“when x=0 context, the predicted y-context is y-intercept.”

34
New cards

interpret slope

“For each additional x-context the predicted y-context increases/decreases by slope.”

35
New cards

Least Squares regression line

minimizes the sum of the squared residuals

36
New cards

horizontal outliers

tilt the least squares regression line

37
New cards

vertical outliers

shift the least squares regression line up or down

38
New cards

high leverage outliers

very large of small x-values

39
New cards

influential outliers

if removed, big changes happen to the slope, y-int and correlation ( r ).

40
New cards

choosing best regression model

  1. check the scatter plot for a linear pattern

  2. check residual plot for no leftover pattern

  3. check for the r2 that is closest to 1.

41
New cards

convenience sample

people are easy to reach, can lead to bias

42
New cards

voluntary response

people choose to respond, can lead to bias

43
New cards

what samples can lead to bias?

convenience sample and voluntary response

44
New cards

simple random sample (SRS)

  1. label individuals

  2. randomize

  3. select

45
New cards

label individuals in SRS

assign numbers or write names on slips of papers

46
New cards

randomize in SRS

random number generator (no repeats) or names in a hat (shuffle)

47
New cards

stratified random sample

splot the population into groups (strata) then choose an SRS from each strata.

48
New cards

Homogenous grouping

each strata has individuals with shared attributes or characteristics

49
New cards

what leads to the best estimates in a sampling method?

low bias and low variability

50
New cards

cluster sample

heterogenous groups, sample all from some groups

51
New cards

systematic random sampling

choose a random starting point and go from equal intervals

52
New cards

undercoverage

some people are less likely to be chose

ex. calling landlines, surveying homeowners

53
New cards

nonresponse

people cant be reached or refuse to answer

ex. don’t answer of Hang up phone calls

54
New cards

response bias

problems in the data gathering instrument or process

ex. people lie (self reported responses), wording of quesion

55
New cards

observational study

no treatment imposed

56
New cards

experiment

treatment imposed, allow us to show causation

57
New cards

well-designed experiment steps

  1. comparison (2 or more treatments)

  2. random assignment

  3. replication (more than 1 in each treatment group)

  4. control (keep other variables constant)

58
New cards

what does random assignment do?

allows us to show causation

59
New cards

placebo effect

when a fake treatment works

60
New cards

blinding

when subjects (single blind) and/or experimenters (double blind) don’t know about treatments

61
New cards

randomized block design

separate subjects into blocks and then randomly assign treatments within each block

62
New cards

block

group of experimental units that are similar

63
New cards

matched pairs design

subjects are paired (block size 2) and then randomly assigned to a treatment and each subject receives two treatments

64
New cards

statistically significant

when results of an experiment are unlikely (less than 5% (0.05)) to happen purely by chance.

If statistically significant, we have convincing evidence the treatment caused the difference.

65
New cards

what does a random sample allow us to do?

Allows us to generalize our conclusions to the population from which we sampled

66
New cards

long run relative frequency

  • always between 0 and 1 (inclusive)

    • predictable

67
New cards

short run relative frequency

predictable

68
New cards

law of large numbers

simulated probabilities tend to get closer to the true probability as the number of trials increase.

69
New cards

sample space, list of all possible outcomes

P(E) = # outcomes in E/ total # outcomes in sample space

70
New cards

complement rule, probability of an event not happening

P(Ac) = 1- P(A)

71
New cards

P(A and B)

P(A n B) where both occur

72
New cards

P(A or B)

P(A U B) one of the other or both

73
New cards

addition rule in probability

P(A or B) = P(A) + P(B) - P(A and B)M

74
New cards

Mutually exclusive

Events A and B can’t occur together

P(A and B) = 0, so P(A or B) = P(A) + P(B)co

75
New cards

conditional probability

the probability of one event given another has occurred

P(A | B)= P(A and B) / P(B)

76
New cards

independent events

knowing whether or not one event occurs does not changed the probability of the other event

If P(A) = P(A | B) = P(A | Bc)

then A and B are independent

77
New cards

general multiplication rule

If A and B are independent

P( A and B) = P(A) x P(B)

78
New cards

probability of getting at least 1

1 - P(none)

79
New cards

discrete random variable

takes a countable number of values with gaps between

80
New cards

continuous random variable

has infinite values with no gaps

ex. uniform, normal

81
New cards

sample size

as sample size increases, variability decreases

82
New cards

central limit theorem

the sampling distribution of x-bar is approximately normal when the sample size is => 30

83
New cards

confidence interval

point estimate +- margin of error

P.E = A+B/2

M.E = B - A / 2

84
New cards

interpret confidence interval

we are % confident that the interval from A to B captures the true context

  • all values between A and B are plausible

85
New cards

Margin of Error

increased confidence, increased M.E leading to a wider interval

increased sample size, decreased M.E leading to narrower interval

86
New cards

interpret confidence level

if we take many, many samples and calculate a confidence interval for each, about % if them will capture the true context.

87
New cards

conditions for constructing a confidence interval for proportion

  1. random condition (must have random sample)

  2. 10% condition (when sampling w/o replacement, check n <= 10% (N population))

  3. Large Counts condition ( n(p-hat) >= 10 and n( 1-(p-hat) ) => 10

88
New cards

interpret p value

assuming the null hypothesis is true ( p = ho context), there is a p-value probability of getting a p-hat of or more extreme purely by chance.

89
New cards

conclusion

Because p-value < significance level (or p-value > sig. lvl.), we reject the null hypothesis (or fail to reject) and we do (or do not) have convincing evidence for Ha context.

90
New cards

type I error

the null hypothesis (context) is true, but we find convincing evidence for Ha(context

91
New cards

type II error

the ha(context) is true but we don’t find convincing evidence for Ha(context)