ap stats unit 1 notes

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/56

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

57 Terms

1
New cards

quantitative data

data that is numerical. the values have an inherent order. examples: income, weight, height # of classes

*can you average the data? YES

2
New cards

categorical data

datat where values are categories or group labels, which often don’t have an inherent order. examples: eye color, relationship status, left or right

*cannot be averaged

3
New cards

misleading graphs may not have axis labels or ___

scale

4
New cards

misleading graphs may __ the x or y axis, or start at a weird place

cut off

5
New cards

misleading graphs may use __ for a bar graph also called what

pictures, also called a pictograph

6
New cards

mean formula

sum of all values / number of all values

7
New cards

median

middle

if it’s odd, use middle value of data set, if even use the value in between

8
New cards

approximating a median in a histogram

  1. label the frequency

  2. add up the frequency

  3. divide by two

  4. find the bins that contain the median, the nth data point

9
New cards

formula for range

range = max - min

10
New cards

why do you think we find the difference between each data point and the mean, standard deviation (Sx) is a measure of spread

it’s a way to measure distance between data values and the mean

11
New cards

Formula of interquartile range (IQR)

Q3 - Q1

12
New cards

resistance

not seriously affected by

13
New cards

Is the median resistant by skew and outliers

yes, the ___ is resistance by skews and outliers (median/mean)

14
New cards

Is the mean resistance to skews and outliers

No, the ___ is NOT resistance by skews and outliers (median/mean)

15
New cards

is the IQR resistant to skews and outlers?

Yes, the ___ is resistant to skews and outliers. (IQR/range and standard deviation)

16
New cards

Is range and standard deviation resistant to outliers

No, the ___ is NOT resistant to skews and outliers (IQR/range and standard deviation)

17
New cards

Why is the mean not resistant to skews and outliers

The mean can drag up or drag down values in calculation

18
New cards

Why is the median and IQR resistant to skews and outliers

Position matters more than the value, the outlier is not given a large weight in calculation

19
New cards
<p>Right skew</p><p></p>

Right skew

mean > median

20
New cards
<p>symmetric</p><p></p>

symmetric

mean = median

21
New cards
<p>left skew</p><p></p>

left skew

mean < median

22
New cards

Advantages to using a dotplot to visualize data

see every data point

23
New cards

Disadvantages to using a dotplot to visualize data

  • can’t always see exact value

  • not great for large data sets

24
New cards

Advantages to using a stemplot to visualize data

See exact values

25
New cards

Disadvantages to using a stemplot to visualize data

not great for large data sets

26
New cards

Advantages to using a histogram to visualize data

Great for large data sets

27
New cards

Disadvantages to using a histogram to visualize data

Can’t see individual values

28
New cards

when asked to describe the distribution use …

CSOCS

29
New cards

1st C in CSOCS

Context: what variable is being measured. Example: The distribution of payroll for 2002 baseball teams…

30
New cards

1st S in CSOCS

Shape: right/left skew, symmetric modes (unimodal or bimodal)

31
New cards

O in CSOCS

Outliers: unusual points. Examples: No obvious outliers

32
New cards

2nd C in CSOCS

Center: mean, median, general center. Example: The center is approximately $50 million

33
New cards

2nd S in CSOCS

Spread: range, IQR, standard deviation. Example: Has a range of between $20-140 million

34
New cards

outliers

unusually high or low data values

35
New cards

Formulas for outlier boundaries

Upper Limit and Lower Limit

36
New cards

Upper limit

Q3 + 1.5 x IQR

37
New cards

Lower Limit

Q1 - 1.5 x IQR

38
New cards

CSOCS for boxplot

  • Context: subject of data

  • Shape: Skew (not modes) shape may not be determined

  • Outlier: dots/astriecks

  • Center: median

  • Spread: IQR

39
New cards

Comparing distirbutions

Use CSOCS and use comparative language for each feature, use AND, not but or howeve.

40
New cards

Percentile

percent of data less than or equal to a certain data value

41
New cards

standardization

a point’s location in the distribution depends on both distance from the center and the distribution’s spread (or variability)

42
New cards

formula for z score

z = (x - mean) / Standard deviation (sx)

43
New cards

If data value > mean

positive z-score

44
New cards

if data value < mean

negative z score

45
New cards

positive z score

the number of standard deviation, ABOVE THE MEAN

46
New cards

negative z score

the number of standard deviations, BELOW the mean

47
New cards

The normal curve

  • symmetric

  • mean = median, both located at center

48
New cards

The empirical rule

normal curves 1,2,3 standard deviations away from the mean

49
New cards

Empirical rule: 1 SD from mean

68%

50
New cards

Empirical rule: 2 SD from mean

95%

51
New cards

Empirical rule: 3 SD from mean

99.7%

52
New cards

strategy for normal curves

  • draw + label curve

  • perform calculations

  • answer the question with context

53
New cards

Empirical rule: 3 SD from the mean is statistically significant how

Out of the norm

54
New cards

Percentile will always be shaded to the…

to the left

55
New cards

Empirical rule FORMULA: 1 SD from mean

Mean ± 1(Standard deviation)

56
New cards

Empirical rule FORMULA: 2 SD from mean

Mean ± 2(Standard deviation)

57
New cards

Empirical rule FORMULA: 3 SD from mean

Mean ± 3(Standard deviation)