AP stats: unit 1

0.0(0)
studied byStudied by 1 person
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/34

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

35 Terms

1
New cards

categorical data

Data that consists of names, labels, or other nonnumerical values (averages are meaningless)

examples: grade, fav season, birth month

2
New cards

What is statistics?

A way of reasoning, along with a collection of tools and methods to help us understand the world

It’s the science of collecting, organizing, analyzing and interpreting data.

3
New cards

quantitative data

numerical data (averages give some sort of information)

examples: # of people, age, time, height

4
New cards

frequency table

includes category and frequency (amount)

5
New cards

relative frequency

includes category, frequency (amount), and relative frequency (%)

6
New cards

Ways to represent categorical data

pie chart, bar chart, dot plot

7
New cards

Rules for bar chart

Label your axes: variable name on horizontal, frequency on vertical

Scale axes: start scaling vertical axis at 0 and go up in equal increments until you =/< maximum frequency

Draw bars: make them equal in width and leave gaps

8
New cards

Types of quantitative data

discrete (whole numbers with gaps) and continuous (no gaps, can be decimal)

9
New cards

dot plot advantages and disadvantages (quantitative data

advantages: shows every individual value, easy to see shape of distribution

disadvantages: bad for large amts

10
New cards

Ways to represent quantitative variables

dot plot, stem plot, histogram

11
New cards

stem plot advantages and disadvantages

shows every individual value in the data set, easy to see distribution shape

disadvantages: difficult to make for large data sets

12
New cards

histogram advantages and disadvantages

advantages: good for large data sets, easy to see distribution shape

disadvantages: doesn't show every individual value

13
New cards

cumulative frequency

the sum of the frequencies for that class and all previous classes

14
New cards

factors to comment on to describe a quantitative variable

shape, center, variability, unusual features

15
New cards

IQR range

Q3-Q1 where Q1 is in between the min and median, and Q3 is between max and median

16
New cards

Formula for outliers

Lower fence: Q1 - 1.5 x IQR

Upper fence: Q3 + 1.5 x IQR

If it goes beyond these boundaries, it’s an outlier

17
New cards

Range

highest value - lowest value

18
New cards

Variance

standard deviation squared

19
New cards

skewed left

mean is less than median

20
New cards

skewed right

mean is greater than median

21
New cards

standard deviation

the square root of the variance (always less than variance when variance => 1)

if you add a constant to add values, SD remains the same

22
New cards

ways to describe distribution

shape, center, spread, variability

23
New cards

shapes of graphs

Uniform (roughly the same across all points)
Symmetric (no skew)
Skewed right (more low values)
Skewed left (more high values)
unimodal (one spot where data raises forming a lump)
bimodal (two spots where data raises forming a lump)

24
New cards

center

median and mean

25
New cards

unusual features

outliers, gaps, clusters

26
New cards

Variability

IQR, range, standard deviation, variance

27
New cards

Box plot/five number summary

minimum, Q1, median, Q3, maximum

28
New cards

Histogram

a bar graph depicting a frequency distribution (larger, more inspecific data points)

29
New cards

z-score

how many SD away from the mean;

value given > mean = positive
value given < mean = negative

30
New cards

Percentile

amount below or equal to the point

if you are in 99th percentile, you're the top 1%

number of values before & including value / total data

31
New cards

68-95-99.7 rule

in a normal model, about 68% of values fall within 1 standard deviation of the mean, about 95% fall within 2 standard deviations of the mean, and about 99.7% fall within 3 standard deviations of the mean

32
New cards

Z-Score Chart

use if you can't find % with the rule

33
New cards

working backwards

find closest value on z-score chart and solve for x

34
New cards

Adding data

If adding amount: range, SD and variance stays the same but mean changes

If adding percentage: change all by multiplying by 1.x but variance is still just SD²

35
New cards

Finding median and IQR with cumulative frequency chart

Point where cumulative proportion is 0.5 = median

Q1 is between median and lowest, Q3 is between median and highest → find both then use Q3 - Q1 to find IQR