AP stats: unit 1

35 Terms

New cards

categorical data

Data that consists of names, labels, or other nonnumerical values (averages are meaningless)

examples: grade level, favorite season, birth month

New cards

What is statistics?

A way of reasoning, along with a collection of tools and methods to help us understand the world

It’s the science of collecting, organizing, analyzing and interpreting data.

New cards

quantitative data

numerical data (averages are meaningful)

examples: number of people, age, time, height

New cards

frequency table

a table listing each category and its frequency (count)

New cards

relative frequency table

a table listing each category, its frequency (count), and its relative frequency (frequency / total, usually written as a %)
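
A minimal sketch of building a relative frequency table, assuming a made-up list of favorite seasons (the data and the function name `relative_frequency_table` are only for illustration):

```python
from collections import Counter

def relative_frequency_table(values):
    """Return rows of (category, frequency, relative frequency)."""
    counts = Counter(values)
    total = sum(counts.values())
    # relative frequency = frequency / total
    return [(category, count, count / total) for category, count in counts.items()]

# Hypothetical survey responses (made-up data).
seasons = ["fall", "summer", "fall", "winter", "summer", "fall", "spring", "fall"]

for category, count, rel in relative_frequency_table(seasons):
    print(f"{category:<8}{count:>3}{rel:>8.1%}")
```

The relative frequencies always add up to 1 (100%), which is a quick way to check the table.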

New cards

Ways to represent categorical data

pie chart, bar chart, dot plot

New cards

Rules for bar chart

Label your axes: variable name on horizontal, frequency on vertical

Scale axes: start the vertical axis at 0 and go up in equal increments until you reach or exceed the maximum frequency

Draw bars: make them equal in width and leave gaps

New cards

Types of quantitative data

discrete (countable values with gaps between them, usually whole numbers) and continuous (any value in an interval, no gaps, can be decimal)

New cards

dot plot advantages and disadvantages (quantitative data)

advantages: shows every individual value, easy to see shape of distribution

disadvantages: impractical for large data sets

New cards

Ways to represent quantitative variables

dot plot, stem plot, histogram

New cards

stem plot advantages and disadvantages

advantages: shows every individual value in the data set, easy to see distribution shape

disadvantages: difficult to make for large data sets

New cards

histogram advantages and disadvantages

advantages: good for large data sets, easy to see distribution shape

disadvantages: doesn't show every individual value

New cards

cumulative frequency

the sum of the frequencies for that class and all previous classes
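
A small sketch of the running total, assuming made-up class intervals and frequencies:

```python
from itertools import accumulate

# Hypothetical classes and their frequencies (made-up data).
classes = ["0-9", "10-19", "20-29", "30-39"]
frequencies = [3, 7, 6, 4]

# Each entry is the frequency of that class plus all previous classes.
cumulative = list(accumulate(frequencies))

for label, freq, cum in zip(classes, frequencies, cumulative):
    print(f"{label:<8}{freq:>3}{cum:>4}")
```

The last cumulative frequency equals the total number of values (20 here).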

New cards

factors to comment on to describe a quantitative variable

shape, center, variability, unusual features

New cards

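IQR (interquartile range)

Q3 - Q1, where Q1 is the median of the lower half of the data (between the min and the median) and Q3 is the median of the upper half (between the median and the max)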

New cards

Formula for outliers

Lower fence: Q1 - 1.5 × IQR

Upper fence: Q3 + 1.5 × IQR

Any value below the lower fence or above the upper fence is an outlier
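
A sketch of the fence rule on made-up data. It assumes the quartiles are the medians of the lower and upper halves, leaving the overall median out when the count is odd. Calculators and software may use slightly different quartile rules.

```python
from statistics import median

def quartiles(data):
    """Return (Q1, median, Q3) using medians of the lower and upper halves."""
    values = sorted(data)
    half = len(values) // 2          # assumes at least 2 values
    lower = values[:half]            # values below the median position
    upper = values[-half:]           # values above the median position
    return median(lower), median(values), median(upper)

def outliers(data):
    """Return values beyond Q1 - 1.5*IQR or Q3 + 1.5*IQR."""
    q1, _, q3 = quartiles(data)
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    return [x for x in data if x < lower_fence or x > upper_fence]

# Hypothetical data set (made-up).
data = [2, 4, 5, 7, 8, 9, 10, 30]
print(quartiles(data))  # (4.5, 7.5, 9.5), so IQR = 5
print(outliers(data))   # fences are -3 and 17, so only 30 is flagged
```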

New cards

Range

highest value - lowest value

New cards

Variance

standard deviation squared

New cards

skewed left

mean is usually less than the median (the tail stretches toward low values and pulls the mean down)

New cards

skewed right

mean is usually greater than the median (the tail stretches toward high values and pulls the mean up)
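
A quick check of the mean vs. median comparison for both skew directions, using two made-up data sets:

```python
from statistics import mean, median

# Hypothetical data: one very high value creates a right tail.
right_skewed = [1, 2, 2, 3, 3, 4, 20]
# Hypothetical data: one very low value creates a left tail.
left_skewed = [1, 17, 18, 18, 19, 19, 20]

print(mean(right_skewed), median(right_skewed))  # mean 5 > median 3
print(mean(left_skewed), median(left_skewed))    # mean 16 < median 18
```

The median barely moves when one extreme value is added, but the mean gets pulled toward the tail.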

New cards

standard deviation

the square root of the variance (smaller than the variance when variance > 1, equal to it when variance = 1, and larger when variance is between 0 and 1)

if you add a constant to all values, SD remains the same
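
A short sketch of how SD compares to variance on each side of 1 (the helper name `sd_from_variance` and the variance values are just for illustration):

```python
import math

def sd_from_variance(variance):
    """Standard deviation is the square root of the variance."""
    return math.sqrt(variance)

for variance in (4.0, 1.0, 0.25):
    sd = sd_from_variance(variance)
    print(f"variance = {variance}, SD = {sd}")
# variance 4.0  -> SD 2.0 (SD is smaller)
# variance 1.0  -> SD 1.0 (equal)
# variance 0.25 -> SD 0.5 (SD is larger)
```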

New cards

ways to describe distribution

shape, center, variability (spread), unusual features

New cards

shapes of graphs

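Uniform (roughly the same frequency across all values)
Symmetric (no skew; left and right sides roughly mirror each other)
Skewed right (most values are low, with a tail toward high values)
Skewed left (most values are high, with a tail toward low values)
Unimodal (one peak where the data rise to form a lump)
Bimodal (two peaks where the data rise to form lumps)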

New cards

center

median and mean

New cards

unusual features

outliers, gaps, clusters

New cards

Variability

IQR, range, standard deviation, variance

New cards

Box plot/five number summary

minimum, Q1, median, Q3, maximum

New cards

Histogram

a bar graph depicting a frequency distribution, with values grouped into intervals (bins); good for larger data sets, but individual data points are not shown

New cards

z-score

how many standard deviations a value is from the mean: z = (value - mean) / SD

value given > mean = positive z-score
value given < mean = negative z-score
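
A minimal sketch of the z-score calculation, assuming a hypothetical test with mean 70 and SD 8 (made-up numbers):

```python
def z_score(value, mean, sd):
    """Number of standard deviations that value lies from the mean."""
    return (value - mean) / sd

# Hypothetical test scores: mean 70, SD 8.
print(z_score(86, 70, 8))  # 2.0  (above the mean, so positive)
print(z_score(62, 70, 8))  # -1.0 (below the mean, so negative)
```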

New cards

Percentile

the percent of values that are less than or equal to a given value

if you are in the 99th percentile, you're in the top 1%

(number of values less than or equal to the value) / (total number of values)
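
A sketch of the percentile formula on made-up scores, using the "less than or equal to" definition from this card (some textbooks count only values strictly below):

```python
def percentile_of(value, data):
    """Percent of data values that are less than or equal to value."""
    at_or_below = sum(1 for x in data if x <= value)
    return 100 * at_or_below / len(data)

# Hypothetical scores (made-up data).
scores = [55, 60, 65, 70, 70, 75, 80, 85, 90, 95]
print(percentile_of(80, scores))  # 7 of 10 values are <= 80, so 70.0
```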

New cards

68-95-99.7 rule

in a normal model, about 68% of values fall within 1 standard deviation of the mean, about 95% fall within 2 standard deviations of the mean, and about 99.7% fall within 3 standard deviations of the mean
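
The three percentages can be checked with Python's standard-library `statistics.NormalDist`. This is only a sketch; the helper name `proportion_within` is made up for illustration.

```python
from statistics import NormalDist

def proportion_within(k):
    """Proportion of a normal model within k standard deviations of the mean."""
    standard_normal = NormalDist(mu=0, sigma=1)
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

for k in (1, 2, 3):
    print(f"within {k} SD: {proportion_within(k):.1%}")
# within 1 SD: 68.3%
# within 2 SD: 95.4%
# within 3 SD: 99.7%
```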

New cards

Z-Score Chart

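use when the 68-95-99.7 rule can't give the percentage (for example, when the z-score isn't exactly 1, 2, or 3 standard deviations from the mean)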

New cards

working backwards

given a percentile, find the closest area in the z-score chart, read off its z-score, then solve z = (x - mean) / SD for x
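
A sketch of working backwards, with `NormalDist.inv_cdf` standing in for the chart lookup. The mean of 70, SD of 8, and the helper name `value_at_percentile` are assumptions for illustration.

```python
from statistics import NormalDist

def value_at_percentile(percentile, mean, sd):
    """Find x such that the given proportion of a normal model lies below x."""
    z = NormalDist().inv_cdf(percentile)  # same job as the z-score chart lookup
    return mean + z * sd                  # z = (x - mean) / sd solved for x

# Hypothetical normal model: mean 70, SD 8. What score is the 90th percentile?
z = NormalDist().inv_cdf(0.90)
x = value_at_percentile(0.90, 70, 8)
print(round(z, 2), round(x, 1))
```

A printed chart only gives z to two decimal places, so an answer found by hand may differ slightly from this one.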

New cards

Adding data

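If adding a constant to every value: range, SD, and variance stay the same, but the mean (and median) shift by that constant

If increasing every value by a percentage: multiply the mean, median, range, and SD by 1.x (for example, 1.10 for a 10% increase); variance is still just SD², so it is multiplied by (1.x)²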
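
A sketch comparing the two cases on made-up data, assuming sample SD and sample variance (`statistics.stdev` and `statistics.variance`):

```python
from statistics import mean, stdev, variance

def summarize(data):
    """Return the mean, range, SD, and variance of data."""
    return {
        "mean": mean(data),
        "range": max(data) - min(data),
        "sd": stdev(data),
        "variance": variance(data),
    }

# Hypothetical data (made-up).
data = [10, 12, 15, 19, 24]
shifted = [x + 5 for x in data]    # add a constant of 5 to every value
scaled = [x * 1.10 for x in data]  # increase every value by 10%

original = summarize(data)
after_shift = summarize(shifted)
after_scale = summarize(scaled)

print(original)
print(after_shift)  # mean goes up by 5; range, SD, variance unchanged
print(after_scale)  # mean, range, SD times 1.10; variance times 1.21
```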

New cards

Finding median and IQR with cumulative frequency chart

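Point where cumulative proportion is 0.5 = median

Q1 is the point where cumulative proportion is 0.25 (between the lowest value and the median), Q3 is the point where it is 0.75 (between the median and the highest value) → find both, then use Q3 - Q1 to find IQR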
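
A rough sketch of reading the median and quartiles off cumulative proportions. It assumes a made-up table of discrete values and takes the first value whose cumulative proportion reaches the target, which is only one way to read such a chart.

```python
from itertools import accumulate

def value_at_proportion(values, frequencies, p):
    """Return the first value whose cumulative proportion is at least p."""
    total = sum(frequencies)
    for value, running in zip(values, accumulate(frequencies)):
        if running / total >= p:
            return value
    raise ValueError("p must be between 0 and 1")

# Hypothetical values and frequencies (made-up data).
values = [1, 2, 3, 4, 5]
frequencies = [2, 5, 6, 5, 2]  # cumulative proportions: 0.10, 0.35, 0.65, 0.90, 1.00

q1 = value_at_proportion(values, frequencies, 0.25)
med = value_at_proportion(values, frequencies, 0.50)
q3 = value_at_proportion(values, frequencies, 0.75)
print(q1, med, q3, q3 - q1)  # 2 3 4 2
```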