AP STAT REVIEW: exploring data

5.0(1)
studied byStudied by 4 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/28

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 7:10 PM on 5/2/23
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

29 Terms

1
New cards
Categorical Variable
takes on values that are category names or group labels
2
New cards
Quantitative variable
one that takes on numerical values for a measures or counted quantity
3
New cards
IQR
Q₃-Q₁
4
New cards
Interpet the slope of the least squares regression line
for every 1 unit increase in \[x context\], the predicted \[y context\] increases/decreases by \[slope\]
5
New cards
Regression outlier
a point that does not follow the general trend shown in the rest of the data and has a large residual
6
New cards
Describing a distribution
Shape, Center, Spread, Outliers
7
New cards
Shape of a distribution
Skewed? symmetric? Distinct peaks?
8
New cards
Outliers
potential outliers if estimating, if not, Q₁-1.5(IQR), Q₃+1.5(IQR)
9
New cards
Center
What is the mean? If skewed, what is the median?
10
New cards
Spread
Standard deviation (goes with mean), IQR (goes with median)
11
New cards
Interpret the coefficient of determination
The coefficient of determination gives the percent of the variation of \[y context\] that is explained by the variation in \[x context\].
12
New cards
Skewed left graph
mean
13
New cards
roughly symmetric graph
mean≈median
14
New cards
skewed right graph
mean>median
15
New cards
High Leverage Point
a point having a substantially larger or smaller x-value than the other observations have
16
New cards
Calc function for LSRL
\#8: LinReg(a+bx)
17
New cards
influential point
any point that, if removed, changes the relationship substantially (big changes to slope and/or y-intercept (outliers and high-leverage points are often influential)
18
New cards
calculate the percentile of a specific value
percentile = # of values less than or equal to that of interest/total # values in data set
19
New cards
Discrete variable
countable number of values, may be infinite or finite (countable)
20
New cards
Continuous variable
can take on infinitely many values, but those values can not be counted (must be measured)
21
New cards
Correlation (r)
gives strength and direction of the linear relationship between 2 quantitative variables, can be between -1 and 1
22
New cards
Interpreting standard deviation
gives the typical distance that values are away from the mean
23
New cards
Describing relationship between 2 quantitative variables
Direction (+ or -), Unusual values, form (linear or curved), strength (weak - strong)
24
New cards
Interpreting the y-intercept of the LSRL
The predicted value of \[y-context\] when \[x-context\] is 0 is \[y-intercept value\].
25
New cards
Marginal Distribution
the distribution of values of that variable among all individuals described by the 2 way table of counts
26
New cards
Conditional Distribution
describes the values on that variable among individuals who have a specific value of another variable. There is a separate conditional distribution for each value of the other variable.
27
New cards
Association
if knowing the value of one variable helps predict the value of the other
28
New cards
Z score calculation
x-mean / standard deviation
29
New cards
Interpreting Z score
\[ \] is \[Z SCORE\] standard deviations ABOVE/BELOW the mean of \[MEAN\]