AP Statistics Chapter 3

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/22

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

23 Terms

1
New cards

For what type of variables should we use a scatterplot?

What should we include when creating a scatterplot?

2 quantitative variables

Explanatory = x-axis

Response = y - axis

2
New cards

key points to cover when describing a scatterplot

FoCUSeD

Form: _________

Context: ______

Unusual: points that deviate from the pattern

Strength: ________

Direction: _______

linear, curved, scattered

explanatory + response variables

strong, moderate, weak

positive or negative

3
New cards

What is a succinct way of describing scatterplots?

There is a strength, direction, form relationship between explanatory and response, with unusual points.

4
New cards

Correlation measures the ___ and ___ of the ____ relationship btwn 2 quantitative variables

r is _____ if x and y are interchanged.

r is _____ if x and/or y are rescaled.

r units:

strength, direction, linear

umchanged

unchanged

none

5
New cards

What is that range of correlation, what values translate to a strong or weak correlation?

-1 to 1, |coefficient| —> .9 to 1 = A, .8-.9 = B, etc. (weaker as u get closer to 0)

6
New cards

How to calculate correlation coefficient (r) when given data points?

  1. make table with __,__ ,__ ,__ , __ for each value

  2. find mean and standard deviations of x values and y values

  3. find _____ of each x and y value based on that, then the product

  4. ___ the products, divide by ____

x, y, zx, zy, product of zx * zy

z scores, product

add, n-1

7
New cards

Residuals: The error (vertical distance) between a linear model’s prediction and the observed data point.

Residual = ______

actual value - predicted value

y-ŷ

8
New cards

least squares regression line form

<p></p>
9
New cards

Interpret the slope in context.

As x increases by 1 unit, y is predicted to increase by b units

x and y shud be in context

10
New cards

Interpret the y-intercept in context. Use this sentence stem:

When x is zero units, y is predicted to be a units.

x and y shud be in context

11
New cards

the unique line that minimizes the sum of the squares of the vertical distances (residuals) between each data point and the line itself, predicts a value of y for a given value of x

least-squares regression line (LSRL)

12
New cards

Creating a LSRL

What formula do we use to find the slope?

What point does the LSRL always go through?

(x̄, ȳ)

Then fill in the equation ŷ=a+bx 

<p>(x̄, ȳ)</p><p>Then fill in the equation&nbsp;<span>ŷ=a+bx</span>&nbsp;</p>
13
New cards
<p>understanding regression outputs</p><p>slope = </p><p>y-int =</p><p>standard deviation of residuals = </p><p>r =</p>

understanding regression outputs

slope =

y-int =

standard deviation of residuals =

r =

income

constant

s

√r²

<p>income</p><p>constant</p><p>s</p><p>√r²</p>
14
New cards

s = standard deviation of residuals

average distance from ŷ to actual y is s (units for y)

<p>average distance from ŷ to actual y is s (units for y)</p>
15
New cards

analyzing residual plot

Residual plot: The residual plot (is randomly scattered, has a pattern eg. spreading out, curved) indicating that a linear model (is, is not) appropriate.

<p>Residual plot: The residual plot (is randomly scattered, has a pattern eg. spreading out, curved) indicating that a linear model (is, is not) appropriate.</p>
16
New cards

What is your fill-in-the-blank sentence to interpret Coefficient of Determination: r²?

The % variability in [y in context] that’s accounted for by the linear model

17
New cards

Outliers in the y-direction (vertically far from the trend) usually:

  • Increase the ____ (since they are far from the line)

  • Have little effect on the ____ of the regression line if their x-value is typical

  • They mostly affect the _____ strength (make r ____, i.e., weaken the linear relationship)

So: large vertical outliers make the line fit worse but don’t strongly pull it.

residual error

slope

correlation, smaller

18
New cards

What effect do outliers in the x-direction tend to have? What are they called?

Outliers in the x-direction are called _____ points because:

  • They can strongly influence the ____ and position of the _____, sometimes pulling it toward themselves

  • Removing it would noticeably change the regression line.

influential, LSRL

19
New cards

Not among explanatory or response variables, and yet may influence the interpretation of relationships among those variables

lurking variable

20
New cards

Points with large residuals are called ____ (note: the converse is not true: outliers don’t necesarily have large residuals). Points which change the slope of the line and the correlation coefficient greatly when removed are called _____

outliers,

influential points

21
New cards

The mean of the residuals

should be 0, if it’s not 0 means that LSRL is not perfect fit

22
New cards

high leverage point

far out in the x-direction

23
New cards

coefficient of determination