AP Stats - Unit 3

studied byStudied by 0 people
0.0(0)
Get a hint
Hint

Correlation Coefficient

1 / 33

flashcard set

Earn XP

Description and Tags

This is mostly vocab and things to remember in wording. Practice problems are not here.

34 Terms

1

Correlation Coefficient

This is “r”

  • A quantitative assessment of the strength & direction of the linear relationship between bivariate, quantitative data

  • Pearson’s sample correlation is used most

  • parameter - ρ (rho)

  • statistic - r

New cards
2

Does the value of r depend on which of the two variables is labeled x?

The value of r does not depend on which of the two variables is labeled x

New cards
3

Weak Correlation range

-.5 to .5

New cards
4

No Correlation range

No range. 0 is no correlation

New cards
5

Strong Correlation Range

-1 to -.8 and .8 to 1

New cards
6

Moderate Correlation Range

-.8 to -.5 and .5 to .8

New cards
7

Does the value of r depend on which of the two variables is labeled x?

The value of r does not depend on which of the two variables is labeled x

New cards
8

Is value of r resistant or non-resistant?

The value of r is non-resistant

Outliers affect the correlation coefficient

New cards
9

What is the value of r a measure of?

The value of r is a measure of the extent to which x & y are linearly related

New cards
10

A value of r close to zero does not rule out any strong relationship between x and y.


True or False?

True

New cards
11

Does correlation imply causation?

Correlation does not imply causation

New cards
12

x – variable

  • the independent or explanatory variable

New cards
13

y- variable

  • the dependent or response variable

New cards
14

What does LSRL stand for?

Least Squares Regression Line

(LSRL)

New cards
15

What is the LSRL formula?

y (with a hat, as pictured)= a + bx

y (y-hat) - means the predicted y

b  – is the slope

  • it is the amount by which y increases when x increases by 1 unit

a  – is the y-intercept

  • it is the height of the line when x = 0

  • in some situations, the y-intercept has no meaning

<p>y (with a hat, as pictured)= a + bx</p><p></p><p>y (y-hat) - means the predicted y</p><p><span><em>b</em>&nbsp; – is the slope</span></p><ul><li><p><span>it is the amount by which <em>y</em> increases when <em>x</em> increases by 1 unit</span></p></li></ul><p><span><em>a</em>&nbsp; – is the <em>y</em>-intercept</span></p><ul><li><p><span>it is the height of the line when <em>x</em> = 0</span></p></li><li><p><span>in some situations, the <em>y</em>-intercept has no meaning</span></p></li></ul><p></p>
New cards
16

What exactly is the LSRL?

  • The line that gives the best fit to the data set

The line that minimizes the sum of the squares of the deviations from the line

New cards
17

How do you interpret the slope?

For each unit increase in x, there is an approximate increase/decrease of b in y.

(Plug in the boldened words/letters).

New cards
18

How do you interpert the correlation coefficent?

There is a direction, strength, linear of association between x and y.

New cards
19

How does LSRL work with Extrapolation

  • The LSRL should not be used to predict y for values of x outside the data set. 

It is unknown whether the pattern observed in the scatterplot continues outside this range.

New cards
20

The correlation coefficient and the LSRL are both non-resistant measures.  


True or false?

True.

New cards
21

Formulas (null)

y(-hat) = b0 + b1x

b0 = y-hat - b1x

b1 = r(Sy/Sx)

New cards
22

Residual formula

Residual = y - y(-hat)

New cards
23

What are residuals?

Error

  • The vertical deviation between the observations & the LSRL

  • the sum of the residuals is always zero

  • error = observed - expected

New cards
24

What are Residual plots?

  • A scatterplot of the (x, residual) pairs.

  • Residuals can be graphed against other statistics besides x

  • Purpose is to tell if a linear association exist between the x & y variables

If no pattern exists between the points in the residual plot, then the association is linear.

New cards
25

If no pattern exists between the points in the residual plot, then the association is _____

If no pattern exists between the points in the residual plot, then the association is linear.

New cards
26

If a pattern exists between the points in the residual plot, then the association is ___.

non-linear

New cards
27

Are residual plots are the same no matter if plotted against x or y-hat?

Yes

Residual plots are the same no matter if plotted against x or y-hat.

New cards
28

Coefficient of determination

  • r2

  • gives the proportion of variation in y that can be attributed to an approximate linear relationship between x & y

remains the same no matter which variable is labeled x

New cards
29

Interperation of r2

Approximately r2% of the variation in y can be explained by the LSRL of x & y.

New cards
30

Outlier

In a regression setting, an outlier is a data point with a large residual

New cards
31

Influential point-

  • A point that influences where the LSRL is located

If removed, it will significantly change the slope of the LSRL

New cards
32

Which of these measures are resistant?

  • LSRL

  • Correlation coefficient

  • Coefficient of determination

NONE are resistant – all are affected by outliers


New cards
33

Computer-generated regression analysis of knee surgery data: 

What is the equation of the LSRL?

Find the slope & y-intercept.

Predictor Coef Stdev T P

Constant 107.58 11.12 9.67 0.000

Age 0.8710 0.4146 2.10 0.062


s = 10.42 R-sq = 30.6% R-sq(adj) = 23.7%

y(-hat) = 107.58 + .8710x

New cards
34

Computer-generated regression analysis of knee surgery data: 

What are the correlation coefficient and the coefficient of determination

Predictor Coef Stdev T P

Constant 107.58 11.12 9.67 0.000

Age 0.8710 0.4146 2.10 0.062

s = 10.42 R-sq = 30.6% R-sq(adj) = 23.7%

Never use adjusted R2

Be sure to convert r2 to decimal before taking the square root!

r=.5532

New cards

Explore top notes

note Note
studied byStudied by 3 people
... ago
5.0(1)
note Note
studied byStudied by 8 people
... ago
4.0(1)
note Note
studied byStudied by 8 people
... ago
5.0(1)
note Note
studied byStudied by 13 people
... ago
5.0(1)
note Note
studied byStudied by 7 people
... ago
5.0(1)
note Note
studied byStudied by 62 people
... ago
5.0(3)
note Note
studied byStudied by 22 people
... ago
5.0(1)
note Note
studied byStudied by 5946 people
... ago
5.0(12)

Explore top flashcards

flashcards Flashcard (25)
studied byStudied by 27 people
... ago
5.0(1)
flashcards Flashcard (50)
studied byStudied by 4 people
... ago
5.0(1)
flashcards Flashcard (32)
studied byStudied by 51 people
... ago
5.0(1)
flashcards Flashcard (49)
studied byStudied by 4 people
... ago
5.0(1)
flashcards Flashcard (294)
studied byStudied by 9 people
... ago
4.0(1)
flashcards Flashcard (201)
studied byStudied by 712 people
... ago
4.0(3)
flashcards Flashcard (50)
studied byStudied by 16 people
... ago
5.0(1)
flashcards Flashcard (35)
studied byStudied by 29 people
... ago
5.0(5)
robot