GEA1000 chapt 3 correlation coefficient and its limitations

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/8

flashcard set

Earn XP

Description and Tags

BRUHHHHHH

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No study sessions yet.

9 Terms

1
New cards

what is correlation coefficient

  • a measure of linear association

  • summarises direction and strength of linear association

  • ranges between -1 and 1

2
New cards

what are the different interpretations of r-value

r > 0 → positive linear association

r < 0 → negative linear association

r = 1 → perfect positive linear association

r = -1 → perfect negative linear association 

r = 0 → no linear association 

3
New cards

what does the magnitude of the r-value mean

  • strength of linear association

  • closer value of r to -1 or 1 → stronger the linear association

  • closer value of r to 0 → weaker the linear association

4
New cards

what is the formula for correlation coefficient

  1. convert each data point to Standard Unit (SUx, SUy, Sx is s.d of X and Sy is s.d of Y)

SUx = [X - avg(X)] / Sx

SUy = [Y - avg(Y)] / Sy

  1. r value is the sum of product of X and Y in standard unit divided by (n-1)

  • n is no. of data points

5
New cards

what is the properties of r

r is not affected by:

  • interchange of 2 variables

  • adding a number to all values of a variable

  • multiplying a positive number to all values of a variable

6
New cards

what are some limitations of correlation coefficient 

  • correlation DOES NOT imply causation 

  • a third variable might affect the outcome 

7
New cards

how can a quadratic scatter plot have a r = 0

  • r only measures the linear association between 2 variables

  • quadratic scatter plot is not a linear association

8
New cards

what are outliers

observations that fall far from the main cluster of points

9
New cards

how can they affect correlation

decrease of increase the strength of the correlation, depending on where the outlier is with respect to the other points