Ch3 - Bivariate Data

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
full-widthPodcast
1
Card Sorting

1/25

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

26 Terms

1
New cards

response varaible

measures the outcome of a study (dependent variable)

2
New cards

explanatory variable

attempts to explain the overserved outcomes (the independent variable)

3
New cards

describing a scatterplot

look for the overall pattern & striking outliers

4
New cards

describing/interpreting a scatterplot sentence

1 sentence: There is a [strong/moderate/weak] [positive/negative] [shape] relationship between [x-context] and [y-context].  There [do/do not] appear to be any outliers. 

  1. strength (strong/moderate/weak) r-value

  2. dir (pos or neg assoc)

  3. shape (clusters, linear, curved, quadratic, log, etc)

    1. if dir. is pos. r is pos.

  4. state outliers

  5. context

5
New cards

pos association is

between 2 variables: when x axis variable increases, y axis variable increases

6
New cards

neg association

between 2 variables: when x axis variable increases, y axis variable decreases

7
New cards

correlation 

r → measures dir & strength of the linear relationship between 2 quantitative variables

  • not impacted by stretch/shrink (like z its standardized)

  • impacted by outliers

  • can only discuss r see data/know linear

8
New cards

interpret/describe correlation

 There is a [strength], [direction] linear relationship between [variable1] & [variable2].

9
New cards

facts abt correlation

R

  1. no distinction between explanatory (dependent) and response (independent)

  2. both variables must be quantitative (makes no sense w/ categorical)

  3. r has no units bc its standardized (ie z score doesn’t have units)

  4. the sign of r matches the sign of slope

  5. -1 <= r <= 1

  6. r only measures strength & dir thru linear relation (correlation = 0 does not mean no relation just not linear ie circle)

  7. correlation does not equal causation (only tru when random assignment)

10
New cards

y hat = a +bx

Least Squares Regression Line LSRS

  • y hat = predicted value

  • a = y-int (y coordinate)

  • b = slope

linear relationship

11
New cards

interpret/describe predicted value (y hat)

[y-hat] is the predicted value of [response variable y] when [explanatory variable] is [input amount]. 

12
New cards

interpretations =

“predicted”

13
New cards

Extrapolation

is the use of a regression line for prediction far outside the domain & range of the data to obtain the line

  • usual = inaccurate predictions

14
New cards

a residual is

the diff between the observed value & the predicted value (regression line)

  • residual = y - y hat

  • actual - predicted value

a good regression line makes residuals as small as possible

15
New cards

interpret/describe residual

The actual/true [y-context] is [higher/lower] than the predicted [y-context] by [residual].

16
New cards

interpret/describe the slope of a regression line

(b):  As [explanatory variable] increases by one [unit], the predicted [response variable] [increases/decreases] by [b] [units]. 

17
New cards

interpret/describe the y-int of a regression line

(a):  When [x-context] is zero, the predicted [y-context] is [a] [units].

18
New cards

interpret/describe the standard deviation of the residuals

When using the LSRL, the predicted number of [y-context] is typically [s units] off from the actual number. 

19
New cards

A Least Sqaure Regression Line

(LSRL) tries to minimize the sum of the squared residuals (as small as possible)

  • the mean of the residuals is always 0

  • the LSRL always goes thru the point (x bar, y bar)

use for summary data

20
New cards

defining y hat

y hat = “predicted __ units”

21
New cards

Calc Steps (LSRL)

  • Stat → calc → option 8

  • stat → calc → option 8 → store rest → var → y → 1 (graphs line)

use when given table data

22
New cards

coefficient of determination: the square of the correlation coefficient describes the strength (not dir.) of a linear relationship

  • measures how closely the points fall to the LSRL

23
New cards

r² =

<p></p>
24
New cards

interpret/describe the coefficient of determination

About [r2] % of the variability in [y-context] is accounted for by the least-squares regression line. 

25
New cards

residual plot

is a scatterplot of the residuals agaisnt the explanatory variable → if it is linear there will be a random scatter on plot

26
New cards

residual plot on calc

  1. run linear regression!

  • 2nd statplot → 1 → Xlist (list of explanatory, 1) → Ylist → 2nd stat → bottom!