Linear Regression

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/19

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

20 Terms

1
New cards
  • Define linear regression and its purpose in data analysis

  • Explain assumptions to perform a linear regression

  • Interpret regression coefficients

  • Apply linear regression concepts to pharmacy related experiments

Learning Objectives

2
New cards

It described the degree of which the change in value of one variable corresponds to a change in the value of a second variable - measures the STRENGTH of association 

ex: One variable changes, and another changes also - How related are they? Does one cause the other? 

What is Correlation in terms of statistical analyses?

3
New cards

It is a statistical technique to estimate the relationship between a dependent variable (the response) and one or more independent variables (The explanatory) - It measures what the association IS

ex: AS X increases by 1 unit, how much does Y increase by? How much does X affect Y? 

What is regression in terms of statistical analyses? 

4
New cards

The Dependent variable (Y)

Regression analysis also allows us to predict ______ by fitting a linear equation to observed data

5
New cards

y = mx + b

What is the regression analysis equation?

6
New cards
  • Yi = β0 + β1Xi + Ei

  • (i = 1, 2, 3…. n)

  • β0 = Y intercept

  • β1 = slope = average amount of change in the dependent variable (Y) for each one unit change in X (independent variable) 

  • Ei = residual for i’th subject (Difference between what is observed (Yi) and what the model predicts 

What is the simple linear regression equation when you have multiple (“n”) pairs of measures

7
New cards

equal to zero

In regard to the simple linear regression equation, for the NULL HYPOTHESIS, the slope for the regression line would be:

8
New cards

NOT equal to zero

  • Two measures ARE associated

  • Use two-tailed tests to assess

In regard to the simple linear regression equation, for the ALTERNATIVE HYPOTHESIS, the slope for the regression line would be:

9
New cards
  1. F test - asks “is there an association between X and Y?” 

    1. if R2 is equal to 0 

    2. if P-value is <0.05 = evidence suggests measures ARE associated 

  2. T-tests - asks “are the slope and intercept significantly different from 0?”

What is the two-phase-hypothesis test for seeing if two variables are associated? 

10
New cards
  • Normality assumption for residuals

  • Linear relationship between dependent and independent variables

  • NO Heteroskedasticity present 

    • In simpler words:

      • In a perfect regression, the dots (errors) should be evenly spread around the line at all values of X.

      • Heteroskedasticity happens when the dots start fanning out or shrinking — meaning the variance of errors is not constant.

  • Check for multicollinearity among independent variables - if highly correlational, deep them from model (measuring same thing)

  • Screen for outliers in both independent and dependent variables

What are the assumptions for Simple linear regression?

11
New cards

The difference between prediction (expectation) and observation (Ei, etc)

  • The predicted values are the ones gained from the regression equation

  • SPSS can calculate this (Observed - predicted = residuals) 

What are residuals?

12
New cards

smaller ones

  • Larger R2 = larger coefficient of determination (ratio of predicted value variance to total variance) 

RESIDUALS NEED TO BE NORMALLY DISTRIBUTED IN ORDER OT PERFORM LINEAR REGRESSION

The best regression residuals are ____

13
New cards

Describes how much variability for this specific independent variable (DBP) is not explained by other independent variables in the model (Should be > 0.10) 

Tolerance definition - in regard to SPSS calculation of regression

14
New cards

A measure to detect and quantify multicollinearity (Should be <10)

Variance Inflation Factor definition - in regard to SPSS calculation of regression

15
New cards
  • They are the probability-probability plot data points that should be close to the LINE

  • Scatter plot data points fall into rectangle and have no standard deviation exceeding 3 or -3 along the X or Y axis

Residuals information - in regard to SPSS calculation of regression

16
New cards

Expressed in the actual units of each independent variable; these are the numbers we plug into the equation to predict the values of the independent variable

Unstandardized coefficients definition

17
New cards

Expressed as standardized value for all independent variables in the model so we can compare them against each other on the same scale

Standardized coefficients definition

18
New cards

There is a proportional relationship between X and Y

On the linear regression graph (After computing equation), if there is a POSITIVE, UPWARD slope, that means

19
New cards

There is an inversely proportional relationship between X and Y

On the linear regression graph (After computing equation), if there is a NEGATIVE, DOWNWARD slope, that means

20
New cards

Little to no relationship between X and Y

On the linear regression graph (After computing equation), if there is a POSITIVE, BUT SMALL, MINIMAL increase in slope, that means