Regression
The straight line that best fits the data; it can be expressed mathematically
What does the regression line let us do with respect to X and Y
We can predict unknown Y scores from given X scores (assuming scores fall in data range)
How is regression calculated
Y’ = ay + byX (sometimes Ŷ is used instead of Y’)
Best fit line
Line that makes predictions about people’s scores that are as close to their true scores as possible
Error
Difference between a person’s predicted score and the person’s actual score on the criterion variable
What is another name for the regression line
The least-squares regression line, because it mathematically minimizes the errors made when predicting Y from X
How do we know if a line minimizes errors
We take the sum of squared errors, ∑(Y − Y’)², which is at a minimum for the best-fitting line
by
Slope; the constant that gives the change in Y’ for each one-unit change in X
How is by calculated
by = r(SDy/SDx)
ay
Y intercept; when predicting Y, it is the point where the regression line crosses the Y-axis
How is ay calculated
ay = Ȳ − by(X̄)
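The formulas for by and ay above can be put together in a short sketch. This is a minimal, self-contained illustration in Python; the data and names (x, y, predict) are made up for the example, not from the deck.

```python
# Sketch: computing the regression coefficients by = r(SDy/SDx) and
# ay = Ȳ - by(X̄) from hypothetical data, then predicting with Y' = ay + byX.

x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n

# Population standard deviations (dividing by N, matching the formulas here)
sd_x = (sum((xi - mean_x) ** 2 for xi in x) / n) ** 0.5
sd_y = (sum((yi - mean_y) ** 2 for yi in y) / n) ** 0.5

# Pearson r
r = sum((xi - mean_x) * (yi - mean_y)
        for xi, yi in zip(x, y)) / (n * sd_x * sd_y)

b_y = r * (sd_y / sd_x)       # slope: by = r(SDy/SDx)
a_y = mean_y - b_y * mean_x   # intercept: ay = Ȳ - by(X̄)

def predict(x_score):
    """Y' = ay + by*X (valid only within the range of the data)."""
    return a_y + b_y * x_score
```

Note that `predict(mean_x)` returns `mean_y`, which matches the later card stating the line passes through (X̄, Ȳ).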
T or F: If X̄ = 0, then Y’ = ay
T
What leads to a flatter slope
Higher variability around the line (which weakens the correlation) or heteroscedasticity
What 2 values are used to plot a regression line
Y intercept (0, ay)
The mean of X and mean of Y (X̄, Ȳ)
(Values need to be in range of data)
T or F: The regression line represents the mean for bivariate data
T, the regression lines for Y’ and X’ intersect at the mean of X and Y (X̄, Ȳ)
What three datapoints are needed to plot Y’ and X’
Y’: (0, ay) and (X̄, Ȳ)
X’: (ax, 0) and (X̄, Ȳ)
What happens to the angle of the two regression lines if the correlation is very high
The angle between them will be very small; if r = ±1.00 the two lines overlap, and if r = 0 they form a 90-degree angle (the angle increases as r approaches zero)
What happens to Y’ and X’ respectively if r = 0
Y’ → by = 0, ay = Ȳ and the line is flat (horizontal)
X’ → bx = 0, ax = X̄ and the line is vertical
What are the equations if the relationship is curvilinear (quadratic, cubic or quartic)
Quadratic: Y′ = a + bX + cX²
Cubic: Y′ = a + bX + cX² + dX³
Quartic: Y′ = a + bX + cX² + dX³ + eX⁴
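Curvilinear equations like these can be fit by polynomial least squares. A small sketch using NumPy's `polyfit` on made-up data (nothing here comes from the deck):

```python
# Sketch: fitting a quadratic regression Y' = a + bX + cX² with numpy.polyfit.
# polyfit returns coefficients highest power first, i.e. [c, b, a];
# higher-degree (cubic, quartic) fits just change the degree argument.
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = x ** 2 - 3 * x + 2          # data generated from a known quadratic

c, b, a = np.polyfit(x, y, 2)   # degree 2 → quadratic fit

# Predicted Y' values from the fitted curve
y_pred = np.polyval([c, b, a], x)
```

Because the data were generated from an exact quadratic, the fit recovers a = 2, b = −3, c = 1 (up to floating-point error).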
What is Y - Y’
A measure of variability: the residual or error, i.e., the difference between an observed Y and the predicted Y’ on the regression line. It measures error of prediction around the regression line
Standard error of estimate
A measure of the average deviation of the errors: the difference between the Y-values predicted by the regression model and the actual Y-values in the sample
How are deviation scores calculated for SD and SDy-y’
SD: (X − X̄)
SDy-y’: (Y − Y’)
How are squared deviations calculated for SD and SDy-y’
SD: (X − X̄)²
SDy-y’: (Y − Y’)²
How are sums of squared deviations calculated for SD and SDy-y’
SS = ∑(X − X̄)²
SSy-y’ = ∑(Y − Y’)²
How are the standard deviations calculated for SD and SDy-y’
SD = √(∑(X − X̄)²/N)
SDy-y’ = √(∑(Y − Y’)²/N)
T or F: there is an alternate way to calculate SDy-y’
T, SDy-y’ = SDy√(1 − r²)
T or F: when r is not equal to 0, SDy-y’ will be smaller than SDy
T, because the prediction draws on information from two variables instead of one, which reduces error
What would happen to SDy-y’ if r = 0
SDy-y’ = SDy (because SDy-y’ = SDy√(1 − r²))
What would happen to SDy-y’ if there is a perfect positive or negative correlation (r = ±1)
SDy-y’ = 0 (because SDy-y’ = SDy√(1 − r²))
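The shortcut formula can be checked numerically: computing the standard error of estimate directly from the residuals gives the same value as SDy√(1 − r²). A sketch with made-up data (names and numbers are illustrative only):

```python
# Numerical check that SD of (Y - Y') equals SDy * sqrt(1 - r²).
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 4.0, 5.0, 4.0, 5.0])
n = len(y)

sd_y = y.std()                    # population SD (divides by N)
r = np.corrcoef(x, y)[0, 1]       # Pearson correlation

# Fit the least-squares line and compute residuals Y - Y'
b, a = np.polyfit(x, y, 1)
residuals = y - (a + b * x)
sd_y_yprime = np.sqrt((residuals ** 2).sum() / n)   # direct calculation

shortcut = sd_y * np.sqrt(1 - r ** 2)               # alternate formula
```

Both `sd_y_yprime` and `shortcut` come out equal, illustrating why SDy-y’ shrinks to 0 as r approaches ±1.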
What do we need to know in order to understand explained variability
Total and unexplained variability
Total variability
Denoted with xt; the deviation Y − Ȳ
Unexplained variability
Denoted with xo; the residual Y − Y’
Explained variability
Denoted with xe; the deviation Y’ − Ȳ
How can one conceptually explain explained variability
Total variability = prediction + residuals
In what 2 ways can total variability be expressed
SST
∑(Y − Ȳ)²
In what 2 ways can explained variability be expressed
SSR
∑(Y’ − Ȳ)²
In what 2 ways can unexplained variability be expressed
SSE
∑(Y − Y’)²
Why must the values be squared to calculate total variability (e.g., why can we not just use ∑(Y − Ȳ) = ∑(Y’ − Ȳ) + ∑(Y − Y’))
Because the sum of unsquared deviations is always 0, the unsquared identity is uninformative; this is why we must use sums of squares (SS) in the regression equation
What is the problem regarding SSR and SSE
We have trouble interpreting SSR and SSE because they are in squared units
What is the solution to the problem regarding SSR and SSE
We calculate the proportion of variability with regard to explained and unexplained variability
How is proportion of variability used to calculate total variability
Conceptual: Total variability = proportion of explained variability + proportion of unexplained variability
Equation: SST/SST = SSR/SST + SSE/SST
What is the important implication regarding proportion of explained variability and proportion of unexplained variability and their relationship with r
Proportion of explained variability = r² and proportion of unexplained variability = 1 − r²
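The decomposition SST = SSR + SSE and the identity SSR/SST = r² can both be verified numerically. A sketch on made-up data (the numbers are illustrative, not from the deck):

```python
# Sketch verifying SST = SSR + SSE and SSR/SST = r² for a fitted line.
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 4.0, 5.0, 4.0, 5.0])

b, a = np.polyfit(x, y, 1)    # least-squares slope and intercept
y_pred = a + b * x            # Y' for each observation
y_bar = y.mean()

sst = ((y - y_bar) ** 2).sum()        # total variability
ssr = ((y_pred - y_bar) ** 2).sum()   # explained variability
sse = ((y - y_pred) ** 2).sum()       # unexplained variability

r = np.corrcoef(x, y)[0, 1]
# sst == ssr + sse, ssr/sst == r², sse/sst == 1 - r²
```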
What are the 9 attributes of the regression line
The regression line represents bivariate data in a linear relationship and predicts scores based on observed data
Defined by linear equation for a straight line
Two regression lines can represent bivariate data (Y’ and X’)
Does not predict values outside of the range of data
Is the best descriptor of bivariate data
Reflects the method of least squares (i.e., minimizes Σ(Y − Y’)²)
Always has some error of prediction, measured as the standard error of estimate SDy-y’ (unless r = ±1)
Is a traveling normal distribution with a moving mean
Allows separate measures of SST, SSR and SSE
What are the two linear equations for Y’ and X’
Y’ = ay + byX
X’ = ax + bxY
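The two linear equations above can be fit side by side, and doing so confirms the earlier card that both lines pass through (X̄, Ȳ). A sketch on made-up data (all names and numbers are illustrative):

```python
# Sketch: fitting both regression lines, Y' = ay + byX and X' = ax + bxY,
# and checking that each passes through (X̄, Ȳ).
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 4.0, 5.0, 4.0, 5.0])

b_y, a_y = np.polyfit(x, y, 1)   # regress Y on X  → Y' line
b_x, a_x = np.polyfit(y, x, 1)   # regress X on Y  → X' line

# Plugging the mean of one variable into either line returns
# the mean of the other, so the lines intersect at (X̄, Ȳ).
y_at_mean_x = a_y + b_y * x.mean()
x_at_mean_y = a_x + b_x * y.mean()
```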