Cook’s distance
a measure of a single point’s “pull” on the regression line (how much the fitted estimates change if that point is removed)
covariance
extent to which two variables’ deviation scores vary together (positive = when one is high, the other tends to be high; negative = when one is high, the other tends to be low; 0 = no covariance) (do they match?)
correlation [r]
a standardized measure of the strength of an association (covariance divided by the product of the SDs); also the average of the products of paired z-scores (zx × zy)
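The two definitions on this card can be checked directly. A minimal sketch (population formulas, dividing by n; the data in the checks are made up for illustration):

```python
# Correlation computed two ways: as standardized covariance, and as
# the average of the products of paired z-scores.

def mean(v):
    return sum(v) / len(v)

def sd(v):
    m = mean(v)
    return (sum((a - m) ** 2 for a in v) / len(v)) ** 0.5

def correlation(x, y):
    # covariance divided by the product of the standard deviations
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)
    return cov / (sd(x) * sd(y))

def correlation_z(x, y):
    # average of the products of paired z-scores
    zx = [(a - mean(x)) / sd(x) for a in x]
    zy = [(b - mean(y)) / sd(y) for b in y]
    return mean([a * b for a, b in zip(zx, zy)])
```

Both functions return the same value for any data set, which is the point of the card.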
Spearman’s rho (ρ)
a rank-order correlation; useful for finding the strength of non-linear (monotonic) relationships. Statistical significance is a separate question, tested with a p-value; by convention, if p < 0.05 the result is significant
Linear regression model
ŷi = b0 + b1xi (predicted y = intercept + slope × x)
Quadratic regression model
ŷi = b0 + b1xi², where b1 determines the shape of the curve
E
residual error: observed minus predicted, e = y − ŷ
Sum of Squared Errors (SSE)
measures the spread of the residuals: SSE = Σ(yi − ŷi)², the sum of the squared residuals
Mean Square Error
the average amount of spread: the SSE divided by the number of observations (MSE = SSE / n); the goal is to find the line with the lowest MSE
Least Squares Estimation
method for finding the b0 and b1 that give the smallest MSE
Brute Force Method
start with a possible value for b1, find the residuals and compute the MSE, then adjust b1 and repeat until you find the lowest MSE
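The brute-force card can be sketched as a grid search over candidate slopes. The data, search bounds, and step size below are made up for illustration, and b0 is held fixed so the sweep stays one-dimensional:

```python
# Brute-force least squares: sweep candidate slopes, compute the MSE
# for each, and keep the slope with the smallest MSE.

def mse(x, y, b0, b1):
    return sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y)) / len(x)

def brute_force_slope(x, y, b0=0.0, lo=-5.0, hi=5.0, step=0.01):
    best_b1, best_mse = lo, float("inf")
    b1 = lo
    while b1 <= hi:
        m = mse(x, y, b0, b1)
        if m < best_mse:
            best_b1, best_mse = b1, m
        b1 += step
    return best_b1, best_mse
```

The answer is only as precise as the step size, which is why the analytic method on the next card is preferred.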
Analytic method
first find b1 using b1 = r × (SDy / SDx), then plug it into b0 = My − b1Mx to find b0
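A sketch of the analytic formulas on this card, using population (divide-by-n) statistics; the data in the check are made up:

```python
# Analytic least squares: b1 = r * (SDy / SDx), then b0 = My - b1 * Mx.

def mean(v):
    return sum(v) / len(v)

def sd(v):
    m = mean(v)
    return (sum((a - m) ** 2 for a in v) / len(v)) ** 0.5

def analytic_fit(x, y):
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)
    r = cov / (sd(x) * sd(y))
    b1 = r * (sd(y) / sd(x))   # slope
    b0 = my - b1 * mx          # intercept
    return b0, b1
```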
Z-scored regression
always has an intercept of 0 and a slope equal to the correlation, so b0 = 0 and b1 = r. The lowest-MSE prediction line is ẑy = r × zxi (predicted z-score of y = correlation × z-score of x)
Mean centering
subtract the mean of x from every x value (xi − Mx); this makes b0 the predicted y at the mean of x, making it directly interpretable
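A sketch of the mean-centering card: after centering x, the fitted intercept equals My, the predicted y at the mean of x (the least-squares line passes through (Mx, My)). The slope and intercept formulas follow the analytic-method card; the data in the check are made up:

```python
# Mean centering: fit after subtracting Mx from every x value, so the
# intercept b0 becomes the predicted y at the mean of x.

def mean(v):
    return sum(v) / len(v)

def sd(v):
    m = mean(v)
    return (sum((a - m) ** 2 for a in v) / len(v)) ** 0.5

def analytic_fit(x, y):
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)
    r = cov / (sd(x) * sd(y))
    b1 = r * (sd(y) / sd(x))
    return my - b1 * mx, b1  # (b0, b1)

def fit_mean_centered(x, y):
    mx = mean(x)
    xc = [a - mx for a in x]   # mean-center x
    return analytic_fit(xc, y) # b0 now equals My
```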
R²
a scaled index of model fit: R² = 1 − MSE/Var(y). When the model is perfect, R² = 1; when it does no better than the mean, R² = 0. ALSO equals the correlation squared (in simple regression)
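Both identities on the R² card can be verified on made-up data with a short sketch (population formulas throughout):

```python
# R² two ways: 1 - MSE/Var(y), and (in simple regression) r squared.

def mean(v):
    return sum(v) / len(v)

def var(v):
    m = mean(v)
    return sum((a - m) ** 2 for a in v) / len(v)

def corr(x, y):
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)
    return cov / (var(x) ** 0.5 * var(y) ** 0.5)

def r_squared(x, y):
    # fit the least-squares line, then apply R² = 1 - MSE / Var(y)
    r = corr(x, y)
    b1 = r * (var(y) ** 0.5 / var(x) ** 0.5)
    b0 = mean(y) - b1 * mean(x)
    mse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y)) / len(x)
    return 1 - mse / var(y)
```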
Comparing models
compare the R² values of different models; the one closest to 1 is the best fit. Multiple regression allows you to predict y from multiple x variables
Model Errors
E = measurement error plus variables not in the model; the residual left after accounting for the model
Q-Q plot
plot z-scored residuals against z-scores from a standard normal distribution; if the residuals are normal, the points should fall on a straight line with a slope of 1
homoscedasticity
the variance of the residuals should be about equal across all xi values
Linear check
predicted values (ŷ) should be equally likely to be bigger or smaller than the observed values of y across the range of x
overfitting
models with more predictors risk explaining what is actually just noise rather than something systematic, so they fail to describe the relationship between x and y in the real world
Adjusted R²
a modified version of R² that is higher for a more complex model only if the additional parameters improve fit more than would be expected by chance; if adding parameters raises R² but not adjusted R², the added parameters are not adding value
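The card does not give a formula, but one common definition of adjusted R² (an assumption here, not from the deck) penalizes the number of predictors p relative to the sample size n:

```python
# A common adjusted-R² formula (assumed, not stated on the card):
# adj R² = 1 - (1 - R²) * (n - 1) / (n - p - 1),
# where n = sample size and p = number of predictors.

def adjusted_r2(r2, n, p):
    return 1 - (1 - r2) * (n - 1) / (n - p - 1)
```

Because the penalty term is at least 1, adjusted R² is never larger than R², which is why a complex model must earn its extra parameters.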
interpolation
estimating y^ values for unmeasured xi values that are within the range of the data
extrapolation
estimating ŷ values for unmeasured xi values that are outside the range of the data; limited, because most variables cannot keep increasing or decreasing indefinitely
RMSE
root mean square error: the square root of the MSE (square it to get the MSE back); expressed in the same units as y
SSE
sum of squared errors: the residuals squared and summed
SSR
measures the distance between the predicted values on your regression line and the mean (average) of the data: SSR = Σ(ŷi − My)², the sum of squared predicted deviations; the variance of the predictions is the average of the SSR (SSR / n)
“Standardized” on Table
the standardized b1: the slope computed from z-scored variables (equal to r in simple regression)