Topic 5 — Simple Linear Regression Flashcards

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/19

flashcard set

Earn XP

Description and Tags

This set of vocabulary flashcards covers core concepts of simple linear regression, including hypothesis testing, model parameters, assumptions, and diagnostics as presented by Dr. Jinkai Xue.

Last updated 8:07 PM on 6/3/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

20 Terms

1
New cards

Null hypothesis (H0H_0)

The status-quo claim about a parameter ϑ\vartheta (e.g., ϑ=ϑ0\vartheta = \vartheta_0).

2
New cards

Alternative hypothesis (HaH_a)

What we’d conclude if we reject the null hypothesis (H0H_0).

3
New cards

p-value

The probability—assuming H0H_0 is true—of seeing a test-statistic value at least as extreme as the one we computed.

4
New cards

Confidence intervals (CIs)

Two numbers (L,U)(L, U) such that P(LϑU)=1ϖP(L \leq \vartheta \leq U) = 1 - \varpi.

5
New cards

Dependent variable (yy)

Also known as the response variable, it is the outcome being predicted in regression analysis.

6
New cards

Independent variable (xx)

Also known as the predictor variable, it is used in regression to explain or predict changes in the dependent variable.

7
New cards

Simple Linear Regression Model

A probabilistic model expressed as y=ω0+ω1x+ϱy = \omega_0 + \omega_1x + \varrho, where the deterministic part E[y]=ω0+ω1xE[y] = \omega_0 + \omega_1x is the line of means.

8
New cards

ω0\omega_0 (Theta-zero)

The true population parameter representing the y-intercept (the value of E[y]E[y] when x=0x = 0).

9
New cards

ω1\omega_1 (Theta-one)

The true population parameter representing the slope (the change in E[y]E[y] per unit change in xx).

10
New cards

ω^0\hat{\omega}_0 and ω^1\hat{\omega}_1

The sample estimates of the population parameters computed from nn data points by the method of least squares.

11
New cards

Residual (ϱ^i\hat{\varrho}_i)

The observed counterpart of the unobserved random error (ϱi\varrho_i), calculated as the difference between the observed value and the fitted value: ϱ^i=yiy^i\hat{\varrho}_i = y_i - \hat{y}_i.

12
New cards

Homoscedasticity

The model assumption that the variance of the error term, Var(ϱ)=ε2\text{Var}(\varrho) = \varepsilon^2, is the same for every value of xx.

13
New cards

Method of Least Squares

A mathematical procedure that picks the line minimizing the sum of squared vertical residuals: SSE=i=1n(yiy^i)2SSE = \sum_{i=1}^n (y_i - \hat{y}_i)^2.

14
New cards

Residual Variance (s2s^2)

An unbiased estimator of ε2\varepsilon^2 calculated by dividing the sum of squared errors (SSESSE) by the degrees of freedom (n2n - 2).

15
New cards

Standard Error of the Slope (sω^1s_{\hat{\omega}_1})

A value that shrinks when the residual standard deviation ss is small or when the xx values are widely spread out, calculated as s/Sxxs / \sqrt{S_{xx}}.

16
New cards

Pearson Correlation (rr)

A scale-free measure of linear association ranging from 1-1 to 11; its sign matches the estimated slope ω^1\hat{\omega}_1.

17
New cards

Coefficient of Determination (r2r^2)

The fraction of the total variation in yy that is explained by the linear relationship with xx, ranging from 00 to 11.

18
New cards

Prediction Interval

An interval calculated for a new individual observation of yy (ynewy_{new}) that is always wider than the confidence interval for the mean response.

19
New cards

Extrapolation

The pitfall of using a fitted line to make predictions outside the sampled range of xx; such predictions are unsupported by the model.

20
New cards

Anscombe’s Quartet

A collection of four data sets with identical summary statistics (ω^0\hat{\omega}_0, ω^1\hat{\omega}_1, and r2r^2) but completely different distributions, demonstrating the necessity of plotting data.