Topic 5 — Simple Linear Regression Flashcards

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/19

Earn XP

Description and Tags

This set of vocabulary flashcards covers core concepts of simple linear regression, including hypothesis testing, model parameters, assumptions, and diagnostics as presented by Dr. Jinkai Xue.

Last updated 8:07 PM on 6/3/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

20 Terms

1

New cards

Null hypothesis ( $H_0$ )

The status-quo claim about a parameter $\vartheta$ (e.g., $\vartheta = \vartheta_0$ ).

2

New cards

Alternative hypothesis ( $H_a$ )

What we’d conclude if we reject the null hypothesis ( $H_0$ ).

3

New cards

p-value

The probability—assuming $H_0$ is true—of seeing a test-statistic value at least as extreme as the one we computed.

4

New cards

Confidence intervals (CIs)

Two numbers $(L, U)$ such that $P(L \leq \vartheta \leq U) = 1 - \varpi$ .

5

New cards

Dependent variable ( $y$ )

Also known as the response variable, it is the outcome being predicted in regression analysis.

6

New cards

Independent variable ( $x$ )

Also known as the predictor variable, it is used in regression to explain or predict changes in the dependent variable.

7

New cards

Simple Linear Regression Model

A probabilistic model expressed as $y = \omega_0 + \omega_1x + \varrho$ , where the deterministic part $E[y] = \omega_0 + \omega_1x$ is the line of means.

8

New cards

$\omega_0$ (Theta-zero)

The true population parameter representing the y-intercept (the value of $E[y]$ when $x = 0$ ).

9

New cards

$\omega_1$ (Theta-one)

The true population parameter representing the slope (the change in $E[y]$ per unit change in $x$ ).

10

New cards

$\hat{\omega}_0$ and $\hat{\omega}_1$

The sample estimates of the population parameters computed from $n$ data points by the method of least squares.

11

New cards

Residual ( $\hat{\varrho}_i$ )

The observed counterpart of the unobserved random error ( $\varrho_i$ ), calculated as the difference between the observed value and the fitted value: $\hat{\varrho}_i = y_i - \hat{y}_i$ .

12

New cards

Homoscedasticity

The model assumption that the variance of the error term, $\text{Var}(\varrho) = \varepsilon^2$ , is the same for every value of $x$ .

13

New cards

Method of Least Squares

A mathematical procedure that picks the line minimizing the sum of squared vertical residuals: $SSE = \sum_{i=1}^n (y_i - \hat{y}_i)^2$ .

14

New cards

Residual Variance ( $s^2$ )

An unbiased estimator of $\varepsilon^2$ calculated by dividing the sum of squared errors ( $SSE$ ) by the degrees of freedom ( $n - 2$ ).

15

New cards

Standard Error of the Slope ( $s_{\hat{\omega}_1}$ )

A value that shrinks when the residual standard deviation $s$ is small or when the $x$ values are widely spread out, calculated as $s / \sqrt{S_{xx}}$ .

16

New cards

Pearson Correlation ( $r$ )

A scale-free measure of linear association ranging from $-1$ to $1$ ; its sign matches the estimated slope $\hat{\omega}_1$ .

17

New cards

Coefficient of Determination ( $r^2$ )

The fraction of the total variation in $y$ that is explained by the linear relationship with $x$ , ranging from $0$ to $1$ .

18

New cards

Prediction Interval

An interval calculated for a new individual observation of $y$ ( $y_{new}$ ) that is always wider than the confidence interval for the mean response.

19

New cards

Extrapolation

The pitfall of using a fitted line to make predictions outside the sampled range of $x$ ; such predictions are unsupported by the model.

20

New cards

Anscombe’s Quartet

A collection of four data sets with identical summary statistics ( $\hat{\omega}_0$ , $\hat{\omega}_1$ , and $r^2$ ) but completely different distributions, demonstrating the necessity of plotting data.