MAT 3522: Linear Models and Design of Experiments - Review Flashcards

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/31

flashcard set

Earn XP

Description and Tags

These vocabulary flashcards cover random vectors and matrices, simple and multiple linear regression, goodness of fit measures, regression assumptions, and statistical inference as presented in the MAT 3522 lecture notes.

Last updated 8:15 PM on 6/24/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

32 Terms

1
New cards

Random Vector

A vector whose components are random variables, such as X=(X1X2)X = \begin{pmatrix} X_1 \\ X_2 \end{pmatrix}.

2
New cards

Mean Vector

For a random vector XX, the mean vector is defined as E(X)=μ=(E(X1)E(X2)E(Xp))E(X) = \mu = \begin{pmatrix} E(X_1) \\ E(X_2) \\ \vdots \\ E(X_p) \end{pmatrix}.

3
New cards

Covariance

A measure of the direction of the linear relationship between two random variables XX and YY, defined by Cov(X,Y)=E[(XμX)(YμY)]Cov(X, Y) = E[(X - \mu_X)(Y - \mu_Y)] or E(XY)E(X)E(Y)E(XY) - E(X)E(Y).

4
New cards

Positive Covariance (Cov(X,Y)>0Cov(X, Y) > 0)

Indicates that the variables XX and YY tend to increase together.

5
New cards

Negative Covariance (Cov(X,Y)<0Cov(X, Y) < 0)

Indicates that one variable tends to increase while the other variable decreases.

6
New cards

Zero Covariance (Cov(X,Y)=0Cov(X, Y) = 0)

Indicates that there is no linear relationship between the two variables.

7
New cards

Variance-Covariance Matrix (Σ\Sigma)

A symmetric matrix where the diagonal elements are variances (σii=Var(Xi)\sigma_{ii} = Var(X_i)) and off-diagonal elements are covariances (σij=Cov(Xi,Xj)\sigma_{ij} = Cov(X_i, X_j)).

8
New cards

Positive Semi-definite

A mathematical property that all covariance matrices possess.

9
New cards

Simple Linear Regression

A statistical technique used to model the relationship between one response variable and one explanatory variable using a straight-line population model: Yi=β0+β1Xi+εiY_i = \beta_0 + \beta_1 X_i + \varepsilon_i.

10
New cards

Intercept (β0\beta_0)

The predicted value of the response variable YY when the explanatory variable XX is zero.

11
New cards

Slope (β1\beta_1)

The amount the predicted value of YY increases for every one-unit increase in the explanatory variable XX.

12
New cards

Estimated Regression Equation

The fitted model using sample data given by Y^=b0+b1X\hat{Y} = b_0 + b_1 X, where b0b_0 and b1b_1 are the estimated intercept and slope.

13
New cards

Homoscedasticity

The assumption for regression that the variance of the error terms remains constant, mathematically expressed as Var(εi)=σ2Var(\varepsilon_i) = \sigma^2.

14
New cards

Least Squares Estimation

A method of estimating regression coefficients by minimizing the sum of squared residuals (SSE=i=1n(YiY^i)2SSE = \sum_{i=1}^{n} (Y_i - \hat{Y}_i)^2).

15
New cards

Residual (eie_i)

The difference between the observed value and the predicted value, defined as ei=YiY^ie_i = Y_i - \hat{Y}_i. It measures the prediction error.

16
New cards

Multiple Linear Regression

A statistical technique used to model the relationship between one response variable and two or more explanatory variables using the model Yi=β0+β1X1i+β2X2i++βkXki+εiY_i = \beta_0 + \beta_1 X_{1i} + \beta_2 X_{2i} + \dots + \beta_k X_{ki} + \varepsilon_i.

17
New cards

Perfect Multicollinearity

A situation where explanatory variables are perfectly linearly related (e.g., X2=2X1X_2 = 2X_1), making it impossible to uniquely estimate regression coefficients.

18
New cards

Design Matrix (XX)

In the matrix form Y=Xβ+εY = X\beta + \varepsilon, a matrix where the first column is ones for the intercept term and the remaining columns contain the explanatory variable data.

19
New cards

Least Squares Estimator (Matrix Form)

The formula used to estimate all regression coefficients simultaneously: β^=(XX)1XY\hat{\beta} = (X'X)^{-1} X'Y.

20
New cards

Goodness of Fit

The extent to which the fitted regression equation adequately describes the relationship between the response and explanatory variables.

21
New cards

Total Sum of Squares (SST)

Measures the total variation present in the response variable, defined as SST=i=1n(YiYˉ)2SST = \sum_{i=1}^{n} (Y_i - \bar{Y})^2.

22
New cards

Regression Sum of Squares (SSR)

Measures the variation in the response variable that is explained by the regression model, defined as SSR=i=1n(Y^iYˉ)2SSR = \sum_{i=1}^{n} (\hat{Y}_i - \bar{Y})^2.

23
New cards

Error Sum of Squares (SSE)

Measures variation not explained by the regression model, defined as SSE=i=1n(YiY^i)2SSE = \sum_{i=1}^{n} (Y_i - \hat{Y}_i)^2 or i=1nei2\sum_{i=1}^{n} e_i^2.

24
New cards

Coefficient of Determination (R2R^2)

The proportion of variation in the response variable explained by the regression model, calculated as R2=SSRSSTR^2 = \frac{SSR}{SST}.

25
New cards

Adjusted Coefficient of Determination (Radj2R^2_{adj})

A measure of model performance that rewards useful variables and penalizes unnecessary variables, defined as Radj2=1(SSE/(nk1)SST/(n1))R^2_{adj} = 1 - \left( \frac{SSE/(n - k - 1)}{SST/(n - 1)} \right).

26
New cards

Residual Plot

A graph of residuals (eie_i) against fitted values (Y^i\hat{Y}_i) used as a diagnostic tool for checking regression assumptions.

27
New cards

Heteroscedasticity

A pattern in a residual plot exhibiting a funnel shape, indicating that the constant variance assumption has been violated.

28
New cards

t-Test for Slope

A hypothesis test where H0:β1=0H_0: \beta_1 = 0 against H1:β10H_1: \beta_1 \neq 0 to determine if an explanatory variable significantly affects the response.

29
New cards

Mean Square Error (MSE)

An estimator of error variance (σ2\sigma^2), calculated as MSE=SSEn2MSE = \frac{SSE}{n - 2} in simple regression or MSE=SSEnk1MSE = \frac{SSE}{n - k - 1} in multiple regression.

30
New cards

Overall F-Test

A test of the significance of the entire regression model where H0:β1=β2==βk=0H_0: \beta_1 = \beta_2 = \dots = \beta_k = 0 against H1:At least one βj0H_1: \text{At least one } \beta_j \neq 0, using the statistic F=MSRMSEF = \frac{MSR}{MSE}.

31
New cards

Confidence Interval

A range of plausible values for an unknown population parameter, constructed using the general form: Estimate±(CriticalValue)×(StandardError)Estimate \pm (Critical Value) \times (Standard Error).

32
New cards

Partial Effect

An interpretation of a regression coefficient in multiple regression representing the effect of one variable while holding all other variables constant.