Linear Regression Analysis

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall with Kai
GameKnowt Play
New
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/79

flashcard set

Earn XP

Description and Tags

This collection of flashcards covers key vocabulary and concepts related to Linear Regression Analysis, including model structures, statistical tests, and measures of goodness-of-fit.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

80 Terms

1
New cards

Population Regression Model

A regression model that represents the relationship between a dependent variable and independent variables in a population.

2
New cards

Marginal Effect

The change in the dependent variable resulting from a one-unit increase in an independent variable while holding other independent variables constant.

3
New cards

Sample Regression Function (SRF)

The predicted relationship used to estimate the dependent variable from independent variables based on a sample.

4
New cards

Sample Residual

The difference between the actual observed value and the predicted value from the sample regression function.

5
New cards

Minimizing Sum of Squared Residuals

The process in regression analysis to estimate parameters by minimizing the total of the squared differences between the observed and predicted values.

6
New cards

Goodness-of-Fit

A measure of how well the regression model predicts actual data.

7
New cards

Coefficient of Determination (R-squared)

A statistical measure that represents the proportion of the variance for the dependent variable that is explained by independent variables.

8
New cards

Adjusted R-squared

A modified version of R-squared that adjusts for the number of predictors in a model.

9
New cards

Unexplained Sum of Squares (USS)

The portion of the total variability in the observed data that is not explained by the independent variables.

10
New cards

Total Sum of Squares (TSS)

The total variance in the dependent variable; the sum of explained and unexplained variances.

11
New cards

Statistical Hypothesis Test

A method for testing a hypothesis about a population parameter based on sample data.

12
New cards

Null Hypothesis (H0)

A statement indicating no effect or no difference, used as a starting point for statistical testing.

13
New cards

Alternative Hypothesis (H1)

The hypothesis that indicates the presence of an effect or difference.

14
New cards

Level of Significance

The probability of rejecting the null hypothesis when it is true, commonly denoted as alpha (α).

15
New cards

Rejection Rule

The criteria set for rejecting the null hypothesis based on the statistical test results.

16
New cards

F-Test

A statistical test used to assess the overall significance of a linear regression model.

17
New cards

F-Statistic

A ratio used in the F-Test to determine whether the variability explained by the model is significantly greater than unexplained variability.

18
New cards

Right-tailed Test

A statistical test where the critical region is on the right side of the distribution.

19
New cards

F-Distribution

The probability distribution of the F-statistic under the null hypothesis.

20
New cards

Critical Value

A threshold value that the test statistic must exceed to reject the null hypothesis.

21
New cards

t-test

A statistical test used to determine if there is a significant difference between the means of two groups.

22
New cards

Exclusion Restriction Test

A test determining whether specific independent variables can be omitted from the regression model without significantly affecting its explanatory power.

23
New cards

Unrestricted Regression Model

A regression model that includes all possible predictors without any constraints.

24
New cards

Restricted Regression Model

A regression model that imposes restrictions on certain coefficients, usually setting them to zero.

25
New cards

Structural Differences Test (Chow Test)

A statistical test to determine if the regression coefficients differ across two or more groups.

26
New cards

Pooled Regression

A regression model that combines data from different groups to evaluate overall relationships.

27
New cards

Separate Regressions

Separate linear regression models run on different subsets of the data.

28
New cards

Unexplained Sum of Squares (USS) Comparison

The process of comparing the USS from different regression models to assess structural differences.

29
New cards

Coefficient Equality Hypothesis

The hypothesis that regression coefficients are equal across different models or groups.

30
New cards

R-squared Formula

R² = Explained Variation/Total Variation in y.

31
New cards

Explanatory Variable

The independent variable(s) in regression that are used to predict the value of the dependent variable.

32
New cards

Dependent Variable

The outcome variable that is being predicted in regression analysis.

33
New cards

Independent Variable

The variable that is manipulated or changed to observe its effect on the dependent variable.

34
New cards

Parameter Estimation

The process of estimating the values of parameters in a regression model.

35
New cards

Variance

A measure of the dispersion of a set of values.

36
New cards

Statistical Significance

A determination that a relationship or effect is unlikely to be due to chance.

37
New cards

P-value

The probability of obtaining a statistic as extreme as the test statistic, assuming the null hypothesis is true.

38
New cards

Type I Error

The error of rejecting a true null hypothesis (false positive).

39
New cards

Type II Error

The error of failing to reject a false null hypothesis (false negative).

40
New cards

Model Fit

How well a statistical model encompasses the data it is supposed to represent.

41
New cards

Residual Analysis

An examination of the residuals to assess the fit of a regression model.

42
New cards

Regression Coefficient

A value that represents the relationship between a given independent variable and the dependent variable.

43
New cards

Standard Error of Estimate

An indication of the accuracy of predictions made with a regression analysis.

44
New cards

Likelihood Ratio Test

A statistical test used to compare the goodness of fit of two models.

45
New cards

Multicollinearity

A situation in statistical models where two or more independent variables are highly correlated.

46
New cards

Heteroscedasticity

A condition in regression analysis where the variance of errors varies across observations.

47
New cards

Linearity Assumption

The assumption that the relationship between independent and dependent variables is linear.

48
New cards

Independence of Errors

The assumption that the residuals are statistically independent.

49
New cards

Normality of Errors

The assumption that the residuals are normally distributed.

50
New cards

Fisher's F-Test

A test to determine if there are significant differences between the variances of populations.

51
New cards

R-squared Adjustments

Adjustments to R-squared that account for the number of predictors in the model.

52
New cards

Parameter Significance Testing

The process of testing whether individual regression coefficients are significantly different from zero.

53
New cards

Model Specification

The process of selecting the correct variables and functional form for a regression model.

54
New cards

Residuals

The differences between observed and predicted values in a regression model.

55
New cards

Total Variation

The overall variability in a dataset.

56
New cards

Explained Variation

The portion of the total variation that is accounted for by the model.

57
New cards

Dummy Variables

Binary variables used to represent categories in regression analysis.

58
New cards

Endogeneity

A situation in which an explanatory variable is correlated with the error term.

59
New cards

Outlier

An observation that lies an abnormal distance from other values in a dataset.

60
New cards

Influential Observation

An observation that significantly affects the estimate of the regression coefficients.

61
New cards

Statistical Software

Programs used to perform statistical analysis and regression analysis.

62
New cards

Bootstrap Methods

A resampling technique used to estimate the distribution of a statistic.

63
New cards

Panel Data

Data that combines cross-sectional and time-series data.

64
New cards

Time-Series Analysis

Analysis of data points collected or recorded at specific time intervals.

65
New cards

Cross-Sectional Data

Data collected at a single point in time across multiple subjects.

66
New cards

Regression Diagnostics

Methods for checking the validity and appropriateness of a regression model.

67
New cards

Sample Size Determination

The process of calculating the number of observations required for a study.

68
New cards

Statistical Power

The probability that a statistical test will correctly reject a false null hypothesis.

69
New cards

Covariance

A measure of how much two random variables change together.

70
New cards

Correlation Coefficient

A statistical measure that describes the strength and direction of a relationship between two variables.

71
New cards

Predictive Modeling

The process of creating a statistical model to predict future outcomes.

72
New cards

Model Validation

The process of assessing the performance of a model using new data.

73
New cards

Variance Inflation Factor (VIF)

A measure of how much the variance of a regression coefficient is inflated due to multicollinearity.

74
New cards

Principal Component Analysis (PCA)

A statistical technique used to reduce the dimensionality of data.

75
New cards

Chow Test

A test for structural breaks in regression analysis.

76
New cards

Residual plots

Graphs that plot residuals on the x-axis against predicted values or another variable on the y-axis.

77
New cards

Autocorrelation

The correlation of a signal with a delayed copy of itself.

78
New cards

Cross-Validation

A technique for assessing how the results of a statistical analysis will generalize to an independent dataset.

79
New cards

Machine Learning Regression

A subset of machine learning techniques focused on predicting numerical outcomes.

80
New cards

Data Transformation

The process of converting data from one format or structure into another to meet the assumptions of a model.