Business Analytics - Linear Regression

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/74

flashcard set

Earn XP

Description and Tags

These flashcards cover key concepts, definitions, and important terms related to linear regression analysis as discussed in the Business Analytics lecture.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

75 Terms

1
New cards

Simple Linear Regression Model

A model that estimates how a dependent variable is related to a single independent variable.

2
New cards

Dependent Variable

The variable being predicted in a regression model.

3
New cards

Independent Variable

The variable used to predict the value of the dependent variable.

4
New cards

Multiple Linear Regression Model

A regression model that estimates the relationship between a dependent variable and two or more independent variables.

5
New cards

Least Squares Method

A statistical method used to determine the best-fitting line by minimizing the sum of squared residuals.

6
New cards

Coefficient of Determination (R²)

A statistical measure that explains the proportion of variance in the dependent variable that can be predicted from the independent variables.

7
New cards

Error Term

The difference between the observed value and the predicted value in a regression model.

8
New cards

Extrapolation

The act of predicting values outside the range of observed data, which can be risky.

9
New cards

Residuals

The errors made in estimating the value of the dependent variable for each observation.

10
New cards

Dummy Variables

Binary variables created to represent categorical variables in regression analysis.

11
New cards

Hypothesis Testing

A statistical method that uses sample data to evaluate a hypothesis about a population parameter.

12
New cards

Statistical Significance

A determination that an effect or relationship is likely not due to random chance.

13
New cards

ANOVA

Analysis of variance, a statistical technique used to determine if there are significant differences between the means of three or more groups.

14
New cards

Model Fitting

The process of adjusting a model so it best represents the observed data.

15
New cards

Curvilinear Relationship

A relationship between two variables that is not a straight line, often represented as a curve.

16
New cards

Interaction Term

A variable that represents the combined effect of two or more independent variables on the dependent variable.

17
New cards

Piecewise Linear Regression

A type of regression that allows for different linear relationships in different intervals of the independent variable.

18
New cards

Overfitting

A modeling error that occurs when a model is too complex and captures noise instead of the underlying pattern.

19
New cards

Confidence Interval

An estimated range of values that is likely to include an unknown population parameter.

20
New cards

Prediction Interval

An estimate of the range of possible values for a new observation based on the regression model.

21
New cards

Multicollinearity

A situation in which two or more independent variables in a regression model are highly correlated.

22
New cards

Standard Error

An estimate of the variability between the sample statistic and the population parameter.

23
New cards

Statistical Software

Programs used to perform statistical analyses and computations.

24
New cards

Regression Output

The results from a regression analysis, including coefficients, standard errors, and significance levels.

25
New cards

Model Selection Procedures

Methods used to determine which variables to include in a regression model.

26
New cards

Quadratic Regression Model

A regression model that includes the square of an independent variable to account for curvilinear relationships.

27
New cards

Inference

The process of drawing conclusions about a population based on sample data.

28
New cards

Big Data

Extremely large data sets that may be analyzed computationally to reveal patterns and trends.

29
New cards

Basic Regression Equation

An equation that expresses a dependent variable as a function of one or more independent variables.

30
New cards

Experimental Region

The range of values for independent variables in the data used to estimate the regression model.

31
New cards

Hypotheses

Statements that can be tested statistically in a regression analysis.

32
New cards

Sums of Squares

Measure of variation in data; includes total, regression, and error sums of squares.

33
New cards

Statistical Procedures

Step-by-step methodologies used to analyze data and draw conclusions.

34
New cards

Residual Sum of Squares (RSS)

The sum of the squares of residuals, used to measure the discrepancy between observed and predicted values.

35
New cards

Estimate of Variance

A calculation that assesses the variability of the dependent variable.

36
New cards

Normal Distribution

A probability distribution that is symmetric about the mean, showing that data near the mean are more frequent in occurrence.

37
New cards

Explanation of Variability

The portion of variance in a dependent variable that can be explained by the independent variable(s) in a regression model.

38
New cards

Residual Plot

A graphical representation that shows the residuals on the Y-axis and fitted values on the X-axis.

39
New cards

Statistical Independence

The condition where the occurrence of one event does not affect the probability of another event.

40
New cards

Sampling Distribution

The probability distribution of a given statistic based on a random sample.

41
New cards

Quadratic Terms

Variables in a model that represent the square of independent variables, allowing for curvature in the regression line.

42
New cards

Variable Selection

The process of selecting the most important variables to include in a regression model.

43
New cards

Significant Level (α)

The threshold for determining if a result is statistically significant; commonly set at 0.05.

44
New cards

t Test

A statistical test used to compare the means of one or two groups to determine if they are significantly different.

45
New cards

F Test

A statistical test used to compare two variances to determine if they are significantly different.

46
New cards

Identification of Relationships

The process of discovering relationships between variables based on statistical analyses.

47
New cards

Explanatory Variable

Another term for an independent variable, used to explain variations in the dependent variable.

48
New cards

Statistical Relationship

A relationship that indicates a statistical correlation or association between variables.

49
New cards

Random Sample

A subset of individuals chosen from a larger set, ensuring each individual has an equal chance of being selected.

50
New cards

Numerical Methods

Algorithms that use numerical approximation for the solutions of mathematical problems.

51
New cards

Confidence Level

The probability that the value of a parameter falls within a specified range of values.

52
New cards

Sampling Error

The difference between the sample statistic and the actual population parameter.

53
New cards

Regression Coefficient

A measure of the change in the dependent variable associated with a one-unit change in an independent variable.

54
New cards

Data Exploration

The process of analyzing data sets to summarize their main characteristics, often using visual methods.

55
New cards

Statistical Analysis

The science of collecting and interpreting data through various mathematical techniques.

56
New cards

Independent Variable Assessment

The evaluation of the effects that different independent variables have on the dependent variable.

57
New cards

Regression Diagnostics

Procedures used to check the validity of regression models.

58
New cards

Statistical Theory

The framework consisting of principles and mathematical foundations for statistical practice.

59
New cards

Computational Statistics

A basic branch of statistics that focuses on algorithms that facilitate statistical analysis.

60
New cards

Resistance to Outliers

The ability of a statistical measure or method to not be unduly affected by extreme values.

61
New cards

Error Variance

The variability in the dependent variable that cannot be explained by the independent variable(s) in the model.

62
New cards

Adjusted R²

A modified version of R² that has been adjusted for the number of predictors in the model.

63
New cards

Observed Frequency

The count of how many times an outcome is observed in the sample.

64
New cards

Bivariate Analysis

An analysis of two variables to understand their relationship.

65
New cards

Statistical Model

A mathematical representation of observed data and the relationships between the variables.

66
New cards

Characteristic Curve

A plot that visualizes the relationship between two variables in regression analysis.

67
New cards

Model Validation

The process of checking if the statistical model is applicable to the data.

68
New cards

Random Assignment

Assigning participants to groups by chance, to ensure each participant has an equal chance of being included.

69
New cards

Error Analysis

The study of the types and causes of errors in analytical datasets.

70
New cards

Regression Assumptions

The basic requirements that a regression model must satisfy for valid results.

71
New cards

Non-linear Regression

A type of regression analysis where the relationship between the variables is modeled as a non-linear equation.

72
New cards

Dependent Observations

Measurements of the same subject taken over time; these are not independent of one another.

73
New cards

Testing Variability

Investigating the extent to which measures of data vary.

74
New cards

Null Hypothesis

The hypothesis that there is no effect or no difference, used as a default or starting assumption.

75
New cards

Statistical Reporting

Presenting the results of statistical analyses in a clear and interpretable format.