A set of vocabulary flashcards covering key concepts in linear regression and business analytics.
Simple Linear Regression
A statistical method that models the relationship between a dependent variable and one independent variable.
Dependent Variable
The variable being predicted in a regression analysis.
Independent Variable
A variable used to predict the value of the dependent variable.
Multiple Linear Regression
A regression analysis that involves two or more independent variables.
Error Term
The part of the dependent variable that cannot be explained by the independent variables in a model.
Coefficient of Determination (r²)
A statistical measure of the proportion of variance in the dependent variable that is explained by the independent variable(s).
Least Squares Method
A statistical technique used to determine the best-fitting line or model by minimizing the sum of the squares of the residuals.
Residuals
The differences between the observed values and the predicted values in a regression analysis.
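To make the least squares, residual, and r² cards concrete, here is a minimal NumPy sketch; the data and variable names are made up purely for illustration.

```python
import numpy as np

# Illustrative data (hypothetical advertising spend vs. sales).
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 9.9, 12.3])

# Least squares estimates of slope and intercept:
# b1 = Sxy / Sxx, b0 = ybar - b1 * xbar
x_bar, y_bar = x.mean(), y.mean()
b1 = np.sum((x - x_bar) * (y - y_bar)) / np.sum((x - x_bar) ** 2)
b0 = y_bar - b1 * x_bar

# Fitted values and residuals (observed minus predicted).
y_hat = b0 + b1 * x
residuals = y - y_hat

# Coefficient of determination: 1 - SSE / SST.
sse = np.sum(residuals ** 2)
sst = np.sum((y - y_bar) ** 2)
r_squared = 1 - sse / sst

print(f"slope={b1:.3f}, intercept={b0:.3f}, r^2={r_squared:.3f}")
```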
Goodness of Fit
A measure of how well a statistical model fits the data.
Extrapolation
The act of estimating values outside the range of the data used to fit the model.
Dummy Variable
A variable coded 0 or 1 that is used in regression analysis to represent categorical data.
Interaction Term
A variable that represents the interaction between two or more independent variables in regression analysis.
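A short pandas sketch of the last two cards, using a hypothetical price/region/sales table; pd.get_dummies creates the dummy column, and the interaction term is simply the product of two predictors.

```python
import pandas as pd

# Hypothetical data: price, region (categorical), and sales.
df = pd.DataFrame({
    "price": [10, 12, 9, 11, 13, 8],
    "region": ["East", "West", "East", "West", "East", "West"],
    "sales": [100, 80, 110, 85, 95, 120],
})

# Dummy variable: encode the categorical 'region' as a 0/1 column.
df = pd.get_dummies(df, columns=["region"], drop_first=True, dtype=int)

# Interaction term: the product of two predictors, letting the
# effect of price differ between regions.
df["price_x_west"] = df["price"] * df["region_West"]

print(df)
```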
Quadratic Regression
A form of regression analysis in which the relationship between the independent variable and the dependent variable is modeled as a second-degree polynomial.
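A quadratic fit is still estimated by least squares, just with x and x² as predictors; the sketch below uses np.polyfit on made-up data.

```python
import numpy as np

# Hypothetical data with a curved relationship.
x = np.array([1, 2, 3, 4, 5, 6, 7], dtype=float)
y = np.array([2.0, 4.8, 9.1, 15.9, 25.2, 35.8, 49.5])

# Quadratic regression: fit y = b0 + b1*x + b2*x^2 by least squares.
b2, b1, b0 = np.polyfit(x, y, deg=2)  # polyfit returns the highest degree first

y_hat = b0 + b1 * x + b2 * x ** 2
print(f"y_hat = {b0:.2f} + {b1:.2f}*x + {b2:.2f}*x^2")
```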
Piecewise Linear Regression
A regression method that models different linear relationships for different segments of data.
Stepwise Regression
An iterative method for selecting independent variables in a regression model by adding or removing predictors based on specified criteria.
Predictive Accuracy
The degree to which a regression model accurately predicts values and outcomes.
Statistical Independence
Condition in which the probability of one event occurring does not affect the probability of another event occurring.
Confidence Intervals
Range of values that is likely to contain the true parameter of the model, expressed at a certain confidence level.
Prediction Intervals
Range of values that predicts the value of a new observation based on the regression model.
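The difference between the two kinds of interval is easiest to see in code. The sketch below uses the standard simple-regression formulas and SciPy's t distribution on hypothetical data; the prediction interval for a single new observation is always wider than the confidence interval for the mean response at the same x value.

```python
import numpy as np
from scipy import stats

# Hypothetical data for a simple linear regression.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0])
y = np.array([2.3, 3.8, 6.1, 7.7, 10.2, 11.8, 14.1, 15.9])
n = len(x)

# Least squares fit and residual standard deviation.
b1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b0 = y.mean() - b1 * x.mean()
s = np.sqrt(np.sum((y - (b0 + b1 * x)) ** 2) / (n - 2))

x0 = 5.5                      # new value of the independent variable
y0_hat = b0 + b1 * x0         # point prediction
sxx = np.sum((x - x.mean()) ** 2)
t_crit = stats.t.ppf(0.975, df=n - 2)

# Confidence interval for the *mean* response at x0.
se_mean = s * np.sqrt(1 / n + (x0 - x.mean()) ** 2 / sxx)
ci = (y0_hat - t_crit * se_mean, y0_hat + t_crit * se_mean)

# Prediction interval for a *single new* observation at x0 (wider).
se_pred = s * np.sqrt(1 + 1 / n + (x0 - x.mean()) ** 2 / sxx)
pi = (y0_hat - t_crit * se_pred, y0_hat + t_crit * se_pred)

print(f"95% CI for mean response: {ci}")
print(f"95% prediction interval:  {pi}")
```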
Multicollinearity
A phenomenon in multiple regression where independent variables are highly correlated, making it difficult to determine the individual effect of each variable.
Outlier
An observation in a dataset that is distant from other observations, potentially influencing the results of the analysis.
ANOVA
Analysis of variance; a statistical method used to compare the means of three or more samples.
Scatter Plot
A graphical representation of the relationship between two quantitative variables.
Residual Plot
A plot that displays residuals on the vertical axis and fitted values on the horizontal axis, used to assess the fit of a model.
Statistical Software
Computer programs that perform statistical analysis.
Intercept
The predicted value of the dependent variable when all independent variables are equal to zero.
Slope
The change in the dependent variable associated with a one-unit increase in an independent variable.
Effect Size
A quantitative measure of the magnitude of the difference or relationship in a dataset.
Regularization
A technique used in regression that adds a penalty to the loss function to avoid overfitting.
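As one illustration, ridge regression adds an L2 penalty lam * ||beta||^2 to the least-squares objective, which has the closed form beta = (X'X + lam*I)^-1 X'y. A minimal NumPy sketch on simulated data follows; intercept handling is deliberately simplified.

```python
import numpy as np

def ridge_fit(X, y, lam=1.0):
    """Ridge regression: least squares plus an L2 penalty lam * ||beta||^2.

    Closed form: beta = (X'X + lam * I)^-1 X'y.
    An intercept is conventionally left unpenalized; this sketch
    penalizes every coefficient for brevity.
    """
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)

# Simulated data: setting lam = 0 recovers ordinary least squares.
rng = np.random.default_rng(1)
X = rng.normal(size=(40, 3))
y = X @ np.array([1.5, -2.0, 0.5]) + rng.normal(scale=0.3, size=40)
print(ridge_fit(X, y, lam=0.0))   # ~ OLS coefficients
print(ridge_fit(X, y, lam=10.0))  # shrunk toward zero
```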
F-Test
A statistical test for comparing variances; in regression analysis, it is used to test the overall significance of the model.
Standard Error
An estimate of the standard deviation of the sampling distribution of a statistic.
Variance Inflation Factor (VIF)
A measure used to detect the severity of multicollinearity in regression analysis.
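The VIF for predictor j is 1 / (1 - R²_j), where R²_j comes from regressing predictor j on the remaining predictors. A small NumPy sketch with simulated, deliberately collinear data:

```python
import numpy as np

def vif(X):
    """Variance inflation factor for each column of the predictor matrix X.

    VIF_j = 1 / (1 - R_j^2), where R_j^2 comes from regressing
    column j on all the other columns (plus an intercept).
    """
    n, p = X.shape
    vifs = []
    for j in range(p):
        y_j = X[:, j]
        others = np.column_stack([np.ones(n), np.delete(X, j, axis=1)])
        beta, *_ = np.linalg.lstsq(others, y_j, rcond=None)
        resid = y_j - others @ beta
        r2 = 1 - resid @ resid / np.sum((y_j - y_j.mean()) ** 2)
        vifs.append(1 / (1 - r2))
    return np.array(vifs)

# Simulated predictors: x2 is nearly a copy of x1, so both get large VIFs.
rng = np.random.default_rng(0)
x1 = rng.normal(size=50)
x2 = x1 + rng.normal(scale=0.1, size=50)
x3 = rng.normal(size=50)
print(vif(np.column_stack([x1, x2, x3])))
```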
Homoscedasticity
The assumption that the variance of errors is constant across all levels of the independent variable.
Normal Distribution
A probability distribution that is symmetric around the mean, describing a bell-shaped curve.
Sample Size
The number of observations or data points used in a study.
Hypothesis Testing
A statistical procedure that uses sample data to evaluate a hypothesis about a population parameter.
Parametric Tests
Statistical tests that assume a specific distribution for the population from which the sample is drawn.
Nonparametric Tests
Statistical tests that do not assume a specific distribution for the population.
Bootstrap Method
A resampling technique used to estimate statistics on a dataset by sampling with replacement.
Hierarchical Models
Statistical models that incorporate multiple levels of analysis.
Bootstrap Confidence Interval
A method for calculating confidence intervals using resampling techniques.
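Both bootstrap cards above can be illustrated together: resample the data with replacement, recompute the statistic each time, and read a percentile confidence interval off the bootstrap distribution. The data below are simulated.

```python
import numpy as np

# Hypothetical sample (e.g., daily revenue figures).
rng = np.random.default_rng(42)
sample = rng.exponential(scale=100.0, size=60)

# Bootstrap: resample with replacement many times and recompute
# the statistic of interest (here, the mean) each time.
boot_means = np.array([
    rng.choice(sample, size=len(sample), replace=True).mean()
    for _ in range(5000)
])

# Percentile bootstrap confidence interval: the 2.5th and 97.5th
# percentiles of the bootstrap distribution.
lo, hi = np.percentile(boot_means, [2.5, 97.5])
print(f"Sample mean: {sample.mean():.1f}")
print(f"95% bootstrap CI for the mean: ({lo:.1f}, {hi:.1f})")
```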
Model Specification
The process of developing a regression model based on the theoretical framework and the data.
Adjusted R-Squared
A modified version of r-squared that penalizes additional predictors, giving a fairer comparison between models with different numbers of predictors.
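One common form of the adjustment is 1 - (1 - R²)(n - 1)/(n - p - 1), where n is the sample size and p the number of predictors; a tiny sketch showing how extra predictors can lower it:

```python
def adjusted_r2(r2, n, p):
    """Adjusted R^2 = 1 - (1 - R^2) * (n - 1) / (n - p - 1),
    where n is the sample size and p the number of predictors."""
    return 1 - (1 - r2) * (n - 1) / (n - p - 1)

# Adding predictors can only increase plain R^2, but adjusted R^2
# falls if the new predictors do not earn their keep.
print(adjusted_r2(0.80, n=50, p=3))   # ~0.787
print(adjusted_r2(0.81, n=50, p=10))  # ~0.761
```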
Data Transformation
The process of converting a variable to a new scale or form (for example, taking logarithms or square roots), often to improve linearity or stabilize variance.
Categorical Data
Data that can be divided into groups or categories and is often represented with dummy variables.
Extreme Value
A data point that lies far outside the overall distribution, potentially skewing results.
Influential Point
An observation that significantly affects the slope of a regression line.
Forecasting
The process of making predictions about future outcomes based on historical data.
Endogeneity
A situation in a statistical model where an explanatory variable is correlated with the error term.
Sampling Error
The error caused by observing a sample instead of the whole population.
Latent Variable
An unobservable variable that is inferred from observable variables.
Causal Inference
The process of determining whether a relationship between two variables is causal.
Propensity Score Matching
A statistical matching technique that attempts to estimate the effect of a treatment by accounting for covariates that predict receiving the treatment.
Cross-Validation
A method for estimating how well a model will generalize, by repeatedly fitting and evaluating it on different subsets of the data.
Holdout Method
A method for validating a predictive model by partitioning data into training and test sets.
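The two validation cards above differ only in how the data are split. A short scikit-learn sketch on simulated data (the dataset and scores are illustrative):

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split, cross_val_score

# Simulated dataset.
rng = np.random.default_rng(7)
X = rng.normal(size=(200, 4))
y = X @ np.array([2.0, -1.0, 0.5, 0.0]) + rng.normal(scale=1.0, size=200)

# Holdout method: fit on a training set, evaluate on a held-out test set.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)
model = LinearRegression().fit(X_train, y_train)
print("Holdout R^2:", model.score(X_test, y_test))

# Cross-validation: repeat the split k times so every observation is
# used for testing exactly once, then average the scores.
scores = cross_val_score(LinearRegression(), X, y, cv=5, scoring="r2")
print("5-fold CV R^2:", scores.mean())
```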
Residual Analysis
An examination of the residuals from a regression model to check for any violations of assumptions.
Statistical Significance
A determination that a relationship observed in data is not likely to be due to chance.
Statistical Power
The probability that a statistical test will correctly reject a false null hypothesis.
Bayesian Statistics
A statistical paradigm that uses Bayes' theorem to update the probability for a hypothesis as more evidence or information becomes available.
Model Robustness
The ability of a model to perform well across different conditions and assumptions.
Type I Error
The incorrect rejection of a true null hypothesis (false positive).
Type II Error
The failure to reject a false null hypothesis (false negative).
Null Hypothesis
A default hypothesis that there is no effect or no difference.
Alternative Hypothesis
The hypothesis that indicates the presence of an effect or a difference.
Power Analysis
A method to determine the sample size required to detect an effect of a given size.
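Power can also be estimated by simulation: generate data under a specific alternative many times, run the test, and count how often the (false) null hypothesis is rejected. A sketch using SciPy's two-sample t-test; the effect size and sample sizes are arbitrary choices for illustration.

```python
import numpy as np
from scipy import stats

def simulated_power(n_per_group, effect_size=0.5, alpha=0.05,
                    n_sims=2000, seed=0):
    """Estimate the power of a two-sample t-test by simulation."""
    rng = np.random.default_rng(seed)
    rejections = 0
    for _ in range(n_sims):
        a = rng.normal(loc=0.0, scale=1.0, size=n_per_group)
        b = rng.normal(loc=effect_size, scale=1.0, size=n_per_group)
        _, p_value = stats.ttest_ind(a, b)
        if p_value < alpha:  # correctly rejecting a false null hypothesis
            rejections += 1
    return rejections / n_sims

# Power rises with sample size for a fixed effect size.
for n in (20, 40, 64, 100):
    print(n, simulated_power(n))
```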
Chi-Squared Test
A statistical test used to determine if there is a significant association between categorical variables.
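A minimal SciPy example on a hypothetical 2x2 contingency table (ad campaign vs. purchase); the counts are made up.

```python
import numpy as np
from scipy.stats import chi2_contingency

# Hypothetical contingency table: campaign (rows) vs. purchase (columns).
observed = np.array([
    [30, 70],   # campaign A: purchased, did not purchase
    [45, 55],   # campaign B
])

chi2, p_value, dof, expected = chi2_contingency(observed)
print(f"chi2={chi2:.2f}, p={p_value:.4f}, dof={dof}")
```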
Time Series Analysis
Statistical techniques used to analyze time-ordered data points.
Data Visualization
The graphical representation of information and data.
Bivariate Analysis
The analysis of two variables to determine the empirical relationship between them.
Multivariate Analysis
The analysis of more than two variables simultaneously.
Factor Analysis
A statistical method used to identify underlying relationships between variables.
Cluster Analysis
A technique used to group a set of objects in such a way that objects in the same group are more similar to each other than to those in other groups.
Regression Coefficients
The estimates that represent the relationship between each independent variable and the dependent variable.
Sensitivity Analysis
The study of how changes in the input of a model can affect its output.
Influence Function
A measure of the effect of a small change in the data on a statistical estimate.
Residual Standard Deviation
The standard deviation of the residuals, indicating the spread of the residuals around zero.