Business Analytics Final Exam Summary

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/27

flashcard set

Earn XP

Description and Tags

Comprehensive vocabulary flashcards covering Business Analytics concepts including regression, classification, model evaluation, ensemble methods, and artificial intelligence.

Last updated 1:50 AM on 5/1/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

28 Terms

1
New cards

Regression

A predictive method used for numeric prediction where the dependent variable is a continuous variable.

2
New cards

Classification

A predictive method used for binary prediction where the dependent variable is a binary variable.

3
New cards

Holdout Method

A method to find the best model by splitting data into training and test sets, normally in a 70%30%70\%-30\% or 80%20%80\%-20\% ratio.

4
New cards

Cross Validation

A technique where data is split into equal-sized chunks (folds) to perform the holdout method multiple times and take the average of the results.

5
New cards

Dependent Variable

The outcome variable that the model is checking the relationship of and making predictions for.

6
New cards

Independent Variables

The variables used as predictors to estimate the value of or relationship with the dependent variable.

7
New cards

Simple Regression

A regression analysis using only one independent variable, represented by the formula y=β0+β1x+ϵy = \beta_0 + \beta_1x + \epsilon.

8
New cards

Multiple Regression

A regression analysis involving more than one independent variable, represented by the formula y=β0+β1x1+β2x2++ϵy = \beta_0 + \beta_1x_1 + \beta_2x_2 + \dots + \epsilon.

9
New cards

Interaction Term

A term added to a regression model to capture the joint effect of two or more variables, such as β3x1×x2\beta_3x_1 \times x_2.

10
New cards

R Square (R2R^2)

Also called the coefficient of determination, it represents the percentage of variation in the dependent variable explained by the independent variable.

11
New cards

Standard Error

A regression statistic similar to standard deviation that measures the error of a prediction.

12
New cards

Binary Variable

A variable that takes values of 11 or 00, often converted from a categorical variable and interpreted as a 'yes' or 'no' effect.

13
New cards

Extrapolation

A limitation of regression referring to the inability to accurately predict outcomes outside the range of variables used in fitting the model.

14
New cards

Logistic Regression

A regression technique for binary dependent variables that provides a probability as a prediction outcome; if the probability is above a threshold, the outcome is predicted as 11.

15
New cards

K-Nearest Neighbors (KNN)

A prediction method that uses the kk nearest data points by calculating distance to determine an outcome.

16
New cards

Accuracy

A performance metric calculated as the number of correct classifications divided by the number of total classifications.

17
New cards

Sensitivity

A classification metric measuring how many times the model predicted 11 among all true 1s1\text{s}, calculated as TPTP+FN\frac{TP}{TP + FN}.

18
New cards

Specificity

A classification metric measuring how many times the model predicted 00 among all true 0s0\text{s}, calculated as TNTN+FP\frac{TN}{TN + FP}.

19
New cards

Supervised Data Mining

The process of exploring patterns in data specifically with a target variable, such as in regression models.

20
New cards

Decision Tree

A model that splits data into groups (Regression Tree for numeric outcomes or Classification Tree for binary outcomes) using nodes and edges.

21
New cards

Overfitting Prevention

Setting parameters such as Max_depth or Min_samples_split to ensure a decision tree does not grow infinitely and fit noise in the data.

22
New cards

Ensemble Methods

Techniques that combine multiple prediction models so that their individual mistakes cancel out, working best when models are different and errors are independent.

23
New cards

Bagging (Bootstrap Aggregation)

An ensemble method where multiple identical models are created using different random samples of data, and their predictions are averaged.

24
New cards

Random Forest

An extension of bagging that creates multiple models using different random samples of data and a random set of independent variables.

25
New cards

Boosting

An ensemble method that sequentially creates models to fix the mistakes of previous ones, adapting to points with large errors.

26
New cards

Artificial Intelligence

A system capable of simulating human intelligence and thought processes, including pattern recognition, computer vision, and machine learning.

27
New cards

Machine Learning

A subset of AI focused on making predictions by finding patterns in examples without being given specific human instructions.

28
New cards

Deep Learning

A powerful machine learning technique using neural networks with multiple hidden layers, particularly useful for unstructured data like images and text.