Sum of squared residuals
We estimate the linear regression coefficients by minimizing the _____
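For reference, this is the least-squares criterion being minimized (standard definition, written for simple linear regression):

```latex
\mathrm{RSS} = \sum_{i=1}^{n} (y_i - \hat{y}_i)^2
            = \sum_{i=1}^{n} \left( y_i - \hat{\beta}_0 - \hat{\beta}_1 x_i \right)^2
```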
RSE (Residual standard error)
The standard deviation of the error term, and hence the accuracy of the model, is measured using the ____
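For simple linear regression the RSE is the square root of the RSS divided by its degrees of freedom (with p predictors the denominator becomes n - p - 1):

```latex
\mathrm{RSE} = \sqrt{\frac{\mathrm{RSS}}{n - 2}}
```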
P-Value
The _____ can be used to reject the null hypothesis when it is less than 0.05
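A minimal sketch of where these p-values come from, using statsmodels on simulated data (the intercept 2.0 and slope 3.0 are arbitrary):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
x = rng.normal(size=100)
y = 2.0 + 3.0 * x + rng.normal(size=100)

# Each coefficient's t-test yields a p-value; reject H0: beta = 0 when p < 0.05.
fit = sm.OLS(y, sm.add_constant(x)).fit()
print(fit.pvalues)
```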
MSE
The ____ is reported in squared units of Y
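A small numpy sketch contrasting the error measures on hypothetical values; the MSE is in squared units of Y, while the RSE is in the units of Y:

```python
import numpy as np

# Hypothetical observed values and model predictions.
y = np.array([3.0, 5.0, 7.5, 9.0])
y_hat = np.array([2.8, 5.3, 7.1, 9.4])

rss = np.sum((y - y_hat) ** 2)       # residual sum of squares
mse = rss / len(y)                   # MSE: squared units of Y
rse = np.sqrt(rss / (len(y) - 2))    # RSE (simple regression): units of Y
print(f"RSS={rss:.3f}, MSE={mse:.3f}, RSE={rse:.3f}")
```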
K-Nearest Neighbor
The _____ approach is a non-parametric method that makes a prediction based on the closest training observations
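A minimal scikit-learn sketch; the iris data and K = 5 are illustrative choices only:

```python
from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)

# Each prediction is based on the K = 5 closest training observations.
knn = KNeighborsClassifier(n_neighbors=5)
knn.fit(X, y)
print(knn.predict(X[:3]))
```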
Cross validation (either LOOCV OR K-Fold)
Performing _____ ensures that every observation is used in the test set exactly once
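A sketch of 5-fold cross-validation in scikit-learn (the model and data are illustrative):

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold, cross_val_score

X, y = load_iris(return_X_y=True)

# Five folds: every observation is held out for testing exactly once.
cv = KFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=cv)
print(scores.mean())
```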
Decision Boundary (Discriminant function)
Linear discriminant analysis uses a _____ to separate observations into distinct classes
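A minimal LDA sketch in scikit-learn (iris again, purely for illustration):

```python
from sklearn.datasets import load_iris
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

X, y = load_iris(return_X_y=True)

# LDA fits linear decision boundaries that separate the classes.
lda = LinearDiscriminantAnalysis()
lda.fit(X, y)
print(lda.predict(X[:3]))
```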
Prior Probability
The ______ measures the probability that a randomly chosen observation belongs to a given class
Posterior Probability
Refers to updated beliefs or probabilities after new data has been incorporated through Bayes' Theorem
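The prior and posterior cards connect through Bayes' theorem; in its usual classification form, with prior π_k and class-conditional density f_k(x):

```latex
\Pr(Y = k \mid X = x) = \frac{\pi_k f_k(x)}{\sum_{l=1}^{K} \pi_l f_l(x)}
```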
Best Subset Selection
Performing ______ to sub-select predictors requires the user to check every possible combination of predictors (2^p models).
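A quick way to see the 2^p blow-up, using hypothetical predictor names:

```python
from itertools import combinations

# Best subset selection must consider every one of the 2**p possible
# subsets of predictors, including the empty (null) model.
predictors = ["TV", "radio", "newspaper"]
subsets = [c for r in range(len(predictors) + 1)
           for c in combinations(predictors, r)]
print(len(subsets))  # 2**3 = 8
```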
Principal Component Analysis (PCA)
The ______ is an unsupervised method used to transform the p predictors into M linear combinations of them (M ≤ p).
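A minimal PCA sketch in scikit-learn, reducing p = 4 predictors to M = 2 components:

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA

X, _ = load_iris(return_X_y=True)  # unsupervised: the response is not used

# Project the p = 4 predictors onto M = 2 linear combinations of them.
pca = PCA(n_components=2)
Z = pca.fit_transform(X)
print(Z.shape, pca.explained_variance_ratio_)
```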
Knot
A _____ is a location where the coefficients, and hence the fitted function, are allowed to change.
Regression spline
The _______ is a combination of step functions and polynomial regression.
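A sketch of the truncated power basis behind a cubic regression spline; the knot locations 3.0 and 7.0 are arbitrary:

```python
import numpy as np

def cubic_spline_basis(x, knots):
    """Truncated power basis: x, x^2, x^3, plus (x - knot)^3_+ per knot."""
    cols = [x, x**2, x**3]
    cols += [np.maximum(x - k, 0.0) ** 3 for k in knots]
    return np.column_stack(cols)

x = np.linspace(0.0, 10.0, 50)
X = cubic_spline_basis(x, knots=[3.0, 7.0])  # coefficients change at each knot
print(X.shape)  # (50, 5): 3 polynomial columns + 1 column per knot
```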
Random Forest
The Decision Tree model can be improved upon by using bagging and sub-selecting predictors at each split; the result is typically called a _______.
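A minimal random-forest sketch; max_features is the per-split predictor sub-selection that distinguishes it from plain bagging:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

X, y = load_iris(return_X_y=True)

# Bagged trees that also sub-select predictors at each split (max_features).
rf = RandomForestClassifier(n_estimators=100, max_features="sqrt", random_state=0)
rf.fit(X, y)
print(rf.score(X, y))
```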
Pure Nodes
The goal of splits in trees is to produce homogeneous child nodes, often called ______.
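One common purity measure is the Gini index, which is zero for a pure node; a small sketch:

```python
import numpy as np

def gini(class_counts):
    """Gini impurity of a node; 0.0 means the node is pure."""
    p = np.asarray(class_counts) / np.sum(class_counts)
    return 1.0 - np.sum(p ** 2)

print(gini([10, 0]))  # 0.0 -> pure node
print(gini([5, 5]))   # 0.5 -> maximally mixed two-class node
```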
We can relax the additive assumption of linear regression by adding interaction terms.
True
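A small statsmodels sketch of an interaction term; the simulated coefficients are arbitrary:

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(0)
df = pd.DataFrame({"x1": rng.normal(size=100), "x2": rng.normal(size=100)})
df["y"] = 1 + 2 * df.x1 + 3 * df.x2 + 4 * df.x1 * df.x2 + rng.normal(size=100)

# "x1 * x2" expands to x1 + x2 + x1:x2 (main effects plus interaction).
fit = smf.ols("y ~ x1 * x2", data=df).fit()
print(fit.params)
```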
Linear regression is applicable to datasets where p is larger than n.
False
Naive Bayes classifiers assume that all predictors are independent within each class
True
Classifiers typically return a probability that a given observation belongs to class k.
True
It is expected that the training error rate is lower than the testing error rate.
True
A confusion matrix is used to assess accuracy for classification and regression models.
False
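For classification, a confusion matrix tabulates predicted versus true labels; a tiny sketch with hypothetical labels:

```python
from sklearn.metrics import confusion_matrix

y_true = [0, 0, 1, 1, 1]
y_pred = [0, 1, 1, 1, 0]

# Rows are true classes, columns are predicted classes.
print(confusion_matrix(y_true, y_pred))
```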
It is good practice to prevent data leakage by reusing the same sample in both training and testing.
False
Both Ridge Regression and the Lasso use a shrinkage penalty to regularize the coefficients, reducing the impact of the predictors on the model.
True
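A rough illustration of the shrinkage effect with scikit-learn; the alpha values are arbitrary:

```python
import numpy as np
from sklearn.datasets import load_diabetes
from sklearn.linear_model import Lasso, LinearRegression, Ridge

X, y = load_diabetes(return_X_y=True)

# A larger alpha means a stronger shrinkage penalty and smaller coefficients;
# the lasso can shrink some coefficients exactly to zero.
ols = LinearRegression().fit(X, y)
ridge = Ridge(alpha=10.0).fit(X, y)
lasso = Lasso(alpha=1.0).fit(X, y)
print(np.abs(ols.coef_).sum(), np.abs(ridge.coef_).sum(), np.abs(lasso.coef_).sum())
```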
Forward and Backward Stepwise Selection are guaranteed to find the best possible combination of predictors.
False
Cross Validation is often the best method for finding the optimal parameters.
True
Basis Functions are fixed, known functions b_k(X) that transform X, allowing us to use statistical tools like Standard Errors and Coefficient estimates.
True
For splines, it is best practice to use fewer knots to increase flexibility in regions where it may be necessary.
False
Generalized Additive Models allow us to use more than one predictor in our model.
True
Ridge Regression
Smoothing Splines
Linear Regression
Lasso Regression
Logistic Regression
Polynomial Regression
Step Functions
Regression Splines