GB Exam 2

0.0(0)
studied byStudied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/49

flashcard set

Earn XP

Description and Tags

UW Madison Gen Bus 307 - Spring 2024: Exam 2

Last updated 9:47 PM on 4/4/24
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

50 Terms

1
New cards

What is supervised learning

regroups methods to attempt to learn about distributions where the variables that can be split into categories

2
New cards

What are X variables

explanatory variables, predictors, regressors, independent

3
New cards

What are Y variables

outcomes, response variables, labels, dependent

4
New cards

what is a fitted regression equation

quantifies a linear relationship between two variables, y= intercept + slope * X

5
New cards

Log Liklihood, Equation? When is it used? Higher or lower?

higher is better, discrete y cases, P(Y test/ X test)

6
New cards

Mean Absolute Error, Equation? Lower or Higher?

lower, (1/n)E I Yi - Yhat I

7
New cards

Mean Absolute Percentage Error, Equation? Lower or Higher?

lower, (1/n)E I (Yi-Yhat)/(Yi) I

8
New cards

Root Mean Square Error, higher or lower, outliers, equation?

lower, greatly prenalized by outliers, sqr root((1/n)E(Yi-Yhat)²)

9
New cards

what is R²

how much accurately we can estimate the outcome variable with the explanatory variable, R²= 1-(SSErr-SSTot)

10
New cards

what is SSErr

sum of squared error from the regression, Represents the total amount of variation that we can’t explain with our regression, SSE trend line were plotted at the average = SST (Sum of Squares in Total)

11
New cards

how do you maximize R²

minimize the SSE loss

12
New cards

What is the range of R²

  • Closer to 1 = explain a lot of the variations in Y with our regression

  • Closer to 0 = can’t explain the variations in Y better with our regression

13
New cards

how do you interpret the slope

On average, an increase in study time by 1 hour is associated with an increase in grade by 5.2 points, everything else being equal.

14
New cards

how do you interpret the coefficient

On average, when a student spent 0 hours studying and skipped 0 classes, we expect their grade to be 57 points, everything else being equal.

15
New cards

what are p-value

how likely our data has no effect/relationship, low p-value = more confidence

16
New cards

What is OLS and what does it assume?

ordinary lease square regression, relationship between X & Y is linear, estimates are predictions are denoted with a hat, coefficient are obtained by minimizing the sum of squared residuals

17
New cards

what do you do when x=0 doesn’t make sense

could be outside range of data or unrealistic, or both then extrapolate

18
New cards

when are p-values significant?

Statistically significant at a confidence level if p-value < alpha

19
New cards

Generalized Linear Models

Extends the linear regression approach by allowing the distribution to be non-normal

20
New cards

for change of units when the variable is in log the change becomes ____? and if the varaible is standardized?

becomes % and standard deviations

21
New cards

how do you interpret R²

we can explain 24.5% of the variations in grades by looking at the variations in both the number of hours of study and in the number of class skipped

22
New cards

what is LINE?

linearity, independence, normality (errors), equal variance

23
New cards

how does GLM extend linear regression?

allows distribution to be non-normal, the mean Y to be function of a linear combination of Xs

24
New cards

what is the inverse of the mean function?

link function

25
New cards

the link identity what is it used for

linear relationships

26
New cards

what link log used for

when the mean needs to be positive

27
New cards

what link power used for

cured relationships

28
New cards

choosing the right distribution for continuous Y what is the normal distribution

a lot of averages, bell shaped, can be negative

29
New cards

choosing the right distribution for continuous Y what is the gamma distribution

a lot of times, potentiall skewed, always positive

30
New cards

choosing the right distribution for continuous Y what is the bernoulli distribution

probability of an event happening, binary, either 0 or 1

31
New cards

choosing the right distribution for continuous Y what is the poisson distribution

used for a lot of counts, positive integers

32
New cards

what is akaike information criterion

For cases with different number of variables across models, lower is better

33
New cards

what is overfitting?

the model is too flexible, great fit on training data, poor fit on new data

34
New cards

what is underfitting?

not flexible enough, poor fitting on training and new data

35
New cards

consequences of underfitting

bias, poor prediction performance, inability to capture the complexity of some patterns

36
New cards

what is regularization?

restricting the flexibility of a model

37
New cards

how do you regularize a dataset

estimate on a training set, adjust on a validation set, test prediction performance with a test set.

38
New cards

what do you do with too many variables?

use dimension reduction, solve overfitting issues, interpretation is still difficult, keep extra variables with variables selection

39
New cards

what is lasso?

Method where variable selection is performed through regularization. It shrinks the coefficients towards 0

40
New cards

what does 𝜆 control?

the strength of regularization, if 𝜆 is large the coefficient will be different from 0 𝜆 controls𝜆 controls

41
New cards

what are the drawback to lasso?

sensitive to x, issues with small datasets, scale sensitivity, loss of interpretability, bias

42
New cards

decision trees

create groups based on thresholds on X values

43
New cards

what are the advantages of decision trees?

don’t need to specify the relation between x and y, works for regression and classification, very easy to explain, mirrors decision making, graphs

44
New cards

what are the disadvantages of decision trees?

don’t have the same prediction accuracy as other methods

45
New cards
46
New cards
47
New cards
48
New cards
49
New cards
50
New cards