GB Exam 2

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/49

Earn XP

Description and Tags

UW Madison Gen Bus 307 - Spring 2024: Exam 2

Business

A-Level Statistics

Last updated 9:47 PM on 4/4/24

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

50 Terms

New cards

What is supervised learning

regroups methods to attempt to learn about distributions where the variables that can be split into categories

New cards

What are X variables

explanatory variables, predictors, regressors, independent

New cards

What are Y variables

outcomes, response variables, labels, dependent

New cards

what is a fitted regression equation

quantifies a linear relationship between two variables, y= intercept + slope * X

New cards

Log Liklihood, Equation? When is it used? Higher or lower?

higher is better, discrete y cases, P(Y test/ X test)

New cards

Mean Absolute Error, Equation? Lower or Higher?

lower, (1/n)E I Yi - Yhat I

New cards

Mean Absolute Percentage Error, Equation? Lower or Higher?

lower, (1/n)E I (Yi-Yhat)/(Yi) I

New cards

Root Mean Square Error, higher or lower, outliers, equation?

lower, greatly prenalized by outliers, sqr root((1/n)E(Yi-Yhat)²)

New cards

what is R²

how much accurately we can estimate the outcome variable with the explanatory variable, R²= 1-(SSErr-SSTot)

New cards

what is SSErr

sum of squared error from the regression, Represents the total amount of variation that we can’t explain with our regression, SSE trend line were plotted at the average = SST (Sum of Squares in Total)

New cards

how do you maximize R²

minimize the SSE loss

New cards

What is the range of R²

Closer to 1 = explain a lot of the variations in Y with our regression
Closer to 0 = can’t explain the variations in Y better with our regression

New cards

how do you interpret the slope

On average, an increase in study time by 1 hour is associated with an increase in grade by 5.2 points, everything else being equal.

New cards

how do you interpret the coefficient

On average, when a student spent 0 hours studying and skipped 0 classes, we expect their grade to be 57 points, everything else being equal.

New cards

what are p-value

how likely our data has no effect/relationship, low p-value = more confidence

New cards

What is OLS and what does it assume?

ordinary lease square regression, relationship between X & Y is linear, estimates are predictions are denoted with a hat, coefficient are obtained by minimizing the sum of squared residuals

New cards

what do you do when x=0 doesn’t make sense

could be outside range of data or unrealistic, or both then extrapolate

New cards

when are p-values significant?

Statistically significant at a confidence level if p-value < alpha

New cards

Generalized Linear Models

Extends the linear regression approach by allowing the distribution to be non-normal

New cards

for change of units when the variable is in log the change becomes ____? and if the varaible is standardized?

becomes % and standard deviations

New cards

how do you interpret R²

we can explain 24.5% of the variations in grades by looking at the variations in both the number of hours of study and in the number of class skipped

New cards

what is LINE?

linearity, independence, normality (errors), equal variance

New cards

how does GLM extend linear regression?

allows distribution to be non-normal, the mean Y to be function of a linear combination of Xs

New cards

what is the inverse of the mean function?

link function

New cards

the link identity what is it used for

linear relationships

New cards

what link log used for

when the mean needs to be positive

New cards

what link power used for

cured relationships

New cards

choosing the right distribution for continuous Y what is the normal distribution

a lot of averages, bell shaped, can be negative

New cards

choosing the right distribution for continuous Y what is the gamma distribution

a lot of times, potentiall skewed, always positive

New cards

choosing the right distribution for continuous Y what is the bernoulli distribution

probability of an event happening, binary, either 0 or 1

New cards

choosing the right distribution for continuous Y what is the poisson distribution

used for a lot of counts, positive integers

New cards

what is akaike information criterion

For cases with different number of variables across models, lower is better

New cards

what is overfitting?

the model is too flexible, great fit on training data, poor fit on new data

New cards

what is underfitting?

not flexible enough, poor fitting on training and new data

New cards

consequences of underfitting

bias, poor prediction performance, inability to capture the complexity of some patterns

New cards

what is regularization?

restricting the flexibility of a model

New cards

how do you regularize a dataset

estimate on a training set, adjust on a validation set, test prediction performance with a test set.

New cards

what do you do with too many variables?

use dimension reduction, solve overfitting issues, interpretation is still difficult, keep extra variables with variables selection

New cards

what is lasso?

Method where variable selection is performed through regularization. It shrinks the coefficients towards 0

New cards

what does 𝜆 control?

the strength of regularization, if 𝜆 is large the coefficient will be different from 0 𝜆 controls𝜆 controls

New cards

what are the drawback to lasso?

sensitive to x, issues with small datasets, scale sensitivity, loss of interpretability, bias

New cards

decision trees

create groups based on thresholds on X values

New cards

what are the advantages of decision trees?

don’t need to specify the relation between x and y, works for regression and classification, very easy to explain, mirrors decision making, graphs

New cards

what are the disadvantages of decision trees?

don’t have the same prediction accuracy as other methods

New cards

Explore top notes

2.5 Organizational (corporate) culture

Updated 1290d ago

Note

Conquest and Its Impact (IB)

Updated 369d ago

Note

french 1

Updated 1257d ago

Note

Movement in and out of cells (2.1-2.2)

Updated 1310d ago

Note

Chapter 5: Nucleic Acid Extraction

Updated 1095d ago

Note

Chapter Fourteen: Schizophrenia and Related Disorders

Updated 1093d ago

Note

Biology 120 Notes (Part 9) Phospholipids, Plasma Membrane, Diffusion, and Osmosis

Updated 1245d ago

Note

Disabilities

Updated 337d ago

Note

2.5 Organizational (corporate) culture

Updated 1290d ago

Note

Conquest and Its Impact (IB)

Updated 369d ago

Note

french 1

Updated 1257d ago

Note

Movement in and out of cells (2.1-2.2)

Updated 1310d ago

Note

Chapter 5: Nucleic Acid Extraction

Updated 1095d ago

Note

Chapter Fourteen: Schizophrenia and Related Disorders

Updated 1093d ago

Note

Biology 120 Notes (Part 9) Phospholipids, Plasma Membrane, Diffusion, and Osmosis

Updated 1245d ago

Note

Disabilities

Updated 337d ago

Note

Explore top flashcards

Bio Unit 4-5

Updated 819d ago

Flashcards (81)

Spanish School Subjects

Updated 1193d ago

Flashcards (37)

abeka history 10 section 5.1

Updated 890d ago

Flashcards (23)

Digestive system

Updated 850d ago

Flashcards (71)

Books and films in our life

Flashcards (119)

Flashcards (79)

Flashcards (41)

Biology Unit 10 - Ecology and Climate Change

Updated 645d ago

Flashcards (65)

Bio Unit 4-5

Updated 819d ago

Flashcards (81)

Spanish School Subjects

Updated 1193d ago

Flashcards (37)

abeka history 10 section 5.1

Updated 890d ago

Flashcards (23)

Digestive system

Updated 850d ago

Flashcards (71)

Books and films in our life

Flashcards (119)

Flashcards (79)

Flashcards (41)

Biology Unit 10 - Ecology and Climate Change

Updated 645d ago

Flashcards (65)