DM Exam 1

Last updated 2:49 PM on 3/12/25

26 Terms

1

Error(E)=

y_i-(mx_i+b)

2

Standard Error(SE)=

(\frac{1}{n-2}\cdot\frac{var(E)}{var(x)})^{\frac{1}{2}}

3

95\% Confidence Interval=

m \pm t(1-\frac{\alpha}{2},n-2)\cdot SE
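
As a rough sketch of cards 1 to 3, the snippet below computes the errors, the standard error of the slope, and the confidence interval for a small made-up dataset. The data, the function name `slope_confidence_interval`, and the critical value `t_crit` (read from a t-table for n-2 = 3 degrees of freedom) are assumptions for illustration. The standard error here includes the usual n-2 degrees-of-freedom correction.

```python
import math

def slope_confidence_interval(xs, ys, m, b, t_crit):
    """Return (SE, lower, upper) for the slope m of the line y = m*x + b."""
    n = len(xs)
    # Card 1: error of each point, E_i = y_i - (m*x_i + b)
    errors = [y - (m * x + b) for x, y in zip(xs, ys)]
    x_bar = sum(xs) / n
    sum_sq_err = sum(e ** 2 for e in errors)
    sum_sq_x = sum((x - x_bar) ** 2 for x in xs)
    # Card 2: standard error of the slope, with n - 2 degrees of freedom
    se = math.sqrt((sum_sq_err / (n - 2)) / sum_sq_x)
    # Card 3: m +/- t(1 - alpha/2, n - 2) * SE
    return se, m - t_crit * se, m + t_crit * se

# Hypothetical data; m and b are the least-squares fit, computed separately
xs = [1, 2, 3, 4, 5]
ys = [2.2, 3.9, 6.1, 8.0, 9.8]
m, b = 1.93, 0.21
t_crit = 3.182  # t(0.975, 3), assumed from a t-table

se, lower, upper = slope_confidence_interval(xs, ys, m, b, t_crit)
print(f"SE = {se:.4f}, 95% CI = ({lower:.3f}, {upper:.3f})")
```

If the printed interval excludes 0, the rule on the next card says to reject the null hypothesis.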

4

We reject the null hypothesis when…

…the p-value is less than or equal to 0.05 or the confidence interval doesn’t include 0.

5

This target has a…

…low bias, high variance.

6

This target has a…

…high bias, high variance.

7

This target has a…

…low bias, low variance.

8

This target has a…

…high bias, low variance.

9

Empirical Distribution Steps:

Take a random sample of length l with replacement.
Perform function.
Append to sample list.
Repeat n times.
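
The four steps above can be sketched as a short resampling loop. The function name `empirical_distribution`, the seed, and the sample data are assumptions for illustration; `l` and `n` follow the card's wording.

```python
import random
import statistics

def empirical_distribution(data, func, l, n, seed=0):
    """Resample `data` n times and collect func(sample) each time."""
    rng = random.Random(seed)  # seeded so the run is repeatable
    results = []
    for _ in range(n):                    # step 4: repeat n times
        sample = rng.choices(data, k=l)   # step 1: length-l sample, with replacement
        value = func(sample)              # step 2: perform function
        results.append(value)             # step 3: append to sample list
    return results

# Hypothetical data: distribution of the sample mean over 1000 resamples
data = [2, 4, 4, 5, 7, 9, 10, 12]
means = empirical_distribution(data, statistics.mean, l=len(data), n=1000)
print(len(means), min(means), max(means))
```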

10

Exploratory Data Analysis Steps:

Question
Investigate
Interpret
Ask more questions

11

Average Error=

\frac{1}{n}\sum E

12

Variance=

\frac{1}{n}\sum (x_i - avg(x))^2

13

Mean Absolute Error (MAE)=

\frac{1}{n}\sum |E|
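
A small example may help show how average error and MAE differ on the same errors. The error values and function names below are made up for illustration.

```python
def average_error(errors):
    """(1/n) * sum(E): positive and negative errors can cancel."""
    return sum(errors) / len(errors)

def mean_absolute_error(errors):
    """(1/n) * sum(|E|): every error counts as positive."""
    return sum(abs(e) for e in errors) / len(errors)

# Hypothetical errors that cancel exactly when signs are kept
errors = [3.0, -3.0, 1.0, -1.0]
avg = average_error(errors)        # 0.0
mae = mean_absolute_error(errors)  # 2.0
print(avg, mae)
```

The average error of 0 hides the fact that every prediction is off, while the MAE of 2 reports the typical size of a miss.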

14

\frac{d}{dx}|x|=

\frac{x}{|x|} (for x \neq 0)
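
As a quick check of this derivative, the sketch below compares x/|x| against a central finite-difference estimate. The helper names and the step size `h` are assumptions for illustration. The derivative is undefined at x = 0, so that case raises an error.

```python
def abs_derivative(x):
    """Derivative of |x|: +1 for x > 0, -1 for x < 0, undefined at 0."""
    if x == 0:
        raise ValueError("|x| is not differentiable at 0")
    return x / abs(x)

def numerical_derivative(f, x, h=1e-6):
    """Central finite-difference estimate of f'(x)."""
    return (f(x + h) - f(x - h)) / (2 * h)

for x in (3.5, -2.0):
    print(x, abs_derivative(x), numerical_derivative(abs, x))
```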

15

Gradient Descent=

\theta_{t+1}=\theta_t - \alpha \cdot \frac{d}{d\theta} L(y,\hat{y})
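
A minimal sketch of this update rule, applied to the line y = mx + b. It assumes mean squared error as the loss L, which the card does not specify. The learning rate, step count, and data are also made up for illustration.

```python
def gradient_descent(xs, ys, alpha=0.05, steps=2000):
    """Fit y = m*x + b by repeatedly stepping against the gradient of MSE."""
    m, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Errors under the current parameters: E_i = y_i - (m*x_i + b)
        errors = [y - (m * x + b) for x, y in zip(xs, ys)]
        # Partial derivatives of L = (1/n) * sum(E_i^2) with respect to m and b
        grad_m = (-2 / n) * sum(e * x for e, x in zip(errors, xs))
        grad_b = (-2 / n) * sum(errors)
        # theta_{t+1} = theta_t - alpha * dL/dtheta
        m -= alpha * grad_m
        b -= alpha * grad_b
    return m, b

# Hypothetical data lying exactly on y = 2x + 1
xs = [0, 1, 2, 3, 4]
ys = [1, 3, 5, 7, 9]
m, b = gradient_descent(xs, ys)
print(round(m, 4), round(b, 4))
```

Too large a value of `alpha` makes the updates overshoot and diverge, while too small a value needs many more steps.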

16

The difference between Gradient Descent and OLS is…

…Gradient Descent updates m and b many times, each step using the derivative (gradient) of the loss function, while OLS relies heavily on matrix operations and solves for m and b once.

17

MAE introduces bias by…

…treating all errors as positive, so positive and negative errors can't cancel: each error only adds to the total, never subtracts from it.

18

Stochastic Gradient Descent differs from Gradient Descent by…

…using only a small, random subset of the training data (a single point or a mini-batch) each iteration, instead of the full dataset.
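
For comparison with full gradient descent, here is a hedged sketch in which each step looks at only a random mini-batch. It assumes mean squared error as the loss. The batch size, learning rate, step count, seed, and data are made up for illustration.

```python
import random

def sgd_fit(xs, ys, alpha=0.01, steps=10000, batch_size=2, seed=0):
    """Fit y = m*x + b with mini-batch stochastic gradient descent on MSE."""
    rng = random.Random(seed)  # seeded so the run is repeatable
    m, b = 0.0, 0.0
    for _ in range(steps):
        # Pick a small random subset of the training points for this step
        batch = rng.sample(range(len(xs)), batch_size)
        errors = [ys[i] - (m * xs[i] + b) for i in batch]
        # Gradient of the mean squared error over the batch only
        grad_m = (-2 / batch_size) * sum(e * xs[i] for e, i in zip(errors, batch))
        grad_b = (-2 / batch_size) * sum(errors)
        m -= alpha * grad_m
        b -= alpha * grad_b
    return m, b

# Hypothetical data lying exactly on y = 2x + 1
xs = [0, 1, 2, 3, 4]
ys = [1, 3, 5, 7, 9]
m, b = sgd_fit(xs, ys)
print(round(m, 3), round(b, 3))
```

Each step is cheaper than a full pass over the data, at the cost of noisier updates.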

19

Correlation

A measure of the strength and direction of the linear relationship between two variables, i.e. how consistently one variable changes as the other changes.

20

Error

The difference between the real value and the predicted value.

21

A model with high correlation and a high error…

…has variables that are very related to each other, but the model itself is an inaccurate predictor.

22

A model with a low correlation and a low error…

…is an accurate predictor, but uses variables that are unrelated to each other.

23

P(x|y)=

\frac{P(y|x)P(x)}{P(y)}
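
A worked numeric example of this formula, using made-up probabilities. Here x is a hypothetical event "email is spam" and y is "email contains the word offer". P(y) is obtained by summing P(y|x)P(x) over both cases of x.

```python
def bayes(p_y_given_x, p_x, p_y):
    """P(x|y) = P(y|x) * P(x) / P(y)."""
    return p_y_given_x * p_x / p_y

# Hypothetical probabilities, chosen only for illustration
p_x = 0.2               # P(spam)
p_y_given_x = 0.6       # P(offer | spam)
p_y_given_not_x = 0.1   # P(offer | not spam)

# P(y) = P(y|x)P(x) + P(y|not x)P(not x)
p_y = p_y_given_x * p_x + p_y_given_not_x * (1 - p_x)

p_x_given_y = bayes(p_y_given_x, p_x, p_y)
print(round(p_y, 3), round(p_x_given_y, 3))
```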

24

Given P(x|y) and P(y), P(x)=

\sum_{y} P(x|y)P(y) (a single term P(x|y)P(y) is the joint probability P(x,y))

25

When given a large dataset, Gradient Descent can be faster than OLS because…

…OLS relies heavily on expensive matrix operations, while Gradient Descent relies on looping and the derivative of the loss function.

26

OLS achieves the line of best fit by…

…taking the derivative of the loss function with respect to each parameter, setting it to 0, and solving for the parameter.
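
For a single-variable line y = mx + b with squared-error loss, setting both derivatives to 0 gives the closed-form solution sketched below. The function name `ols_fit` and the data are assumptions for illustration.

```python
def ols_fit(xs, ys):
    """Closed-form least-squares fit of y = m*x + b."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # dL/dm = 0 gives m = sum((x - x_bar)(y - y_bar)) / sum((x - x_bar)^2)
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    m = s_xy / s_xx
    # dL/db = 0 gives b = y_bar - m * x_bar
    b = y_bar - m * x_bar
    return m, b

# Hypothetical data
m, b = ols_fit([1, 2, 3, 4, 5], [2.2, 3.9, 6.1, 8.0, 9.8])
print(round(m, 2), round(b, 2))
```

Unlike gradient descent, there is no loop over update steps: m and b are each computed once.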
