Models, Errors and the Normal Distribution

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/16

There's no tags or description

Looks like no tags are added yet.

Last updated 7:06 PM on 12/19/25

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

17 Terms

New cards

Formula

Error =

Error = Data - Model

𝑒𝑟𝑟𝑜𝑟_𝑖 = 𝑦_𝑖 − ŷ_𝑖

New cards

Mode as measuring error

ŷ_𝑖

The most common value
A very simplified measure

New cards

Mean as measuring error

Summing all data points then dividing by number of data points - Ȳ represents the mean.
Looks at the average value.
Poor measure of error - allows values to cancel each other out.

<ul><li><p>Summing all data points then dividing by number of data points - Ȳ represents the mean.</p></li><li><p>Looks at the average value.</p></li><li><p>Poor measure of error - allows values to cancel each other out.</p></li></ul><p></p>

New cards

Squared error as measuring error

𝑒𝑟𝑟𝑜𝑟_𝑖 = (𝑦_𝑖 − ŷ_𝑖)²

Can only be positive because you're squaring it.

New cards

Sum Squared Error (SSE) as measuring error

Sums all of the data points

New cards

Mean Squared Error (MSE) as measuring error

Often seen being used in model fitting literature.
Sum of all the squares divided by number of data points.
Problem with units being squared.

<ul><li><p>Often seen being used in model fitting literature.</p></li><li><p>Sum of all the squares divided by number of data points.</p></li><li><p>Problem with units being squared.</p></li></ul><p></p>

New cards

Root Mean Squared Error (RMSE) as measuring error

Unit generated makes the most sense in measuring the model error.

New cards

Model A: Judge this model

Equation: ŷ_𝑖 = B̂_𝑖 ∗ 𝑎𝑔𝑒_𝑖
Takes into account that there actually are differences across age.
Model for each data point now depends on age of the specific child, multiplied by a parameter (the slope of the blue line).
Problem that this model goes through 0 - no child born at 0cm.

<ul><li><p><strong><span>Equation</span></strong><span>: ŷ</span><sub>𝑖</sub> = <span>B̂</span><sub>𝑖</sub> ∗ 𝑎𝑔𝑒<sub>𝑖 </sub></p></li><li><p>Takes into account that there actually are differences across age.</p></li><li><p>Model for each data point now depends on age of the specific child, multiplied by a parameter (the slope of the blue line).</p></li><li><p>Problem that this model goes through 0 - no child born at 0cm.</p></li></ul><p></p>

New cards

Model B: Judge this model

Equation: ŷ_𝑖 = B̂₀+ B̂₁ ∗ 𝑎𝑔𝑒_𝑖
Adds a constant - good job at explaining data, but could be split into other factors.

<ul><li><p><strong><span>Equation</span></strong><span>: ŷ</span><sub>𝑖</sub> = <span>B̂</span><sub>0 </sub>+ <span>B̂</span><sub>1</sub> ∗ 𝑎𝑔𝑒<sub>𝑖 </sub></p></li><li><p>Adds a constant - good job at explaining data, but could be split into other factors.</p></li></ul><p></p>

New cards

Model C: Judge this model

Equation: ŷ_𝑖 = B̂₀+ B̂₁ ∗ 𝑎𝑔𝑒_𝑖
Equation: ŷ_j = B̂₂+ B̂₃ ∗ 𝑎𝑔𝑒_j
Two separate models.
Estimating slope and adding a constant for both male and female.

<ul><li><p><strong><span>Equation</span></strong><span>: ŷ</span><sub>𝑖</sub> = <span>B̂</span><sub>0 </sub>+ <span>B̂</span><sub>1</sub> ∗ 𝑎𝑔𝑒<sub>𝑖 </sub></p></li><li><p><strong><span>Equation</span></strong><span>: ŷ</span><sub>j</sub> = <span>B̂</span><sub>2 </sub>+ <span>B̂</span><sub>3</sub> ∗ 𝑎𝑔𝑒<sub>j</sub></p></li><li><p>Two separate models.</p></li><li><p>Estimating slope and adding a constant for both male and female.</p></li></ul><p></p>

New cards

Model D: Judge this model

ŷ_𝑖 = B̂₀+ B̂₁ ∗ 𝑎𝑔𝑒_𝑖
ŷ_j = B̂₂+ B̂₃ ∗ 𝑎𝑔𝑒_j
ŷ_𝑖 = Ȳ

<ul><li><p><span>ŷ</span><sub>𝑖</sub> = <span>B̂</span><sub>0 </sub>+ <span>B̂</span><sub>1</sub> ∗ 𝑎𝑔𝑒<sub>𝑖 </sub></p></li><li><p><span>ŷ</span><sub>j</sub> = <span>B̂</span><sub>2 </sub>+ <span>B̂</span><sub>3</sub> ∗ 𝑎𝑔𝑒<sub>j</sub></p></li><li><p><span>ŷ</span><sub>𝑖</sub> = <span>Ȳ</span></p></li></ul><p></p>

New cards

Equation for a normal distribution

𝑁(𝜇, 𝜎²)

𝜇 - specifies where centre of the distribution is placed.

𝜎² - how wide the distribution is.

New cards

Likelihood equation

𝑃(𝑥|𝜇, 𝜎²)
- Probability of obtaining a certain value - $x$ - given the two parameters in the model.
- Circled point closer to the middle - highly likely, compared to lower circled point.
- Can take all data points and multiply probabilities together to get likelihood of the data set - 𝑃(𝑥₁|𝜇, 𝜎²) * 𝑃(𝑥₂|𝜇, 𝜎²) * 𝑃(𝑥₃|𝜇, 𝜎²)