Lecture 3 - Degree of Freedom and Goodness of Fit


19 Terms

1

Degrees of Freedom:

The number of independent pieces of information used to evaluate an estimate of a parameter; equivalently, the number of unconstrained variables.

2

The number of degrees of freedom is equal to:

The number of independent data points used in the estimate minus the number of parameters in the estimation that have already been determined from the same data set.
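
As a quick illustration of this subtraction, here is a minimal sketch. The function name and the example numbers (10 measurements, a two-parameter straight-line model) are assumptions made for illustration and do not come from the lecture.

```python
def degrees_of_freedom(n_points: int, n_fitted_params: int) -> int:
    """Return k = N - d for N data points and d parameters fitted from them."""
    if n_fitted_params > n_points:
        raise ValueError("cannot fit more parameters than there are data points")
    return n_points - n_fitted_params


# Hypothetical example: 10 measurements fitted with a straight line,
# which has two fitted parameters (slope and intercept).
k = degrees_of_freedom(10, 2)
print(k)  # 8
```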

3

Purpose of the goodness-of-fit test

To test how well your data fit a given model.

4

Method for the goodness-of-fit test

Calculate a χ² (chi-squared) value from the differences between your observed data and the model.
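
The card does not spell out the formula. A minimal sketch, assuming the common uncertainty-weighted form χ² = Σ ((y_i − f_i) / σ_i)², where y_i are the observations, f_i the model predictions and σ_i the measurement uncertainties, might look like this. The function name and the sample numbers are hypothetical.

```python
def chi_squared(observed, predicted, sigma):
    """Sum of squared residuals, each divided by its measurement uncertainty."""
    return sum(((y - f) / s) ** 2 for y, f, s in zip(observed, predicted, sigma))


# Hypothetical data: every residual is exactly one uncertainty in size,
# so each point contributes about 1 to the total.
y_obs = [1.1, 1.9, 3.2]
y_model = [1.0, 2.0, 3.0]
y_err = [0.1, 0.1, 0.2]
print(chi_squared(y_obs, y_model, y_err))  # close to 3
```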

5

The chi-squared distribution is:

The theoretical framework that tells us how to evaluate the goodness of fit from the calculated χ² value; it is a well-known and well-behaved distribution.

6

What does chi-squared distribution depend on?

It depends on the number of degrees of freedom k, calculated as k = N − d, where N is the number of data points and d is the number of parameters determined from the data (e.g. d = 1).

7

What are the mean and standard deviation of the chi-squared distribution?

The mean is k and the standard deviation is √(2k).

8

What does the chi-squared distribution with k degrees of freedom describe?

The probability distribution of the sum of the squares of k independent standard Gaussian random variables.
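
This definition can be checked numerically. The sketch below draws k standard Gaussian numbers, sums their squares, repeats that many times, and compares the sample mean and standard deviation with k and √(2k). The function name, the choice k = 5, the number of trials and the seed are arbitrary assumptions.

```python
import math
import random


def simulate_chi2(k, n_trials, seed=0):
    """Return n_trials sums of squares of k independent standard Gaussian draws."""
    rng = random.Random(seed)
    return [sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(k)) for _ in range(n_trials)]


k = 5
samples = simulate_chi2(k, 20000)
mean = sum(samples) / len(samples)
std = math.sqrt(sum((s - mean) ** 2 for s in samples) / (len(samples) - 1))

print(f"sample mean {mean:.2f}, expected k = {k}")
print(f"sample std  {std:.2f}, expected sqrt(2k) = {math.sqrt(2 * k):.2f}")
```

With this many trials the sample values should land within a few hundredths of the expected ones, though the exact digits depend on the seed.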

9

What are the mean and standard deviation of the standard normal distribution?

A mean of 0 and a standard deviation of 1.

10

If the least-squares fit is good then the residuals…

follow a Gaussian distribution

11

least-squares fit:

Finding the best-fit model for your data points by minimizing the sum of the squared differences between the observed values (data) and the predicted values (model).
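
To make the idea concrete, here is a minimal sketch that assumes a straight-line model y = m·x + c and equal weights for all points. The lecture does not specify a model, so both choices, along with the function names and data, are illustrative only. The closed-form slope and intercept below are the standard unweighted least-squares solution for a line.

```python
def fit_line(xs, ys):
    """Unweighted least-squares slope and intercept for y = m*x + c."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def sum_squared_residuals(xs, ys, slope, intercept):
    """Sum of squared differences between the data and the line's predictions."""
    return sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))


# Hypothetical data scattered around y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.1, 4.9, 7.0]
m, c = fit_line(xs, ys)
best = sum_squared_residuals(xs, ys, m, c)
# Moving the slope away from the fitted value increases the sum.
worse = sum_squared_residuals(xs, ys, m + 0.1, c)
print(m, c, best, worse)
```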

12

Minimized χ² value (χ²_min)

The value of the χ² function at the point where the sum of squared differences is smallest, which identifies the best fit; that is, the value of χ² when the residuals are minimized.

The lower the value, the closer the data are to your model.

13

P-value:

The probability of obtaining a χ² value greater than or equal to the calculated χ²; computed as the integral of the χ² distribution from the calculated value upward.
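
One way to evaluate this tail integral without a statistics library is sketched below. It uses the series expansion of the regularized lower incomplete gamma function to get the cumulative probability and subtracts it from 1. The function name is hypothetical, and the series is only a reasonable choice for moderate χ² values; a library routine would be preferable for very small P-values.

```python
import math


def chi2_p_value(chi2_value, k):
    """Upper-tail probability P(chi^2 >= chi2_value) for k degrees of freedom."""
    if chi2_value <= 0:
        return 1.0
    a = k / 2.0
    z = chi2_value / 2.0
    # Series: gamma(a, z) = z^a e^(-z) * sum_n z^n / (a (a+1) ... (a+n))
    term = 1.0 / a
    total = term
    n = 1
    while abs(term) > 1e-16 * abs(total):
        term *= z / (a + n)
        total += term
        n += 1
    log_prefactor = a * math.log(z) - z - math.lgamma(a)
    cdf = total * math.exp(log_prefactor)
    return max(0.0, 1.0 - cdf)


# For k = 2 the tail probability has the closed form exp(-chi2 / 2).
print(chi2_p_value(2.0, 2), math.exp(-1.0))
```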

14

P-value indicates:

the probability of getting a χ² value as high or higher than χ²_min purely by chance if the model is correct

15

For a reasonable fit

  • χ²_min should be close to the mean of the χ² distribution

    • χ²_min ≈ k

  • for larger values of k, the χ² distribution becomes more symmetric

    • P(χ²_min ≈ k; k) ≈ 0.5

16

Accept the model

If χ²_min is within 2 standard deviations, ±2√(2k), of the mean k, the model fits well and there is no reason to reject it.

17

Model is questionable

If P(χ²_min; k) < 10⁻³, or χ²_min is more than 3 standard deviations (σ) away from the mean; either indicates a poor fit.

18

Model is rejected

If P(χ²_min; k) < 10⁻⁴, or χ²_min is more than 4 standard deviations (σ) away from the mean; the fit is poor enough that we reject the model.
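
The three verdicts can be combined into one rough check. The sketch below uses only the standard-deviation form of each rule, with σ = √(2k); the P-value thresholds are not evaluated here. The function name is hypothetical, and the "borderline" label for the 2σ to 3σ range is an assumption, since the cards give no verdict for that interval.

```python
import math


def classify_fit(chi2_min, k):
    """Apply the 2-, 3- and 4-sigma rules to chi2_min for k degrees of freedom."""
    sigma = math.sqrt(2 * k)
    n_sigma = abs(chi2_min - k) / sigma
    if n_sigma <= 2:
        return "accept"
    if n_sigma > 4:
        return "reject"
    if n_sigma > 3:
        return "questionable"
    # Assumed label: the cards do not cover the range between 2 and 3 sigma.
    return "borderline"


# Hypothetical example with k = 50, so sigma = sqrt(100) = 10.
for chi2_min in (55, 75, 85, 95):
    print(chi2_min, classify_fit(chi2_min, 50))
```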

19

Testing goodness of fit summary

If P(χ²_min; k) ≈ 1, check the uncertainty calculations for your measurements and the appropriateness of the fit function.