Lecture 3 - Degree of Freedom and Goodness of Fit

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/18

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

19 Terms

1
New cards

Degrees of Freedom:

number of independent pieces of information that are used to evaluate an estimate of a parameterā€¦ number of unconstrained variables

2
New cards

Degree of freedom is equal to:

Number of independent data points used in the estimate minus the number of parameters used in the estimation that have already been determined from the data set

3
New cards

Purpose of the goodness-of-fit

to test how well your data fits a certain model

4
New cards

Method for the goodness-of-fit

You calculate a Ļ‡2(chi^2) value from the difference between your observed data and the model

5
New cards

the chi-squared distribution is:

The theoretical framework that tells us how to evaluate the goodness-of-fit based on the calculated Ļ‡2 valueā€¦ well-known and well-behaved distribution

6
New cards

What does chi-squared distribution depend on?

depends on the degrees of freedom (k) which is calculated by N-d (number of parameters, 1)

7
New cards

What is the mean and standard deviation of the chi-squared distribution?

the mean value equal to k and standard deviation equal to sqrt(2k)

8
New cards

What is the probability distribution of chi-squared distribution with k degrees of freedom?

describes the probability distribution of the sum of squares of k independent standard Gaussian random variables

9
New cards

What is the normal distribution mean and standard deviation?

mean of 0 and standard deviation of 1

10
New cards

If the least-squares fit is good then the residualsā€¦

follow a gaussian distribution

11
New cards

least-squares fit:

trying to find the best fit model to your data points minimizing the difference between observed values (data) and predicted values (model)

12
New cards

Minimized Ļ‡Ā² value (Ļ‡Ā²_min)

refers to the value of Ļ‡Ā² function at the point where this sum of squared differences is smallest, indicating the best fitā€¦ value of Ļ‡Ā² when the residuals are minimized

the lower the value, the closer to your model the data is

13
New cards

P-value:

probability of obtaining a Ļ‡Ā² value greater than or equal to calculated Ļ‡Ā²ā€¦ computed using intergral of Ļ‡Ā² distribution

14
New cards

P-value indicates:

the probability of getting a Ļ‡Ā² value as high or higher than Ļ‡Ā²_min purely by chance if the model is correct

15
New cards

For a reasonable fit

  • Ļ‡Ā²_min should be close to the mean of Ļ‡Ā² distribution

    • Ļ‡Ā²_min will be approximately equal to k

  • for larger values of k, Ļ‡Ā² distribution becomes more symmetric

    • P(Ļ‡Ā²_min ā‰ˆ k; k)

16
New cards

Accept the model

Ļ‡Ā²_min is within Ā±2āˆš(2k) of the mean k, it suggests the model fits well, and there is no reason to reject it

17
New cards

Model is questionable

If P(Ļ‡Ā²_min; k) < 10ā»Ā³, meaning the fit is poor, or Ļ‡Ā²_min is more than 3 standard deviations (Ļƒ) away from the mean

18
New cards

Model is rejected

If P(Ļ‡Ā²_min; k) < 10ā»ā“, or Ļ‡Ā²_min is more than 4 standard deviations (Ļƒ) away from the meanā€¦ indicates poor fit which is why we reject the model

19
New cards

Testing goodness of fit summary

if P(Ļ‡Ā²_min; k) = 1 check the uncertainty calculations on your measurements and appropriateness of fit function

<p>if P(Ļ‡Ā²_min; k) = 1 check the uncertainty calculations on your measurements and appropriateness of fit function</p>