GW Blok 6 Seminar 3.1 Statistical inference for one or two samples

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/23

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

24 Terms

1
New cards

What is the null hypothesis (H₀)?

A statement that there is no effect, no difference, or status quo. It always includes equality (e.g., =, ≤, ≥).

2
New cards

What is the alternative hypothesis (Ha)?

A statement that there is an effect or difference—what you are trying to find evidence for.

3
New cards

Are hypotheses about the sample or the population?

About the population parameters, not the sample statistics.

4
New cards

What does the significance level (α) represent?

  • The probability of rejecting the null hypothesis when it is actually true (Type I error). Commonly set at 0.05.

  • Het significantieniveau  is de grens waaronder de nulhypothese wordt verworpen. Het stelt de maximale kans op een type I-fout in, dit is het onterecht verwerpen van een ware H0.

  • Alfa is de critical values, bepaalt de grenzen van region of rejection.

5
New cards

Why can we never “prove” H₀ or Ha?

Because hypothesis testing works by assessing the likelihood of data assuming H₀ is true; it doesn’t prove, only provides evidence.

6
New cards

What is a type I error?

  • Rejecting the null hypothesis when it is actually true → false positive

  • Example (birth weight): Saying that the average birth weight ≠ 2000g, when in fact it is 2000g.

  • Probability: This error happens with probability α (e.g., 5% if significance level is 0.05).

  • Controlled by you — you set α before testing (commonly 0.05).

  • Interpretation: The maximum risk you're willing to take for claiming a false effect.

  • Thus, any conclusion based on statistical reasoning is uncertain, but with statistics it is possible to control for this uncertainty by means of the user-specified value α, which is called the ‘level of significance α’.

7
New cards

What is a type II error?

  • Not rejecting the null hypothesis when the alternative is true → false negative

  • Example: Saying that the average birth weight = 2000g, when in fact it isn’t.

  • Probability: Happens with probability β, which depends on factors like sample size and effect size.

  • Not directly controlled, but can be minimized with larger sample sizes or stronger effects.

  • Interpretation: Risk of missing a real effect.

8
New cards

What is the power of a test?

  • Probability of correctly rejecting the null hypothesis when the alternative is true.

  • Formula: Power = 1 – β

  • Goal: We usually aim for power ≥ 80% (β ≤ 0.2)

  • Increase power by:

    • Larger sample size (n)

    • Larger effect size (difference between μ and μ₀)

    • Lower data variability (standard deviation)

    • Higher α (but increases Type I error)

9
New cards

Summary tables of type of errors and power

knowt flashcard image
10
New cards

What is the relation between α and β?

  • Lower α → Higher β

  • Lower β (higher power) → May require larger α or bigger sample size

Think of it like a court case:

  • H₀ = Innocent

  • Hₐ = Guilty

  • Type I Error = convicting an innocent person → serious → keep α low

  • Type II Error = letting a guilty person go → power matters!

11
New cards

What is a p-value?

  • The p-value is the probability—assuming the null hypothesis is true—of obtaining a test statistic as extreme or more extreme than the one observed.

  • It reflects how compatible your sample data is with the null hypothesis.

  • If the null value (e.g. 2000) falls outside the (1 − α)% confidence interval:

    • ⇒ p-value ≤ α

    • Er is een significant verschil ⇒ Reject H₀

  • If it falls inside the CI:

    • ⇒ p-value > α

    • Er is geen significant verschil ⇒ Do not reject H₀

<ul><li><p>The p-value is the probability—assuming the null hypothesis is true—of obtaining a test statistic as extreme or more extreme than the one observed.</p></li><li><p>It reflects how compatible your sample data is with the null hypothesis.</p></li><li><p>If the null value (e.g. 2000) <strong>falls outside</strong> the (1 − α)% confidence interval:</p><ul><li><p>⇒ p-value ≤ α</p></li><li><p>Er is een significant verschil ⇒ <strong>Reject H₀</strong></p></li></ul></li></ul><ul><li><p>If it <strong>falls inside</strong> the CI:</p><ul><li><p>⇒ p-value &gt; α</p></li><li><p>Er is geen significant verschil ⇒ <strong>Do not reject H₀</strong></p></li></ul></li></ul><p></p>
12
New cards

How does the sample size affects the p-value

  • Large sample size: even small differences can lead to small p-values (i.e., statistically significant)

  • Small sample size: may not detect meaningful effects due to high variability

13
New cards

What is a one-sample problem?

  • Only one sample is used (no comparison between two groups)

  • We are comparing the sample against a fixed known/hypothesized value

14
New cards

What are the three methods of hypothesis testing?

  1. Confidence interval approach

  2. Test statistic (Z or T) approach

  3. P-value approach

15
New cards
<p>What is the confidence interval approach of hypothesis testing?</p>

What is the confidence interval approach of hypothesis testing?

  • If H0 is covered by the 95% confidence interval, it is likely that the value of 5.5 is the population value with 95% confidence

  • If H0 is NOT covered by the 95% confidence interval, it is unlikely that the value of 5.5 is the population value with 95% confidence

  • The region of acceptance: the interval between the upper and lower values of the CI

    • If sample average falls in the region of acceptance → significant difference → accept H0

  • The region of rejection: the interval outside the upper and lower values of the CI

    • If sample average falls in the region of rejection → no significant difference → reject H0

<ul><li><p>If H0 is covered by the 95% confidence interval, it is likely that the value of 5.5 is the population value with 95% confidence</p></li><li><p>If H0 is NOT covered by the 95% confidence interval, it is unlikely that the value of 5.5 is the population value with 95% confidence</p></li><li><p><strong>The region of acceptance:</strong> the interval between the upper and lower values of the CI</p><ul><li><p>If sample average falls in the region of acceptance → significant difference → accept H0</p></li></ul></li><li><p><strong>The region of rejection: </strong>the interval outside the upper and lower values of the CI</p><ul><li><p>If sample average falls in the region of rejection → no significant difference → reject H0</p></li></ul></li></ul><p></p>
16
New cards

What is the test statistic approach of hypothesis testing?

  • Use Z-statistic if (sigma2 is known) and T-statistic if (sigma2 is unknown)

    • Known if n > 30

    • Unknown if < 30

  • Region of acceptance: the test statistic lies between the critical values → accept H0

    • -1.96 < Z-statistic < 1.96

    • -2.201 < T-statistic < 2.201

      • Critical t-statistic is not standard → given on exam

  • Region of rejection: the test statistic lies outside the critical values → reject H0

    • Z-statistic ≤ -1.96 or Z-statistic ≥ 1.96

    • T-statistic ≤ -2.201 or T-statistic ≥ 2.201

      • Critical t-statistic is not standard → given on exam

<ul><li><p>Use Z-statistic if (sigma<sup>2</sup> is known) and T-statistic if (sigma<sup>2</sup> is unknown)</p><ul><li><p>Known if n &gt; 30</p></li><li><p>Unknown if &lt; 30</p></li></ul></li><li><p><strong>Region of acceptance: </strong>the test statistic lies between the critical values → accept H0</p><ul><li><p>-1.96 &lt; Z-statistic &lt; 1.96</p></li><li><p>-2.201 &lt; T-statistic &lt; 2.201</p><ul><li><p>Critical t-statistic is not standard → given on exam</p></li></ul></li></ul></li><li><p><strong>Region of rejection: </strong>the test statistic lies outside the critical values → reject H0</p><ul><li><p>Z-statistic ≤ -1.96 or Z-statistic ≥ 1.96</p></li><li><p>T-statistic ≤ -2.201 or T-statistic ≥ 2.201</p><ul><li><p>Critical t-statistic is not standard → given on exam</p><p></p></li></ul></li></ul></li></ul><p></p>
17
New cards

What is the p-value approach of hypothesis testing?

  • If the null value (e.g. 2000) falls outside the (1 − α)% confidence interval:

    • ⇒ p-value ≤ α

    • Er is een significant verschil ⇒ Reject H₀

  • If it falls inside the CI:

    • ⇒ p-value > α

    • Er is geen significant verschil ⇒ Do not reject H₀

18
New cards

Summary statistics for groups comparison

knowt flashcard image
19
New cards

What is the difference between a Z- and T-distribution?

<p></p>
20
New cards

What are the different formulas for z-score if the observation is in the population and in a sample? And when the sample mean is known and unknown?

  • Expected score for one individual on the exam (first two)

  • Expected score for a group on the exam (last two)

<ul><li><p>Expected score for one individual on the exam (first two)</p></li><li><p>Expected score for a group on the exam (last two)</p></li></ul><p></p>
21
New cards

What are the 6 steps of hypothesis testing?

  1. State the hypotheses

    • H0: μ variable 1 = μ variable 2 or μ variable 1 = specific value

    • Ha: μ variable 1 μ variable 2 or μ variable 1 specific value

  2. Choose a level of significance

    • α = 5% (1-α = 95% is the confidence level)

  3. Critical value

    • Critical t-values: +-(1-α /2), +- (n-1)

      • This value is given on the exam

    • Under the H0 and unknown σ, the t-score follows a t-distribution with n-1 degrees of freedom

    • Under the H0 and known s, the z-score follows a z-distribution

  4. Collect and summarize the data

    • n, x bar, and standard error of the estimate (x bar)

  5. Compute the test statistic

    • T-statistic or z-statistic

  6. Take a decision

    • Compare t-value/z-value with critical value

    • Compare p-value with significance level (α)

<ol><li><p>State the hypotheses</p><ul><li><p>H0: μ variable 1 = μ variable 2 or μ variable 1 = specific value</p></li><li><p>Ha: μ variable 1 <mark data-color="rgba(0, 0, 0, 0)" style="background-color: rgba(0, 0, 0, 0); color: inherit">≠</mark> μ variable 2 or μ variable 1 <mark data-color="rgba(0, 0, 0, 0)" style="background-color: rgba(0, 0, 0, 0); color: inherit">≠</mark> specific value</p></li></ul></li><li><p>Choose a level of significance</p><ul><li><p>α = 5% (1-α = 95% is the confidence level)</p></li></ul></li><li><p>Critical value</p><ul><li><p>Critical t-values: +-(1-α /2), +- (n-1)</p><ul><li><p>This value is given on the exam</p></li></ul></li><li><p>Under the H0 and unknown σ, the t-score follows a t-distribution with n-1 degrees of freedom</p></li><li><p>Under the H0 and known s, the z-score follows a z-distribution</p></li></ul></li><li><p>Collect and summarize the data</p><ul><li><p>n, x bar, and standard error of the estimate (x bar)</p></li></ul></li><li><p>Compute the test statistic</p><ul><li><p>T-statistic or z-statistic</p></li></ul></li><li><p>Take a decision</p><ul><li><p>Compare t-value/z-value with critical value</p></li><li><p>Compare p-value with significance level (α)</p></li></ul></li></ol><p></p>
22
New cards
<p>Decision making visualized</p>

Decision making visualized

knowt flashcard image
23
New cards

What is the relation between the p-value and hypothesis testing?

  • p-value: evidence of how favorable the null hypothesis is

    • (very) large p-values favor the null hypothesis

    • (very) small p-values favor the alternative hypothesis

  • IMPORTANT: p-value is NOT the probability that the null hypothesis is true

  • In SPSS specify what value of μ we want to test with the one sample test

24
New cards

What is the relation between confidence interval and hypothesis testing?

  • In SPSS specify what value of μ we want to test with the one sample test

<ul><li><p>In SPSS specify what value of μ we want to test with the one sample test</p></li></ul><p></p>