Sampling distributions and difference in means

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/37

flashcard set

Earn XP

Description and Tags

Sampling distributions and difference in means

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

38 Terms

1
New cards

Sampling Distributions

  • is the probability distribution of a statistic (e.g., mean, proportion) from multiple samples.

2
New cards

Probability

  • is a measure of the certainty associated with a future event or occurrence and is usually expressed as a number between 0 and 1 (or between 0 % and 100 %).

3
New cards

How to measure probability?

P (A) = Number of favourable cases/ Number of possible cases

P (A) = Number of times A occur/ Number of times A can occur

4
New cards

Example of probability measurement

Coin toss (A).

P (A) = 1/2 = 0,5

5
New cards

Characteristics of normal distribution of mean and variance

  1. Being bell-shaped, mesocturtic.

  2. Being symmetric

  3. Coincide mean, median and mode = central tendencies

  4. Having 95% of the indicators within the interval mean +- 2 standard deviations.

6
New cards

Central Limit Theorem (CLT)

  • The distribution of the sample mean approaches a normal distribution as sample size increases (n ≥ 30).

  • Mean of sampling distribution = population mean (µ).

  • Standard deviation of sampling distribution = standard error (SE).

7
New cards

Estimator

  • Is a function of the values in the sample.

  • Estimators can be calculated with the sample

8
New cards

Parameter

  • population

9
New cards

Normal distribution is indicated for data

  • follow a continuous scale: weight, height, age, cholesterol, blood pressure, uric acid

  • It has the advantage that under certain conditions

10
New cards

Law of large numbers

states that as a sample size increases, the sample mean (x-bar) gets closer to the population mean (µ)

11
New cards

There are two types of Law of Large Numbers

  • Weak Law

  • Strong law

12
New cards

Weak law

Sample mean converges in probability to the population mean.

13
New cards

Strong law

Sample mean converges almost surely to the population mean.

14
New cards

Implications of Law of Large Numbers

  • Larger samples provide more accurate estimates of population parameters.

15
New cards

Standard Error (SE) formula for mean

  • = Population standard deviation

  • = Sample size

16
New cards

Standard Error (SE) Formula for proportion

p = Population proportion

17
New cards

Confidence Intervals (CI)

  • gives a range of values where a population parameter likely falls.

18
New cards

Formula for CI of the Mean

19
New cards

Formula for CI of Proportions

20
New cards
21
New cards

Hypothesis Testing for Means

  1. State null (H₀) and alternative (H₁) hypotheses.

  2. Select significance level (∂., usually 0.05).

  3. Calculate test statistic (z or t-score).

  4. Compare with critical value OR find p-value.

  5. Decide: Reject H₀ if test statistic > critical value OR p-value <∂. (alfa)

22
New cards

  • t-Test Formula for Mean Comparison

23
New cards

Comparing Two Means

Used to compare means from two independent or paired samples.

24
New cards

ANOVA (Analysis of Variance)

  • Used to compare more than two means

25
New cards

Assumptions for t-Tests & ANOVA

  1. Normality – Data should be normally distributed.

  2. Independence – Samples should be independent.

  3. Homogeneity of Variance – Variances should be equal across groups (check with Levene’s test).

26
New cards

Concepts & formulas

27
New cards

PROBABILITY  - normal distribution

28
New cards

DESCRIPTIVE STATISTICS

29
New cards

Central Limit Theorem

  • states that sample distributions approach the normal distribution as the sample size increases.

  • This means that very small sample sizes can cause problems in approximating a normal distribution.

  • If X is a random variable of any distribution, with a large sample size (n > 30), it follows approximately a normal distribution.

  • This property is valid whatever the random variable

30
New cards

Analysis of normality

Parametric

Non-parametric

31
New cards

Parametric

A variable is considered parametric if it follows a normal distribution

32
New cards

Non-parametric

  • a variable which does not follow a normal distribution

  • This is especially important when defining the statistics in statistics.

  • Inferential

33
New cards

Shapiro-Wilk

  • If p > 0.05 it means that the distribution is normal, which means that it is considers a parametric variable

34
New cards

Example of Shapiro-Wilk

35
New cards

Distribution of averages

  • Set of "averages" of samples of a given size that have been drawn from a population.

  • It is the distribution (data set) of reference when we do a hypothesis test on a sample or more than one person.

36
New cards

Distribution model (normal distribution) Student's t-distribution

  • Symmetric with respect to the value 0, where the indices of central tendency coincide (unimodal).

  • Can take positive and negative values

  • Asymptotic about the abscissa axis

  • There is a whole family of T-distributions depending on their degrees of freedom (l.g.).

  • As the l.g. increases, the dis

  • parametric test to analyse it

37
New cards

Pearson's x2 distribution

  • Cannot take negative values

  • Positive asymmetric

  • There is a whole family of distributions depending on their degrees of freedom (l.g.).

  • Right-hand asymptotic only

  • As the l.g. increases, the distribution becomes closer to normal.

  • Non-parametric distribution

38
New cards

Snedecor F distribution

  • Cannot take negative values

  • Positive asymmetric

  • There is a whole family of distributions depending on their degrees of freedom (l.g.) in numerator and denominator.

  • Right-hand asymptotic only

  • As the l.g. of numerator and denominator increase, the distribution becomes closer to normal.

  • Non-parametric distribution