9.+Inferring+Population+Means

Probability Distribution

Discrete and Continuous Outcomes

Discrete Outcomes: Outcomes that can take specific values, often counts (e.g., number of students).
Continuous Outcomes: Outcomes that can take any value within a given range (e.g., height, weight).

Probabilities for Random Variables

Discrete Random Variables: Use probability mass functions (PMF).
Continuous Random Variables: Use probability density functions (PDF).

Normal Model

Defined by mean (µ) and standard deviation (σ).
Standard Normal Model: A normal distribution with mean of 0 and standard deviation of 1.

R Functions

To find probabilities from normal distribution: pnorm().
To find percentiles: qnorm().

Characteristics of Binomial Models

Defined by number of trials (n) and probability of success (p).
Model Symbols: P(X = k) indicates the probability of k successes in n trials.
Use pbinom() in R to find probabilities.

Mean and Standard Deviation of Binomial Distribution

Mean: µ = np
Standard Deviation: σ = √(np(1 - p))

Population Parameter vs. Sample Statistics

Population Parameter: A value describing a characteristic of a population (e.g., population mean µ).
Sample Statistics: A value describing a characteristic of a sample (e.g., sample mean x̄).
Common symbols: Population mean (µ), Sample mean (x̄), Population proportion (P), Sample proportion (p̂).

Central Limit Theorem (CLT) for Sample Proportions

Conditions: Random sampling, independent samples, large sample size (n * p ≥ 10 and n * (1 - p) ≥ 10).
Distribution: As sample size increases, the distribution of sample proportions approaches a normal distribution.

Confidence Interval of Population Proportions

Interpretation: A range of values that likely contains the true population proportion, with a specified confidence level.

Hypothesis Testing for Population Proportions

Null Hypothesis (H0): Assumes no effect or no difference.
Alternative Hypothesis (H1): Assumes there is an effect or a difference.
Types: One-sided and two-sided tests.
Significance Level (α): The probability of rejecting the null hypothesis when it is true.
Interpreting Test Statistics: Compare test statistic to critical value or use p-value approach.

R Functions for Confidence Intervals and Hypothesis Testing

For calculating confidence intervals: prop.test() for proportions.
For hypothesis testing, also use: prop.test().

Type I Error and Type II Error

Type I Error: Rejecting H0 when it is true.
Type II Error: Failing to reject H0 when H1 is true.

Comparing Proportions from Two Populations

Use two-sample tests for proportions to compare population differences.

Central Limit Theorem for Sample Means

Conditions: Random sampling, independence, large sample size (n ≥ 30).
Distribution: Approaches normal as sample size increases.

t-Statistic and t-Test

Used for hypothesis testing for means when population standard deviation is unknown.
Shapiro-Wilk Test: Assess normality (H0: data is normally distributed).
Variance Test: Test for equality of variances (H0: variances are equal).

Independent vs Paired Samples

Independent Samples: Samples not related.
Paired Samples: Measurements taken from the same group at different times (e.g., pre-test and post-test).

R Functions for t-Test

For one-sample t-test: t.test(x = , mu = ).
For two-sample t-test: t.test(x = , y = ).

Interpretation of t-Test Results

Compare p-value to significance level to draw conclusions about H0.