Assumptions for Inference

Proportions(z)

one sample

Individuals are independent → SRS and <10% of population
Sample is sufficiently large → Successes and failures ≥ 10

two sample

Samples are independent → (How was the data collected)
Data in each sample are independent → Both SRS and <10% of populations
Both samples are sufficiently large → Success and failures ≥ 10 for both

Means(t)

One sample
(df = n-1)

Individuals are independent → SRS and <10% of population
Population has a Normal model → Histogram is unimodal and symmetric*

Matched pairs
(df = n-1)

Data are matched → (think about design)
Individuals are independent → SRS and <10% of population
Population of differences is Normal → Histograms of differences is unimodal and symmetric

2 independent samples
(df from calculator)

Samples are independent → (think about design)
Data in each sample are independent → SRSs and < 10% of populations
Both populations are Normal → Both histograms are unimodal and symmetric*

Distributions (Chi-square)

Goodness of fit
(df = #cells - 1)

Data are counts → (yes/no)
Data in sample are independent → SRS and <10% of population
Sample is sufficiently large → All expected counts ≥ 5

Homogeneity
(df = (r-1)(c-1))

Data are counts → (yes/no)
Data in groups are independent → SRS and <10% of population
Groups are sufficiently large → All expected counts ≥ 5

Independence
(df = (r-1)(c-1))

Data are counts → (yes/no)
Data are independent → SRS and <10% of population
Sample is sufficiently large → All expected counts ≥ 5

Regression
(df = n-2)

Association
ß = 0?

Form of relationship is linear → Scatterplot look approx. linear
Errors are independent → No apparent pattern in residuals plot
Variability of errors is constant → Residuals plot has consistent spread
Errors have a Normal model → Histogram of residuals is approximately unimodal and symmetric