Assumptions for Inference
Proportions(z)
one sample
Individuals are independent → SRS and <10% of population
Sample is sufficiently large → Successes and failures ≥ 10
two sample
Samples are independent → (How was the data collected)
Data in each sample are independent → Both SRS and <10% of populations
Both samples are sufficiently large → Success and failures ≥ 10 for both
Means(t)
One sample
(df = n-1)
Individuals are independent → SRS and <10% of population
Population has a Normal model → Histogram is unimodal and symmetric*
Matched pairs
(df = n-1)
Data are matched → (think about design)
Individuals are independent → SRS and <10% of population
Population of differences is Normal → Histograms of differences is unimodal and symmetric
2 independent samples
(df from calculator)
Samples are independent → (think about design)
Data in each sample are independent → SRSs and < 10% of populations
Both populations are Normal → Both histograms are unimodal and symmetric*
Distributions (Chi-square)
Goodness of fit
(df = #cells - 1)
Data are counts → (yes/no)
Data in sample are independent → SRS and <10% of population
Sample is sufficiently large → All expected counts ≥ 5
Homogeneity
(df = (r-1)(c-1))
Data are counts → (yes/no)
Data in groups are independent → SRS and <10% of population
Groups are sufficiently large → All expected counts ≥ 5
Independence
(df = (r-1)(c-1))
Data are counts → (yes/no)
Data are independent → SRS and <10% of population
Sample is sufficiently large → All expected counts ≥ 5
Regression
(df = n-2)
Association
ß = 0?
Form of relationship is linear → Scatterplot look approx. linear
Errors are independent → No apparent pattern in residuals plot
Variability of errors is constant → Residuals plot has consistent spread
Errors have a Normal model → Histogram of residuals is approximately unimodal and symmetric