7.1: Statistics and Parameters
Statistics and Parameters
- Statistic: a number that describes some characteristic of the sample
- Relevant symbols
- Mean: x̄
- Proportion: p̂
- Standard deviation: S
- Parameter: a number that describes some characteristic of the population
- Relevant symbols
- Mean: μ
- Proportion: p
- Standard deviation: σ
- Samples are taken to try to estimate the population (μ or p)
- Ultimate goal: Estimate parameters based on statistics
Distribution, Variability, and Bias
- Sampling distribution: the distribution of values taken by the statistic in all possible samples of the same size from the same population
- Eg. sample mean, proportion
- Sampling variability: how much results vary between samples
- Every time a sample is taken, the results will vary
- Measured using the spread of the random sample
- Based primarily on the size of the random sample
- As a general rule of thumb,
- Larger sample: less variability
- Smaller sample: more variability
- The spread does not depend on the size of the population, as long as it is at least 10x larger than the sample
- Biased statistic: a statistic which consistently overestimates or underestimates the parameter (mean or proportion)
- Unbiased statistic: a statistic in which the distribution of samples is centered around the true population’s parameter
Bias vs. variance
Means and Proportions
- Proportion problems generally involve categorical variables
- Binomial distributions will become approximately normal distributions if np≥10 and n(1-p)≥10
- n = sample size
- p = population proportion
- p̂ = sample proportion
- Normal distribution means the use of z-scores as a standard measure

- Statistics are unbiased if they are equal to the true parameters, so p̂ is an unbiased estimator of p
Verifying Conditions
- If an SRS is taken of size n from a large population with proportion p,
- Some conditions must be stated and checked
- Is the population more than 10x larger than the sample size?
- The 10% condition verifies that the standard deviation formula may be used
- Is np ≥ 10 and is n(1-p) ≥ 10?
- This verifies that a normal approximation may be used
- If these conditions are met, the mean of the sample proportions will equal the true population proportion
