Variability Notes

Variability can be defined as:
- A quantitative distance measure based on the differences between scores.
- Describes the spread of scores or distance of a score from the mean.
Purposes of measure of variability:
- Describe the distribution (clustered vs. spread out).
- Measures how well an individual score represents the distribution.

Variability helps to differentiate between distributions, even if they have similar means.
Example: Two treatments (A and B) might have different means, but the variability within each treatment group can influence the interpretation of the results.

Discrete Variable: Highest score – Lowest score.
- Example: 1, 2, 3, 4, 5, 15. Range = 15 – 1 = 14.
Continuous Variable: (Upper Real Limit of Highest score) – (Lower Real Limit of Lowest score).
- Example: 1, 2, 3, 4, 5, 15. Range = 15.5 – 0.5 = 15.

Standard deviation:
- Most common and important measure of variability.
- A measure of the standard or average distance from the mean.
- Describes whether the scores are clustered closely around the mean or are widely scattered.
- Calculation differs for population and samples.
Variance:
- A necessary companion concept to standard deviation.

Goal of inferential statistics:
- Draw general conclusions about the population based on limited information from a sample.
Samples differ from the population:
- Samples have less variability.
- Computing the variance and standard deviation in the same way as for a population would give a biased estimate of the population values.
- The bias in sample variability is consistent and predictable, which means it can be corrected.

To fix the bias in sample variability, we use degrees of freedom (df), which is the number of scores in the sample that are independent and free to vary.
Corrects for the fact that we used the sample mean.
When using your sample mean to calculate spread:
- You're not using the true average from the whole population, so the variation looks smaller than it really is.
- Dividing by (n – 1) instead of n corrects this and gives a better estimate of the real (population) spread.

For both populations and samples, it is easy to represent mean and standard deviation:
- A vertical line in the “center” denotes the location of the mean.
- A horizontal line to the right, left (or both) denotes the distance of one standard deviation.

Adding a constant to each score:
- The mean is changed.
- The standard deviation is unchanged.
Multiplying each score by a constant:
- The mean is changed.
- The standard deviation is also changed and is multiplied by that constant.