Notes on Transformations, Z-Scores, and Percentiles
- Adding/Subtracting a constant 'a':
- Measures of center/location (mean, median, percentiles) change by +a or −a.
- Measures of variability (range, IQR, standard deviation) do not change.
- Shape of the distribution does not change. Shifts distribution left/right.
- Multiplying/Dividing by a positive constant 'b':
- Measures of center/location and variability are multiplied/divided by b.
- Shape of the distribution does not change. Scales the distribution.
- Combined Transformations (linear transformation Y=bX+a):
- Mean: μY=bμX+a
- Standard Deviation: σY=∣b∣σX
- Shape remains unchanged.
Standardizing Distributions: Z-scores
- Definition: A z-score measures how many standard deviations a data value X is from the mean μ.
- Formula: Z=σX−μ
- Effects on Distribution:
- Shape: Unchanged.
- Center: Mean of z-scores is 0.
- Variability: Standard deviation of z-scores is 1.
- Purpose: Allows comparison of values from different distributions on a common scale.
Percentiles and Cumulative Relative Frequency
- p-th percentile: The value below which p percent of the observations fall.
- Formula: P(X≤xp)=100p
- Cumulative Relative Frequency Graphs: Used to estimate percentiles for individual values and vice versa.
Check Your Understanding (Knoebels Amusement Park Example)
- Initial: Mean μ=1.705 dollars, Std Dev σ=0.447 dollars.
- Convert to Cents (Multiply by 100):
- Shape: Unchanged.
- Mean: 1.705×100=170.5 cents.
- Std Dev: 0.447×100=44.7 cents.
- Increase by 25 Cents (Add 25):
- Shape: Unchanged.
- Mean: 170.5+25=195.5 cents.
- Std Dev: 44.7 cents (unchanged by addition).
- Convert to Z-scores:
- Shape: Unchanged.
- Mean: 0.
- Std Dev: 1.
- The new mean (195.5 cents) and SD (44.7 cents) would be used for the z-score calculation.Z′=44.7X′−195.5
- Transformations:
- Add/Subtract 'a': Location measures change by 'a'. Variability (range, IQR, SD) and shape unchanged.
- Multiply/Divide by 'b': Location measures and variability scale by 'b'. Shape unchanged.
- Linear Transformation Y=bX+a:
- Mean: μY=bμX+a
- Std Dev: σY=∣b∣σX
- Z-scores: Z=σX−μ
- Standardized distribution has mean 0, standard deviation 1, and original shape.
- Percentiles: The p-th percentile is the value xp such that P(X≤xp)=100p.
- Cumulative relative frequency graphs help visualize