Calculating and Interpreting Z-Scores: Transformations and the Empirical Rule

Definition and Understanding of Raw Scores

  • Raw Score (xx): A raw score is a measurement exactly in the form it was collected. It represents the original data point before any transformations (like converting to a zz-score) have been applied.
  • Nature of Raw Scores: Once a score is transformed into something else, it is no longer considered a raw score.

Problem 5: Solving for Standard Deviation (ss)

  • Given Information:
    • Mean (μ\mu or xˉ\bar{x}) = 2020
    • Raw score (xx) = 19.519.5
    • zz-score (zz) = 0.25-0.25
  • Interpreting the zz-score (z=0.25z = -0.25):
    • The Sign: The negative sign indicates the score is below the average (mean).
    • The Magnitude: The score is 0.250.25 standard deviations below the mean.
    • Comparison: A zz-score of 3-3 is very far below the mean, 2-2 is pretty far, and 1-1 is somewhat far. A zz-score of 0.25-0.25 is not very far below the mean; it is relatively close to the center.
  • The Mathematical Formula:
    • z=xmeansz = \frac{x - \text{mean}}{s}
  • Calculation Steps:
    1. Substitute the known values: 0.25=19.520s-0.25 = \frac{19.5 - 20}{s}
    2. Isolate the variable ss by multiplying both sides by ss: 0.25×s=19.520-0.25 \times s = 19.5 - 20
    3. Simplify the numerator: 19.520=0.519.5 - 20 = -0.5
    4. The equation becomes: 0.25×s=0.5-0.25 \times s = -0.5
    5. Divide by 0.25-0.25 to solve for ss: s=0.50.25s = \frac{-0.5}{-0.25}
    6. Final result: s=2s = 2

Problem 6: Conceptualizing Deviations and Calculating zz

  • Given Information:
    • Standard deviation (ss) = 44
    • Akira scored 77 points above the mean.
  • Constraint: The problem does not provide the specific raw score (xx) or the specific mean (e.g., the mean could be 100100 and the score 107107, or the mean 5050 and the score 5757). It only provides the relationship between the two.
  • Defining the ‘Deviation’:
    • The numerator of the zz-score formula (xmeanx - \text{mean}) is formally called the deviation.
    • The deviation tells you if a score was above or below the mean and by how many real-world units (points, dollars, inches, etc.).
    • In this problem, the deviation is given as positive 77 (since it is 7 points above the mean).
  • Calculating the zz-score:
    • Conceptual Formula: z=deviationstandard deviationz = \frac{\text{deviation}}{\text{standard deviation}}
    • Plugging in values: z=74z = \frac{7}{4}
    • Final result: z=1.75z = 1.75
  • Interpretation: Akira is 1.751.75 standard deviations above the mean. Because this value is close to 22, she performed very well. While 77 points might seem small or large depending on the specific test, a zz-score of 1.751.75 consistently indicates a high relative standing.

Interpreting zz-scores: Magnitude and Direction

  • Generic Interpretation: A zz-score represents how many standard deviations a score is above or below the average.
  • Examples:
    • z=1.2z = 1.2: 1.21.2 standard deviations above average.
    • z=1.2z = -1.2: 1.21.2 standard deviations below average.
    • z=0.1z = 0.1: 0.10.1 standard deviations above average.
    • z=2.3z = -2.3: 2.32.3 standard deviations below average.
  • Who Scored Above Average? Those with positive zz-scores (1.21.2 and 0.10.1).
  • Furthest Above Average: The highest positive value (z=1.2z = 1.2).
  • Furthest Below Average: The negative value with the largest magnitude (z=2.3z = -2.3).
  • Most Average: The score closest to zero (z=0.1z = 0.1). The sign does not matter for being "average"; only the proximity to zero (absolute value) matters.
  • Least Average: The score farthest from zero (z=2.3z = -2.3). This represents the furthest distance from the mean in any direction.
  • Exam Preferences:
    • Between positive zz-scores: It is better to have a higher score (e.g., 1.21.2 is better than 0.10.1).
    • Between negative zz-scores: It is better to have a smaller magnitude (e.g., 1.2-1.2 is better than 2.3-2.3 because being slightly below average is preferable to being significantly below average).

The Utility of zz-scores

  1. Relative Standing: zz-scores specify where a score sits within its distribution. A raw score (e.g., a height of 67 inches67\text{ inches}) does not reveal if you are tall or short relative to a group, but a zz-score does.
  2. Cross-Distribution Comparison: zz-scores allow for the direct comparison of scores from entirely different distributions (different tests, teachers, or subjects).
Case Study: Heather vs. Jasmine
  • Scenario: Two students take exams from different professors.
  • Heather's Class: Mean (μ\mu) = 7070, Standard Deviation (σ\sigma) = 1010. Heather scored 9595.
    • Deviation: 9570=2595 - 70 = 25
    • zz-score: z=2510=2.5z = \frac{25}{10} = 2.5
  • Jasmine's Class: Mean (μ\mu) = 8080, Standard Deviation (σ\sigma) = 55. Jasmine scored 9595.
    • Deviation: 9580=1595 - 80 = 15
    • zz-score: z=155=3.0z = \frac{15}{5} = 3.0
  • Conclusion: Even though both earned a raw score of 9595, Jasmine performed better relative to her class (z=3.0z = 3.0) than Heather did relative to hers (z=2.5z = 2.5). Jasmine is three full standard deviations above her class mean, likely represent the very top of her class.

zz-score Transformations

  • Definition: Transformation is the process of converting every single raw score in a sample or population into its corresponding zz-score.
  • Three Constant Characteristics of a zz-score Distribution:
    1. The Mean is always 00: Since a raw score equal to the mean results in a zz-score of 00, the average of all transformed zz-scores will always be 00 (μz=0\mu_z = 0).
    2. The Standard Deviation is always 11: The process of standardizing the scores scales the distribution so that the standard deviation is exactly 11 (σz=1\sigma_z = 1).
    3. The Shape remains the same: Transformational scaling does not change the shape of the distribution. If the raw scores follow a normal distribution, the zz-scores will be normal. If the raw scores are skewed, the zz-score distribution remains skewed.
  • Common Misconception: Changing scores to zz-scores does NOT "fix" or "normalize" a distribution. It only standardizes it.

The Empirical Rule (68-95-99.7 Rule)

  • Condition: This rule applies only if you are working with a Normal Distribution (bell-shaped curve).
  • The Rule Breakdown:
    • 68% within 1 SD: Approximately 68%68\% of scores fall between z=1z = -1 and z=1z = 1.
    • 95% within 2 SD: Approximately 95%95\% of scores fall between z=2z = -2 and z=2z = 2.
    • 99.7% within 3 SD: Approximately 99.7%99.7\% of scores fall between z=3z = -3 and z=3z = 3.
  • Area Under the Curve: Conceptually similar to calculus, we look at the area between points on a number line as a percentage of the total area.
  • Outliers: Only 0.3%0.3\% of people fall beyond three standard deviations from the mean in either direction (100%99.7%=0.3%100\% - 99.7\% = 0.3\%). This means scoring higher than a z=3z = 3 or lower than a z=3z = -3 is extremely rare (less than half of one percent of the population).
  • Summary of Magnitude:
    • 00: Exactly at the mean.
    • ±1\pm 1: Solid middle majority (68% to 70%68\% \text{ to } 70\%
    • ±2\pm 2: Pretty far out (in the outer 5%5\%
    • ±3\pm 3: Extreme outlier.