Analyzing SAT Score Percentiles Using the Normal Model and the Empirical Rule

General Parameters and Assumptions

  • Standard SAT Statistics:     * Mean (μ\mu): The overall mean for all test takers is approximately 500500.     * Standard Deviation (σ\sigma): The overall standard deviation is approximately 100100.     * Variability: While the mean and standard deviation may differ from these target values in any specific year by small amounts, these figures serve as a reliable overall approximation for analysis.

Problem Scenario and Planning

  • The Question: Suppose a student earns a score of 600600 on one part of the SAT. The goal is to determine the student's standing (percentile) among all other test takers.
  • The Problem-Solving Process (Think, Show, Tell):     * Step 1: Think (Planning):         * Objective: State clearly what is sought. The objective is to determine how a specific SAT score compares with the scores of all other students.         * Modeling Requirement: To accomplish this comparison, the distribution of scores must be modeled.         * Identifying Variables: Let yy represent the variable of interest, which is the SAT score.         * Data Type: SAT scores are classified as quantitative data. They do not have meaningful units other than "points."

Conditions and Model Specification

  • Nearly Normal Condition: Before applying a normal model, this condition must be checked.     * Data-Driven Check: If raw data were available, one would inspect a histogram to verify the shape of the distribution.     * Assumed Distribution: In this scenario, while raw data is absent, it is explicitly stated that SAT scores are "roughly unimodal and symmetric," justifying the use of the normal model.
  • Model Parameters:     * The model used is a normal distribution with specified parameters: a mean of 500500 and a standard deviation of 100100.     * This is denoted notationally as N(500,100)N(500, 100).

Mechanics and Visualization

  • Step 2: Show (Mechanics): This phase involves the mathematical calculations and visual representations necessary to reach a conclusion.
  • Z-score Calculation:     * The z-score represents how many standard deviations a value is from the mean.     * Formula applied: z=yμσz = \frac{y - \mu}{\sigma}     * Calculation for a score of 600600: z=600500100=1z = \frac{600 - 500}{100} = 1     * This result indicates the score of 600600 is exactly 11 standard deviation above the mean.
  • Normal Model Visualization:     * A simple sketch of the normal curve is used to map the distribution.     * The 68-95-99.7 (Empirical) Rule: This rule dictates the distribution of data points within standard deviations:         * Within 1 Standard Deviation (Blue): Approximately 68%68\% of the values fall within one standard deviation of the mean (400400 to 600600).         * Within 2 Standard Deviations (Green): Approximately 95%95\% of the values fall within two standard deviations of the mean (300300 to 700700).         * Within 3 Standard Deviations (Red): Approximately 99.7%99.7\% of the values fall within three standard deviations of the mean (200200 to 800800).     * Individual Placement: By locating the score of 600600 on the sketch, it is confirmed to be exactly at the boundary of the first standard deviation above the mean.

Conclusion and Interpretation

  • Step 3: Tell (Evaluation): The final step involves interpreting the mathematical findings within the context of the original question.
  • Calculating the Upper Tail:     * Because the normal distribution is symmetric, and 68%68\% of scores are within one standard deviation (±1σ\pm 1\,\sigma) of the mean, the remaining scores comprise 32%32\% of the population (100%68%=32%100\% - 68\% = 32\%).     * These remaining scores are split equally between the lower tail (more than 11 SD below the mean) and the upper tail (more than 11 SD above the mean).     * Therefore, the percentage of students scoring higher than 11 standard deviation above the mean is half of 32%32\%, which is 16%16\%.
  • Determining Percentile:     * If approximately 16%16\% of test takers scored better than 600600, then the remaining portion of the population scored at or below that value.     * Calculation: 100%16%=84%100\% - 16\% = 84\%     * Final Contextual Result: A score of 600600 on the SAT is higher than approximately 84%84\% of all scores on the test.