STK110 Empirical Rule and Bell-Shaped Distribution Analysis

Presentation Context and Dashboard Information

  • Course Code: STK110
  • Topic: Empirical Rule and Normal Distribution Application.
  • Contextual Metadata:     * Slide Number: Slide 60 of 76.     * Timestamp: 16 April at 15:00.     * Interface Details: The presentation software displays a zoom level of 95%95\%.     * Reference Codes: A partial identifier "*** IN 19970" is visible on the interface.     * *Key Statistical Benchmarks: A sequence of values is displayed on the interface representing standard deviation increments from the mean:          2424         * 3636         * 4848         * 6060         * 7272         * 8484         * 9696

Characteristics of STK110 Semester Marks

  • Distribution Type: The marks are described as "bell-shaped." In statistics, this confirms that the data follows a Normal (Gaussian) Distribution. Common properties include:     * Symmetry: The distribution is symmetric about the mean.     * Central Tendency: The mean, median, and mode are essentially equal at the center of the distribution.     * Asymptotic Nature: The tails of the curve approach the horizontal axis but never touch it.
  • Mean (μ\mu): The average semester mark is established as 60%60\%.
  • Standard Deviation (σ\sigma): The spread of the data or average distance from the mean is 12%12\%.

The Empirical Rule (68-95-99.7 Rule)

  • The provided material utilizes the Empirical Rule, which describes the percentage of data falling within specific standard deviations from the mean in a bell-shaped distribution:     * One Standard Deviation (μ±1σ\mu \pm 1\sigma): Approximately 68%68\% of the data falls within this range. Based on the data, this range is 601260 - 12 to 60+1260 + 12, or [48,72][48, 72].     * Two Standard Deviations (μ±2σ\mu \pm 2\sigma): Approximately 95%95\% of the data falls within this range. For this dataset, that is 602(12)60 - 2(12) to 60+2(12)60 + 2(12), or [36,84][36, 84].     * Three Standard Deviations (μ±3σ\mu \pm 3\sigma): Approximately 99.7%99.7\% of the data falls within this range. For this dataset, that is 603(12)60 - 3(12) to 60+3(12)60 + 3(12), or [24,96][24, 96].

Calculation of Marks Below Threshold (48%)

  • Task: Calculate the percentage of students' marks below 48%48\%.
  • Step 1: Determine the Z-score/Standard Deviation Distance:     * The value 48%48\% is exactly one standard deviation below the mean: 486012=1σ\frac{48 - 60}{12} = -1\sigma.
  • Step 2: Apply the Empirical Rule:     * The rule states that 68%68\% of observations are between 1σ-1\sigma (48%48\%) and +1σ+1\sigma (72%72\%).     * The total area under the curve is 100%100\%.     * The area outside of the center range is 100%68%=32%100\% - 68\% = 32\%.
  • Step 3: Account for Symmetry:     * Because the bell curve is symmetric, the 32%32\% area outside the center is split equally between the two tails.     * The tail below 1σ-1\sigma (48%48\%) is 32%2=16%\frac{32\%}{2} = 16\%.
  • Final Result: 16%16\% of students' marks are below 48%48\%.

Calculation of the 97.5th Percentile (P97.5P_{97.5})

  • Task: Calculate the mark representing the 97.5th percentile (P97.5P_{97.5}).
  • Definition: A percentile is the value below which a given percentage of observations fall. For P97.5P_{97.5}, 97.5%97.5\% of the area is to the left, and 2.5%2.5\% is to the right.
  • Relating to Standard Deviations:     * We know from the Empirical Rule that 95%95\% of data falls between μ±2σ\mu \pm 2\sigma.     * The total area remaining in both tails is 100%95%=5%100\% - 95\% = 5\%.     * Because the distribution is symmetric, each tail contains 5%2=2.5%\frac{5\%}{2} = 2.5\%.     * Therefore, the area to the left of μ+2σ\mu + 2\sigma includes the main 95%95\% center plus the bottom 2.5%2.5\% tail (95%+2.5%=97.5%95\% + 2.5\% = 97.5\%).
  • Execution:     * P97.5P_{97.5} corresponds to μ+2σ\mu + 2\sigma.     * Calculation: 60+2(12)=60+24=8460 + 2(12) = 60 + 24 = 84.
  • Final Result: The 97.5th percentile mark is 84%84\%.

Identification of Statistical Outliers

  • Task: Determine the thresholds below or above which a mark is considered an outlier.
  • Statistical Criterion: In the context of the Empirical Rule and bell-shaped distributions, values that lie more than three standard deviations (3σ3\sigma) from the mean are typically categorized as outliers.
  • Calculating the Upper Threshold:     * Formula: μ+3σ\mu + 3\sigma     * Calculation: 60+3(12)=60+36=9660 + 3(12) = 60 + 36 = 96
  • Calculating the Lower Threshold:     * Formula: μ3σ\mu - 3\sigma     * Calculation: 603(12)=6036=2460 - 3(12) = 60 - 36 = 24
  • Final Result: A semester mark below 24%24\% or above 96%96\% will be considered an outlier.