Statistics Practice Problems: Normal Distribution and Empirical Rule Study Guide
The Empirical Rule and Normal Distribution Fundamentals
- The Empirical Rule (also known as the 68−95−99.7 rule) provides a statistical guideline for data that follows a normal distribution.
- Standard Distribution Segments:
* The area within one standard deviation (μ±1σ) of the mean encompasses approximately 68% of the data. This is divided into two segments of 34% on either side of the mean (μ).
* The area within two standard deviations (μ±2σ) of the mean encompasses approximately 95% of the data. The additional area between 1σ and 2σ (on both the positive and negative ends) accounts for 13.5% each (95%−68%=27%, and 227%=13.5%).
* The area within three standard deviations (μ±3σ) of the mean encompasses approximately 99.7% of the data. The additional area between 2σ and 3σ accounts for approximately 2.35% on each side (99.7%−95%=4.7%, and 24.7%=2.35%).
* The area beyond three standard deviations (the tails) accounts for the remaining 0.3%, which is divided into 0.15% in each tail (μ−3σ and μ+3σ).
Problem 1: Analyzing Employee Commute Times
- Given Parameters:
* Distribution Type: Normal Distribution.
* Mean (μ): 40minutes.
* Standard Deviation (σ): 5minutes.
- Objective: Calculate the percentage of employees with a commute time between 35 and 50minutes.
- Step-by-Step Breakdown:
* The lower bound of 35minutes is equal to μ−1σ (40−5=35).
* The upper bound of 50minutes is equal to μ+2σ (40+(2×5)=50).
* Summing the distribution percentages from the Empirical Rule:
* From μ−1σ to μ: 34%
* From μ to μ+1σ: 34%
* From μ+1σ to μ+2σ: 13.5%
* Final Calculation: 34%+34%+13.5%=81.5%.
* Conclusion: Approximately 81.5% of employees have a commute time between 35 and 50minutes.
Problem 2: Television Consumption Habits in Teenagers
- Given Parameters:
* Total Sample Size (n): 600teenagers.
* Mean (μ): 14hours/week.
* Standard Deviation (σ): 2.5hours/week.
- Objective: Determine how many teenagers in the group of 600 spend between 9 and 19hours watching TV per week.
- Distribution Analysis:
* Lower bound calculation: 9hours=14−(2×2.5)=μ−2σ.
* Upper bound calculation: 19hours=14+(2×2.5)=μ+2σ.
* According to the Empirical Rule, the area within 2 standard deviations of the mean encompasses 95% of the population.
- Numerical Estimation:
* Total expected number = 95%×600.
* Calculation: 0.95×600=570.
- Conclusion: We would expect 570teenagers to spend between 9 and 19hours watching TV each week.
- Given Parameters:
* Distribution Type: Normal Distribution.
* Mean Score (μ): 95points.
* Standard Deviation (σ): 2points.
* Observed Performance (x): 89points.
- Performance Assessment (Standard Deviations):
* Calculate the Z-score (the number of standard deviations the value is from the mean).
* Formula: Z=σx−μ
* Calculation: Z=289−95=2−6=−3.
- Interpretation:
* A performance of 89 points is exactly 3standard deviations below the mean score (μ−3σ).
* Conclusion: The team performed significantly below average. In a normal distribution, only 0.15% of the population scores lower than 3 standard deviations below the mean, making this an extremely poor performance compared to the average match.
- Scenario Part A: Group Expectations:
* Sample Size (n): 400participants.
* Mean (μ): 50push-ups.
* Standard Deviation (σ): 8push-ups.
* Target Range: 42 to 58push-ups.
* Range Analysis: 42=50−8=μ−1σ and 58=50+8=μ+1σ.
* Empirical Rule Percentage: 68% of the population falls within one standard deviation.
* Calculation: 0.68×400=272.
* Result: Approximately 272participants are expected to complete between 42 and 58push-ups.
- Scenario Part B: Medal Eligibility (Top 2.5%):
* Objective: Determine if someone doing 67push-ups deserves a medal.
* Criteria: Exceed expectations and perform in the top 2.5%.
* Statistical Threshold: In a normal distribution, the area beyond 2standard deviations on the high side is 2.35%+0.15%=2.5%.
* Upper limit for top 2.5%: μ+2σ=50+(2×8)=50+16=66push-ups.
* Evaluation: The participant performed 67push-ups, which is greater than the threshold of 66.
* Conclusion: Yes, the participant deserves a medal because their performance of 67 push-ups places them within the top 2.5% of the group.
Problem 5: Nutritional Analysis of Adult Calorie Intake
- Given Parameters:
* Mean Daily Intake (μ): 2,200calories.
* Standard Deviation (σ): 300calories.
- Task 1: Percentage between 1,900 and 3,100 calories:
* Lower limit: 1,900=2,200−300=μ−1σ.
* Upper limit: 3,100=2,200+(3×300)=μ+3σ.
* Empirical Rule Summation: Percent=(area from μ−1σ to μ)+(area from μ to μ+3σ).
* Calculation: 34%+(34%+13.5%+2.35%)=34%+49.85%=83.85%.
* Result: Roughly 83.85% of adults consume between 1,900 and 3,100calories per day.
- Task 2: Expected headcount out of 500 adults:
* Calculation: 0.8385×500=419.25.
* Result: We would expect approximately 419adults out of 500 to be within this range.
- Task 3: Assessing an intake of 1,300 calories:
* Standard Deviation Calculation: Z=3001,300−2,200=300−900=−3.
* This intake is exactly 3standard deviations below the mean (μ−3σ).
* Percentage ranking: Only 0.15% of the population consumes less than this amount.
* Conclusion: Someone eating 1,300calories per day would be considered on the extreme "low side." It is not considered "okay" or average as it represents a significant statistical outlier compared to the typical intake of that region.