pt 2: Normal Distribution and Percentiles

Chapter 6: Normal Distribution - Review and Examples

  • Review of Formulas:

    • The formula sheet contains expected value formulas for binomial distributions and formulas from Chapter 6.
    • Key formula: Transformation formula: z=xμσz = \frac{x - \mu}{\sigma}, where:
      • xx = observation
      • μ\mu = mean
      • σ\sigma = standard deviation
  • Normal Distribution Table:

    • Use either the table or calculator for z-values.
    • Calculators have a built-in z-function.
  • Probability Calculations:

    • Problems involve finding P(X < value).
    • Given: X has a normal distribution, with σ\sigma and μ\mu.
  • Transformation:

    • Apply the transformation formula: z=xμσz = \frac{x - \mu}{\sigma}.
    • Convert X to Z, then use the calculator.
  • Area Representation:

    • Plot the mean in the middle.
    • The given value (e.g., 18.6) is located relative to the mean.
    • The inequality sign indicates the area of interest.
  • Complement Rule:

    • If calculating probabilities to the right, use the complement rule: 1 - P(X < value).
    • Transform to Z, then calculate.
  • Conditional Probability Example:

    • Given download time is more than 17.3 seconds, find the probability it's less than 19.2 seconds.

    • Apply the conditional probability rule: P(AB)=P(AB)P(B)P(A|B) = \frac{P(A \cap B)}{P(B)}

    • Where:

      • AA: X < 19.2
      • BB: X > 17.3
    • Overlapping Events:

      • Determine the overlap of events.
      • If no overlap, events are mutually exclusive.
      • If overlap exists, calculate the intersection.
    • Calculation:

      • P(17.3 < X < 19.2) = P(X < 19.2) - P(X < 17.3)
      • Transform X values to Z values using z=xμσz = \frac{x - \mu}{\sigma}.
      • Use the calculator to find probabilities.
  • Calculator Usage:

    • Input Z-values into the calculator to find probabilities.
    • Example: For z = 0.24, probability = 0.5948.
  • Table Usage:

    • The table contains negative and positive values.
    • Find the Z-value in the table to read off the probability.
  • Marginal and Joint Probabilities:

    • Reference to marginal and joint probabilities is made.
  • Important Note:

    • Calculating probabilities where X=valueX = value results in zero because there is no area.
  • Expected Values and Variances:

    • For normal distribution, mean and standard deviation are typically given.

Importance and Applications

*P-values
* These concepts are important for p-values, hypothesis tests, and confidence intervals in later courses.

*Regression
* Normal distribution underlies regression models.

Percentiles

  • Percentiles are the inverse of probability calculations.

  • Formula:

    • To calculate percentiles, use: x=μ+zασx = \mu + z_{\alpha} \cdot \sigma
    • Where α\alpha represents a given probability.
  • Alpha:

    • α=0.5\alpha = 0.5 corresponds to the median.
    • α=0.25\alpha = 0.25 corresponds to the first quartile.
    • α=0.75\alpha = 0.75 corresponds to the third quartile.
  • Percentile Calculation:

    • Given a probability, find the corresponding X value.

Examples of Percentile Calculations

  • Definitions:

    • Percentile: General term.
    • Quartiles: Specific percentiles (25th, 50th, 75th).
    • P=80P = 80 (80th percentile), α=0.8\alpha = 0.8
  • Example 1: Finding the 80th Percentile of Download Time.

    • Given: X = download time, normally distributed with μ=18\mu = 18 and σ=5\sigma = 5.

    • The 80th percentile divides the distribution such that 80% is to the left.

    • Calculation:

      • P<em>80=μ+z</em>80σP<em>{80} = \mu + z</em>{80} \cdot \sigma
      • Find z80z_{80} using a calculator (inverse normal function).
      • z800.8416z_{80} \approx 0.8416
      • P80=18+(0.8416)(5)=22.208P_{80} = 18 + (0.8416)(5) = 22.208
    • Interpretation: 80% of downloads are less than 22.208 seconds.

  • Example 2: Finding X Value for a Given Probability.

    • Find X such that 5% of download times are less than X.

    • Percentile: P5P_5. α=0.05\alpha = 0.05

    • Calculation:

      • x=μ+z0.05σx = \mu + z_{0.05} \cdot \sigma
      • Find z0.05z_{0.05} using a calculator (inverse normal function).
      • z0.051.6449z_{0.05} \approx -1.6449
      • x=18+(1.6449)(5)=9.7755x = 18 + (-1.6449)(5) = 9.7755
    • 5% of download times will be less than 9.7755 seconds.

  • Example 3: Quartiles.

    • Quartiles divide data into four parts.

    • Calculations:

      • Q<em>1Q<em>1 (25th percentile): x=μ+z</em>0.25σx = \mu + z</em>{0.25} \cdot \sigma , The value for z0.250.6745z_{0.25} \approx -0.6745, x=18+(0.6745)(5)=14.6275x = 18 + (-0.6745)(5) = 14.6275
      • Q2Q_2 (50th percentile/median): It's equal to the mean which is 18.
      • Q<em>3Q<em>3 (75th percentile): x=μ+z</em>0.75σx = \mu + z</em>{0.75} \cdot \sigma, The value for z0.750.6745z_{0.75} \approx 0.6745, x=18+(0.6745)(5)=21.3725x = 18 + (0.6745)(5) = 21.3725

Interquartile range
* Calculate the interquartile range (IQR): Q<em>3Q</em>1=21.372514.6275=6.745Q<em>3 - Q</em>1 = 21.3725 - 14.6275 = 6.745
* It means that 50% of the download time will be between 14.63 and 21.37 seconds.

  • Example 4: Symmetry and Probability.

    • Find A and B such that probability between A and B is 90%, symmetrically around the mean.

    • Setup:

      • P(A < X < B) = 0.90, is set symmetrically around μ\mu.

      • The area to the left of A is 0.05, and to the right of B is 0.05.

      • A is the 5th percentile, and B is the 95th percentile.

    • Formula

      • A=μ+z<em>0.05σ,1.64A = \mu + z<em>{0.05} \cdot \sigma, -1.64 , and B=μ+z</em>0.95σ,1.64B = \mu + z</em>{0.95} \cdot \sigma, 1.64