Notes on Normal Distribution

Example Calculations

  • To calculate probabilities in a normal distribution, we follow a structured approach:

    • Suppose we want to find (P(Z ≤ 1.53)). This requires us to refer to the standard normal table, which provides cumulative probabilities for various Z-values.

    • If we need to compute (P(Z > 1.53)), we use the complementary rule:

      • (P(Z > 1.53) = 1 - P(Z ≤ 1.53)) resulting in approximately 0.0630. This means that there's a 6.30% chance of a random variable being greater than 1.53.

Probability of Intervals

  • For finding probabilities across an interval within a normal distribution, we can utilize the following formula:

    • (P(a ≤ Z ≤ b) = P(Z ≤ b) - P(Z ≤ a) = Φ(b) - Φ(a)). This calculates the probability that the Z-value falls between two points, a and b on the Z-distribution.

Distribution N(µ, σ²)

  • In the context of a normal distribution characterized by mean (µ) and variance (σ²), the process of standardization is essential for comparison purposes:

    • The transformation to the standard normal distribution is managed by the formula: (Z = \frac{X - µ}{σ}). This standardizes the value X to its Z-score.

    • In this case, the random variable (X) follows a normal distribution represented as (X ∼ N(µ, σ²)). The key attributes are:

      • Mean = µ; Variance = σ²; Standard Deviation = σ.

Practical Applications

  • Example: One practical application of these calculations is assessing the probabilities of real-world data, such as standardized test scores or heights in a population.

    • For instance, consider the SAT scores, which typically have a Mean = 500 and a Standard Deviation = 100:

      • To find the probability of scores falling between 550 and 650, we standardize and reference the z-table:

        • P(550 < X < 650) can be calculated as P(0.5 < Z < 1.5). This computation results in an approximate probability of 0.2417, signifying that around 24.17% of students achieve scores in this range.

Percentiles

  • Percentiles are crucial in statistics as they indicate the relative standing of a value within a dataset. The k-th percentile of a normal distribution is determined at:

    • (P(X ≤ x_k) = k
      ange{100}). For example, the 90th percentile represents a score that surpasses 90% of the data points in the distribution.

    • To find specific percentiles, one typically employs the inverse of the cumulative distribution function (CDF) associated with the normal distribution.

Example: Top 3% Criterion

  • To illustrate a practical application of percentiles, consider the requirement to be in the top 3% of SAT scores:

    • We need to first find the Z-value that corresponds to the 97th percentile, which typically is Z = 1.88. This value indicates that only 3% of the overall statistical population scores higher.

    • To convert this Z-value back to an actual SAT score, we employ the transformation:

      • (X = 500 + 100(1.88)). This calculation indicates that a score of 688 is necessary to be positioned in the top 3 % of test-takers.