The Normal Probability Distribution

Chapter 7: The Normal Probability Distribution

7.1 Properties of the Normal Distribution

Learning Objectives

  • Objective 1: Use the uniform probability distribution.
  • Objective 2: Graph a normal curve.
  • Objective 3: State the properties of the normal curve.
  • Objective 4: Explain the role of area in the normal density function.

Objective 1: Use the Uniform Probability Distribution

  • Example: Imagine a scenario where a friend is always late. Let the random variable X represent the time from the scheduled meeting until your friend arrives. This random variable can take on values from 0 minutes (on time) to 30 minutes (maximum lateness).
    • The likelihood of being late is consistent across the range; thus, each interval within 0 to 30 minutes is equally likely.
    • Conclusion: The random variable X follows a uniform probability distribution, meaning it has equal probability across the stated range.

Properties of a Probability Density Function (pdf)

  • A probability density function (pdf) is used for continuous random variables and needs to meet two essential conditions:
    1. The total area under the graph of the function must equal 1.
    2. The graph's height must remain above or equal to 0 for all values of the random variable.

Graphical Representation of the Uniform Distribution

  • Illustration in the form of a rectangle demonstrating the uniform distribution shows:
    • Width of rectangle = 30 (the total time range).
    • Height is calculated to ensure Area = Height × Width = 1.

Area Representing Probability

  • The area under the pdf curve between two values indicates the probability of the random variable falling within that interval.

    • Example Probabilities:
    • (a) Probability friend is between 10 and 20 minutes late - area represented by a specific shaded rectangle.
    • Width of this rectangle is 10 minutes with height  (calculated from pdf).
    • Total area for this rectangle = Width × Height corresponds to the probability.
  • (b) Probability within a certain time frame represented by shading - if it's stated as 0.2 area, the corresponding width needs to be calculated to find out the specific time (6 minutes).

Objective 2: Graph a Normal Curve

Model Representation

  • A model in mathematics, for continuous random variables, is represented by a normal curve. The red curve in visual representation indicates the normal distribution characteristics.

Continuous Random Variable and Normal Distribution

  • A continuous random variable is considered normally distributed if its relative frequency histogram approximates a normal curve's shape.
  • Inflection points in the normal curve exist at:
    • Points: μ - σ (mean minus standard deviation) and μ + σ (mean plus standard deviation).

Effects of Mean and Standard Deviation on the Normal Curve

  • Graphical changes when modifying mean (μ) or standard deviation (σ):
    • (a) Increasing mean shifts the curve rightward while preserving its shape.
    • (b) Increasing standard deviation flattens and spreads the curve out without changing the center.

Objective 3: State the Properties of the Normal Curve

  1. The curve is symmetric about its mean (μ).
  2. Mean, median, and mode are identical, implying a single peak where the highest point is located at x = μ.
  3. The normal curve has inflection points located at μ - σ and μ + σ.
  4. The total area under the curve equals 1.
  5. The area to the right of μ is equal to the area to the left below it; both equal 0.5.
  6. As x approaches positive or negative infinity, the graph gets infinitesimally close to the horizontal axis but never touches it.
  7. Empirical Rule:
    • Approximately 68% of the area under the curve is found between μ - σ and μ + σ.
    • Approximately 95% is found between μ - 2σ and μ + 2σ.
    • Approximately 99.7% is located between μ - 3σ and μ + 3σ.
    • This is often illustrated using specific areas such as 0.15%, 2.35%, and 34% to explain the distribution's behavior statistically.

Objective 4: Explain the Role of Area in the Normal Density Function

  • Clinical statistics were utilized, presenting data from a pediatrician's three-year-old female patients. The mean height and standard deviation were computed.
  • Example Analysis:
    • Drawing histograms and normal curves for the relative frequency of heights shows how the normal curve can effectively summarize the characteristics of the heights of children.
    • By comparing areas between the ranges gives us insights into the proportion of children within those intervals, both visually (through shading) and numerically (via area calculations).
    • The areas under the curve thus can be used to approximate population proportions of various criteria (example: height ranges).

Probability Density Function for Normal Distribution

  • The mathematical form for the normal probability density function is stated as follows:

(f(x) = rac{1}{eta imes ext{sqrt}(2 ext{π})} e^{- rac{(x - ext{m})^2}{2eta^2}})

  • Where m is the mean, and β is the standard deviation.

Area Under Normal Distribution Curve Characteristics

  • Assumed data from public health information regarding cholesterol levels can also be illustrated using normal curves, aiding understanding of proportions of populations exceeding specific thresholds.
    • This dependence on area interpretations facilitates understanding typical and atypical results in statistics as it quantifies probable values across various ranges.

Additional Concepts and Examples

  • Real-life examples exploring heights and cholesterol levels emphasize the significance of utilizing normal approximations in statistical data analysis.
  • Continuous distributions like normal distributions tend to be used as tuning parameters to gauge the likelihood of specific outcomes within statistics, revealing broader patterns and implications for larger data sets through sophisticated graphical analyses.

7.2 Applications of the Normal Distribution

Learning Objectives

  • Objective 1: Find and interpret the area under a normal curve.
  • Objective 2: Find the value of a normal random variable.

Objective 1: Find and Interpret the Area under a Normal Curve

  • To standardize a normal random variable defined by its mean μ and standard deviation σ, we convert:
    Z = rac{X - ext{μ}}{ ext{σ}}

  • Standard Normal Distribution: If variable X is transformed, Z will then reflect standard normal distribution conditions with μ = 0 and σ = 1. Thus allowing probabilities tied to Z values to be analyzed more uniformly using respective statistical software.

Standard Normal Curve Table and IQ Scores

  • Example illustrating IQ scores typically assume normalization about $ ext{μ} = 100$ and $ ext{σ} = 15$. Consideration of the z-score 120 results in a metric of 1.33 standard deviations above average.
  • Utilization of the Composite Methodology through computing the areas each side of the normalized z-value facilitates observation of population behaviors relative to average intelligence distributions.

Area Calculations and Complement Rule Example

  • Demonstrating the use of the Complement Rule: ext{Area} = 1 - 0.9082 = 0.0918
    • Indicating the probability associated with the right-tail area divergence from inputs reflecting z= 1.33.

Assessing Proportions Based on Known Distributions

  • Parameters such as mean height or cholesterol levels demonstrate calculative usage within normal distribution to calculate probabilities against various examined points. In summation, total partition proportions against known demographics inspecting the occurrence probability in terms of percentile ranks.

7.3 Assessing Normality

Learning Objectives

  • Objective: Use normal probability plots to assess normality.

Normal Probability Plots: Introduction and Importance

  • A normal probability plot signifies an essential graph plotting observed data against normal scores, which themselves serve as predictive z-scores.
  • Principal processes regarding drawing the normal probability plot entail sorting observed values, computing for expected score allocations, and matching plotted visuals to assess alignment with the ideal curve representation indicative of a normal distribution trend.

Interpretation and Correlation Conclusions

  • The resulting linear correlation coefficient is significant for determining if the data approximates a normal distribution. The critical correlation value to escalate assessments quantifies the overall linear association between observed measurements and expected z-scores recognizing formulated data samples.

7.4 The Normal Approximation to the Binomial Probability Distribution

Learning Objectives

  • Objective: Approximate binomial probabilities using the normal distribution.

Characteristics of a Binomial Probability Experiment

  1. Conducted over n independent trials.
  2. Definable outcomes categorized as success or failure.
  3. Probability of success remains constant across trials, denoting the parameters of binomial distribution analysis.

Use of Normal Approximation Criteria

  • Binomial probability distributions become relatively symmetric and bell-shaped as the number of trials approaches larger sequences when accounting for:
    np(1 - p) ext{ must exceed } 10
  • Lastly, deployment and calculation concepts surrounding the normal approximation of binomial distributions outline methods to better manage computational complexity in statistics and empirical data assessments.