Continuous Probability Distributions and the Normal Distribution

Comparison of Discrete and Continuous Probability Distributions

  • Definition of Discrete Variables (Chapter 5 Recap):   - Discrete variables deal with countable outcomes.   - Example: Outcomes like 1, 2, and 3 are distinct, individual points on a number line.

  • Definition of Continuous Variables (Chapter 6):   - Continuous variables deal with uncountable outcomes.   - Instead of distinct points, the outcome is a range (e.g., any real number between 1 and 3).   - Example: A value could be 1.000000011.00000001, 1.000000021.00000002, or 2.99999992.9999999.   - These variables typically arise from a measuring process rather than a counting process.

  • Probability Calculation for Exact Values:   - Discrete Side: It is possible to calculate the probability for an exact value, such as P(X=5)P(X = 5).   - Continuous Side: The probability of an exact value is always zero (P(X=5)=0P(X = 5) = 0).   - Reasoning: Because there are an infinite number of possible outcomes (e.g., 5.15.1, 5.25.2, 4.94.9, 4.994.99), the chance of landing on one specific, exact point is so small it is mathematically considered zero.

  • Probability Over Intervals:   - Because exact points cannot be calculated, probabilities for continuous variables are calculated over intervals.   - Example: Instead of finding P(X=5)P(X = 5), we calculate P(4.5 < X < 5.5). This serves as a close approximation for the probability of the value being roughly five.

Probability Mathematical Functions: PMF vs. PDF

  • Probability Mass Function (PMF):   - Used for discrete distributions (Chapter 5).   - Graphing: The y-axis represents the probability of xx (P(X)P(X)) and the x-axis represents the outcome xx.   - Representation: Probabilities are shown as the height of a line or bar at each distinct integer.

  • Probability Density Function (PDF):   - Used for continuous distributions (Chapter 6).   - Graphing: The x-axis represents xx, but the y-axis represents f(x)f(x), which is a function that gives probability density.   - Representation: It is a smooth, continuous function.   - Crucial Metric: Probability is represented as the area under the curve between two bounds (aa and bb), rather than the height at a single point.

  • Core Laws of Probability:   - Discrete Rule: The sum of all individual probabilities must equal one: P(x)=1\sum P(x) = 1.   - Continuous Rule: The total area under the density curve must equal one, as this representing the entire sample space.   - Integration: To find the area under the curve (the probability), one must integrate the function. For a theoretical range, integrating from negative infinity to positive infinity equals one: f(x)dx=1\int_{-\infty}^{\infty} f(x)\,dx = 1.

The Normal Distribution and its Parameters

  • Definition: The Normal Distribution is the most common continuous probability distribution.

  • Rationale for the Name "Normal":   - It has a very predictable shape.   - It is highly prevalent in real-life data collection; gathered data often naturally takes this distribution shape when plotted.

  • Characterizing Parameters: The distribution is uniquely defined by two metrics:   - Mean (μ\mu): This represents the center of the distribution.   - Standard Deviation (σ\sigma): This represents the spread, or how much data points deviate from the mean on average.

  • Notation: The notation for a normal distribution is XN(μ,σ)X \sim N(\mu, \sigma) or XN(μ,σ2)X \sim N(\mu, \sigma^2). If you know the variance, you can find the standard deviation by taking the square root; if you have the standard deviation, you square it to find the variance.

The Normal Density Function Formula

  • Mathematical Form: The function f(x)f(x) that creates the specific bell-shaped curve is defined as:   - f(x)=1σ2πe12(xμσ)2f(x) = \frac{1}{\sigma \sqrt{2\pi}} e^{-\frac{1}{2} (\frac{x-\mu}{\sigma})^2}

  • Constants within the Function:   - π\pi: This is the mathematical constant approximately equal to 3.143.14.   - ee: The base of the natural logarithm.

  • Calculus Context: To find the probability that xx lies between bounds aa and bb, one would technically integrate the function:   - P(a < X < b) = \int_a^b \frac{1}{\sigma \sqrt{2\pi}} e^{-\frac{1}{2} (\frac{x-\mu}{\sigma})^2} \,dx   - Note: This integral has no closed-form solution, meaning it cannot be solved with standard algebraic integration. Historically, it required brute-force computation or approximation.

The Standard Normal Distribution (ZZ)

  • Definition: A specific case of the normal distribution where the mean is zero and the standard deviation is one (μ=0,σ=1\mu = 0, \sigma = 1).

  • Standardization Process: Converting an xx value to a zz value scales any normal distribution so it can be compared to the standard normal table.

  • Conversion Formula:   - Z=xμσZ = \frac{x - \mu}{\sigma}

  • Purpose: Tables and calculators are generally programmed based on the Standard Normal Distribution. By converting xx to zz, you can look up probabilities for any distribution regardless of its original mean or standard deviation.

Procedure for Finding Normal Probabilities

  1. Convert XX to ZZ: Use the formula Z=xμσZ = \frac{x - \mu}{\sigma}. This transforms the statement from the XX scale to the ZZ scale (e.g., P(X < 18.6) becomes P(Z < 0.12)).

  2. Look up the Probability: Use a Standard Normal Table (ZZ-table) or a calculator.     - Lower Tail Probability: Most tools default to providing the probability that ZZ is less than or equal to a value (P(Zz)P(Z \leq z)).

Using the HP Calculator for Probabilities

  • Reason for Requirement: The HP calculator is necessary for this module because it stores the standard normal table results internally.

  • Step-by-Step Calculator Process:   - Value Input: Enter the calculated zz value (e.g., 0.120.12).   - Function: Use the "Blue Shift" followed by the "3" button. Above the 3 key is the notation ZPZ \rightarrow P, which converts a zz value into a probability.   - Adjusting Decimals: To see more precision, press "Orange Shift", then "=" (Display), then a number (e.g., 4 or 5) to set the decimal places.

  • Table Lookup Alternative: In a zz-table, you find the first part of the value (e.g., 0.10.1) in the first column and the second decimal (e.g., 0.020.02) in the top row. The intersection provides the probability (e.g., 0.54780.5478).

Examples and Probability Rules

  • Case Study: File Download Times:   - Given: xx is the download time in seconds (continuous).   - Parameters: XN(μ=18,σ=5)X \sim N(\mu=18, \sigma=5).   - Scenario A: Finding P(X < 18.6):     - Convert: Z=18.6185=0.12Z = \frac{18.6 - 18}{5} = 0.12.     - Probability: P(Z < 0.12) = 0.5478.   - Scenario B: Upper Tail Probabilities (P(X > 18.6)):     - Because the total area is 1, and calculators/tables typically give the lower tail (left side), use the complement rule.     - Formula: P(X > a) = 1 - P(X < a).     - Calculation: 1 - P(Z < 0.12) = 1 - 0.5478 = 0.4522.

  • The Continuity Property (Inequalities):   - In continuous distributions (Chapter 6), it does not matter if a sign is "strictly less than" (<) or "less than or equal to" (\leq).   - P(X < 18.6) = P(X \leq 18.6).   - Explanation: The difference between the two is a single point (18.618.6), and the probability of a single point in a continuous distribution is zero. Therefore, there is no meaningful difference.   - Warning: This logic cannot be applied to discrete distributions (Chapter 5), where the difference between integers is significant.