Continuous Probability Distributions and the Normal Distribution
Comparison of Discrete and Continuous Probability Distributions
Definition of Discrete Variables (Chapter 5 Recap): - Discrete variables deal with countable outcomes. - Example: Outcomes like 1, 2, and 3 are distinct, individual points on a number line.
Definition of Continuous Variables (Chapter 6): - Continuous variables deal with uncountable outcomes. - Instead of distinct points, the outcome is a range (e.g., any real number between 1 and 3). - Example: A value could be , , or . - These variables typically arise from a measuring process rather than a counting process.
Probability Calculation for Exact Values: - Discrete Side: It is possible to calculate the probability for an exact value, such as . - Continuous Side: The probability of an exact value is always zero (). - Reasoning: Because there are an infinite number of possible outcomes (e.g., , , , ), the chance of landing on one specific, exact point is so small it is mathematically considered zero.
Probability Over Intervals: - Because exact points cannot be calculated, probabilities for continuous variables are calculated over intervals. - Example: Instead of finding , we calculate P(4.5 < X < 5.5). This serves as a close approximation for the probability of the value being roughly five.
Probability Mathematical Functions: PMF vs. PDF
Probability Mass Function (PMF): - Used for discrete distributions (Chapter 5). - Graphing: The y-axis represents the probability of () and the x-axis represents the outcome . - Representation: Probabilities are shown as the height of a line or bar at each distinct integer.
Probability Density Function (PDF): - Used for continuous distributions (Chapter 6). - Graphing: The x-axis represents , but the y-axis represents , which is a function that gives probability density. - Representation: It is a smooth, continuous function. - Crucial Metric: Probability is represented as the area under the curve between two bounds ( and ), rather than the height at a single point.
Core Laws of Probability: - Discrete Rule: The sum of all individual probabilities must equal one: . - Continuous Rule: The total area under the density curve must equal one, as this representing the entire sample space. - Integration: To find the area under the curve (the probability), one must integrate the function. For a theoretical range, integrating from negative infinity to positive infinity equals one: .
The Normal Distribution and its Parameters
Definition: The Normal Distribution is the most common continuous probability distribution.
Rationale for the Name "Normal": - It has a very predictable shape. - It is highly prevalent in real-life data collection; gathered data often naturally takes this distribution shape when plotted.
Characterizing Parameters: The distribution is uniquely defined by two metrics: - Mean (): This represents the center of the distribution. - Standard Deviation (): This represents the spread, or how much data points deviate from the mean on average.
Notation: The notation for a normal distribution is or . If you know the variance, you can find the standard deviation by taking the square root; if you have the standard deviation, you square it to find the variance.
The Normal Density Function Formula
Mathematical Form: The function that creates the specific bell-shaped curve is defined as: -
Constants within the Function: - : This is the mathematical constant approximately equal to . - : The base of the natural logarithm.
Calculus Context: To find the probability that lies between bounds and , one would technically integrate the function: - P(a < X < b) = \int_a^b \frac{1}{\sigma \sqrt{2\pi}} e^{-\frac{1}{2} (\frac{x-\mu}{\sigma})^2} \,dx - Note: This integral has no closed-form solution, meaning it cannot be solved with standard algebraic integration. Historically, it required brute-force computation or approximation.
The Standard Normal Distribution ()
Definition: A specific case of the normal distribution where the mean is zero and the standard deviation is one ().
Standardization Process: Converting an value to a value scales any normal distribution so it can be compared to the standard normal table.
Conversion Formula: -
Purpose: Tables and calculators are generally programmed based on the Standard Normal Distribution. By converting to , you can look up probabilities for any distribution regardless of its original mean or standard deviation.
Procedure for Finding Normal Probabilities
Convert to : Use the formula . This transforms the statement from the scale to the scale (e.g., P(X < 18.6) becomes P(Z < 0.12)).
Look up the Probability: Use a Standard Normal Table (-table) or a calculator. - Lower Tail Probability: Most tools default to providing the probability that is less than or equal to a value ().
Using the HP Calculator for Probabilities
Reason for Requirement: The HP calculator is necessary for this module because it stores the standard normal table results internally.
Step-by-Step Calculator Process: - Value Input: Enter the calculated value (e.g., ). - Function: Use the "Blue Shift" followed by the "3" button. Above the 3 key is the notation , which converts a value into a probability. - Adjusting Decimals: To see more precision, press "Orange Shift", then "=" (Display), then a number (e.g., 4 or 5) to set the decimal places.
Table Lookup Alternative: In a -table, you find the first part of the value (e.g., ) in the first column and the second decimal (e.g., ) in the top row. The intersection provides the probability (e.g., ).
Examples and Probability Rules
Case Study: File Download Times: - Given: is the download time in seconds (continuous). - Parameters: . - Scenario A: Finding P(X < 18.6): - Convert: . - Probability: P(Z < 0.12) = 0.5478. - Scenario B: Upper Tail Probabilities (P(X > 18.6)): - Because the total area is 1, and calculators/tables typically give the lower tail (left side), use the complement rule. - Formula: P(X > a) = 1 - P(X < a). - Calculation: 1 - P(Z < 0.12) = 1 - 0.5478 = 0.4522.
The Continuity Property (Inequalities): - In continuous distributions (Chapter 6), it does not matter if a sign is "strictly less than" (<) or "less than or equal to" (). - P(X < 18.6) = P(X \leq 18.6). - Explanation: The difference between the two is a single point (), and the probability of a single point in a continuous distribution is zero. Therefore, there is no meaningful difference. - Warning: This logic cannot be applied to discrete distributions (Chapter 5), where the difference between integers is significant.