Probability Distributions and Sampling Distributions Notes
Probability Distributions
Probability Density Function (pdf)
- Describes likelihood of continuous outcomes for random variables
- Must sum to 1
- Calculable for events
- Notation: f(x)
Probability Mass Function (pmf)
- Describes likelihoods for discrete random variables
- Masses sum to one
- Notation: p(x)
Cumulative Probability Functions
Cumulative Distribution Function (cdf)
- Describes probability of all outcomes ≤ specific outcome
- Shown as F(x) or P(X ≤ x)
- Increasing in nature
- Used for probabilities less than, greater than, or between values
Binomial Distribution
Models two mutually exclusive outcomes (e.g., success vs. failure)
Key parameters:
- n = number of trials
- p = probability of success
Trials assumed independent with constant p
Applications: quality control, customer satisfaction, loan defaults
Example: Coin toss to find the probability of getting 3 heads in 10 trials:
- P(X=3) = (10 choose 3) * (0.5)^3 * (0.5)^7 = 0.117188
Example of Customer Shipment Acceptance
A shipment is acceptable if ≤10% defects.
If sampling 10 products (n=10, p=0.1), probability of rejection (≥2 defects) is 26.4%
Reducing the defect rate to 5% lowers rejection probability to 8.6%
Normal Distributions
Used to model continuous variables
Defined by mean (μ) and standard deviation (σ)
Symmetrical shape with total area = 1
Examples: Demand management with normal distribution of units sold
Key applications include calculating probabilities of being below/above specific thresholds
Excel Tools for Normal Distribution
NORM.DIST function used to find cumulative probability
- P(X ≤ x) or P(X < x) calculations
NORM.INV function for finding specific demand levels corresponding to a probability
Examples of demand calculations based on provided μ and σ values
Standard Normal Distribution
Z-distribution with μ = 0 and σ = 1
Appears on the z-axis indicating standard deviations from mean
Z-scores facilitate comparison among different distributions
Central Limit Theorem (CLT)
As sample size increases (n > 30), sample means approach a normal distribution regardless of parent distribution
Assumptions include independence, identical distribution, and finite mean/variance
Sampling Distributions
Enables analysis of a sample to infer properties about a population
Importance of random sampling to minimize bias and ensure representativeness
Waiting Times in Customer Service
Average wait times affect customer satisfaction; expectations around 5-10 minutes
Statistical tools can help understand probability distributions of wait times
Calculating Waiting Times
Example: Average service time = 3.6 minutes; SD = 1 minute.
- Probability (x > 5) = 1 - NORM.DIST(5, 3.6, 1, TRUE) = 0.0808
- 8% chance of waiting over 5 minutes