Introduction to the Binomial Distribution and Discrete Probability Models
Overview of Discrete Probability Distributions
The study of discrete probability distributions focuses on specific models that define probabilities for random variables.
While there are many types of discrete distributions, this module focuses on three primary ones:
Binomial Distribution: Used to define probabilities for independent trials.
Poisson Distribution: Used to define probabilities for events occurring over a specific period of time.
Hypergeometric Distribution: Used to define probabilities for dependent trials (where trials are not independent).
All discrete probability distributions must obey two fundamental rules:
The sum of probabilities for all unique events in the sample space must equal 1 ().
Expected value and variance can be calculated for these distributions. While standard methods apply, specific distributions often have mathematical shortcuts for these calculations.
Characteristics of the Binomial Distribution
To identify if a situation follows a binomial distribution, it must meet the following criteria:
Fixed Number of Observations (): There is a set number of trials or observations, denoted by the parameter .
Binary Outcomes: Each trial results in one of only two possible outcomes, classified as a "success" or a "failure."
These outcomes must be mutually exclusive (cannot happen at the same time) and collectively exhaustive (nothing else can happen).
"Success" is defined as the occurrence of the specific event of interest; it does not necessarily imply a positive qualitative outcome.
Constant Probability of Success (): The probability of success, denoted by the symbol (not to be confused with the mathematical constant ), remains the same from one trial to the next.
Probability of Failure: Since there are only two outcomes, the probability of failure is always equal to .
Independence: Each observation or trial is independent of all others. The outcome of one trial has no effect on the outcome of previous or future trials.
Counter-Examples (Dependency):
Weather: If it rained yesterday, there is statistically a higher chance it will rain today. Historical patterns affect future outcomes.
Stock Prices: Historical fluctuations often have some impact on future price movements, indicating a level of dependency.
Application Examples of the Binomial Distribution
The binomial distribution is applicable in various fields where outcomes are binary:
Manufacturing: Labeled items are classified as either "defective" or "acceptable."
Contract Bidding: A firm bidding for a contract either "receives the contract" or "does not receive the contract."
Marketing Research: Survey responses are categorized as "Yes, I will buy" or "No, I will not buy."
Human Resources: Job applicants either "accept the offer" or "reject the offer."
Counting Rule 5: Combinations
Counting Rule 5 is essential for determining the number of ways a specific number of successes can occur within a set number of trials.
Conceptual Example: In tosses of a coin, how many ways can you obtain exactly heads?
Way 1: Heads, Heads, Tails (HHT)
Way 2: Heads, Tails, Heads (HTH)
Way 3: Tails, Heads, Heads (THH)
There are exactly 3 ways to achieve 2 heads in 3 tosses.
The Combination Formula:
Notation: In binomial contexts, the notation (long brackets with over ) is frequently used as shorthand for combinations (). It is not a fraction and does not use a dividing line.
The Binomial Probability Distribution Formula
The binomial formula calculates the probability that a random variable will result in exactly successes, given the parameters and .
Formula Structure:
Component Breakdown:
: Number of trials.
: Constant probability of success.
: The number of successes of interest.
: The total number of ways to arrange successes in trials.
: The probability of the successes occurring.
: The probability of the failures occurring ( is the number of failures).
Parameters: The values and are referred to as parameters because they define the specific nature of the distribution.
Numerical Examples
Example 1: Fair Coin Tosses
Scenario: A fair coin is flipped times. Success is defined as landing on heads.
Parameters: , .
Sample Space for successes (): . Note that cannot exceed .
Calculation for :
Example 2: Low Probability Event
Scenario: Five observations () with a probability of success .
Calculation for :
Expanding the logic: This accounts for 1 success () and 4 failures () multiplied by the 5 different positions the success could occupy.
Result:
Calculation for :
Result in scientific notation: (or ).
Intuition: If an event is very unlikely to happen (), the chance of it happening 5 times in a row is extremely low.
Distribution Behavior and Excel Implementation
The shape of a binomial distribution is directly influenced by its parameters, particularly the probability of success ().
Excel Function
The function used is
=BINOM.DIST(x, n, pi, cumulative).x: Number of successes.n: Number of trials.pi: Probability of success.cumulative: A boolean value. For individual probabilities, usefalse(or 0).
Impact of on Distribution Shape
Low (e.g., ): The distribution is skewed to the right. The highest probability occurs at low values of (e.g., zero or one success).
Moderate (e.g., ): The distribution is symmetrical and balanced. The probability mass is centered around the middle of the sample space.
High (e.g., ): The distribution is skewed to the left. High probabilities are associated with a high number of successes.
Calculator Tips
On most Casio calculators, the combination function () is often accessed by pressing the
SHIFTkey followed by the÷(division) key.While the HP calculator can perform these calculations, it may be less intuitive as it does not display the expression as clearly as modern Casio models.