Chapter 3: Discrete Random Variables and Probability Distributions
Discrete Random Variables & Probability Distributions
Chapter Outline
3.1 Probability Distributions and Probability Mass Functions
3.2 Cumulative Distribution Functions
3.3 Mean and Variance of a Discrete Random Variable
3.4 Discrete Uniform Distribution
3.5 Binomial Distribution
3.6 Geometric and Negative Binomial Distributions
3.7 Hypergeometric Distribution
**3.8 Poisson Distribution
Learning Objectives
Determine Probabilities: Calculate probabilities from probability mass functions and vice versa.
Cumulative Distribution Functions: Determine probabilities and probability mass functions from cumulative distribution functions and vice versa.
Calculate Means and Variances: Compute means and variances for discrete random variables.
Understand Assumptions: Grasp the assumptions underlying discrete probability distributions.
Select Appropriate Distributions: Choose the correct discrete probability distribution for probability calculations.
Calculate Probabilities and Determine Means and Variances: For common discrete probability distributions.
Probability Distributions and Mass Functions
A random variable is a function assigning a real number to each outcome in a random experiment's sample space.
The probability distribution of a random variable describes the probabilities associated with its possible values.
A discrete random variable has a probability distribution specifying the list of possible values of along with their probabilities, or expressed via a function or formula.
Example 3.1: Flash Recharge Time
Testing recharge time in three cellphone cameras.
Probability of a camera passing the test is 0.8, and cameras perform independently.
Table 3.1 shows the sample space and probabilities.
Example: Probability of first two cameras passing and third failing (ppf) is .
The random variable denotes the number of cameras passing the test.
Example 3.3: Digital Channel
A bit transmitted through a digital channel may be received in error.
Let equal the number of bits received in error in the next 4 bits transmitted.
The probability distribution of is given by possible values and their probabilities.
Probability Mass Function
For a discrete random variable with possible values , a probability mass function satisfies:
Example 3.4: Wafer Contamination
Let be the number of wafers analyzed to detect a large particle of contamination.
Probability that a wafer contains a large particle is 0.01, and wafers are independent.
Sample space: , where denotes a wafer with a large particle and denotes a wafer without.
Range of values of is
General formula: , for
Cumulative Distribution Functions
The cumulative distribution function is the probability that a random variable , with a given probability distribution, will be found at a value less than or equal to .
Symbolically,
Example 3.5
Consider the probability distribution for the digital channel example. Find the probability of three or fewer bits in error.
The event is the total of the events: , , , and .
For the probability calculation, see the shaded row in the table.
Example 3.6
Determine the probability mass function of from the following cumulative distribution function:
Note: Even if the random variable can assume only integer values, the cumulative distribution function is defined at non-integer values.
Cumulative Distribution Function and Properties
For a discrete random variable , satisfies the following properties:
If , then
Mean and Variance of a Discrete Random Variable
Used to summarize a probability distribution
Mean: measure of center or middle of the probability distribution
For a discrete random variable, a weighted average of possible values with weights equal to probabilities
Variance: measure of the dispersion, or variability in the distribution
For a discrete random variable, a weighted measure of each possible squared deviation with weights equal to probabilities
Mean or expected value:
Variance:
Standard deviation:
Example 3.7: Digital Channel
is the number of bits received in error of the next 4 transmitted. Use the table to calculate the mean & variance.
Mean:
Variance:
Expected Value of a Function of a Discrete Random Variable
If is a discrete random variable with probability mass function
The variance can be considered as an expected value of a specific function of , namely,
Example 3.9: Digital Channel
What is the expected value of the square of the number of bits in error?
Discrete Uniform Distribution
Let be a discrete random variable ranging from , for .
There are values in the inclusive interval.
Therefore,
Mean and Variance of Discrete Uniform Distribution
Mean:
Variance:
Example 3.11: Number of Voice Lines
Let the random variable denote the number of the 48 voice lines that are in use at a particular time. Assume that is a discrete uniform random variable with a range of 0 to 48. Find & .
Practical Interpretation: The average number of lines in use is 24, but the dispersion (as measured by ) is large. Therefore, at many times far more or fewer than 24 lines are used.
Binomial Distribution
The random variable that equals the number of trials that result in a success is a binomial random variable with parameters 0 < p < 1 and . The probability mass function is:
, for
For constants and , the binomial expansion is:
Example 3.14: Binomial Coefficient
Exercises in binomial coefficient calculation:
Recall:
Example 3.15a: Organic Pollution
Each sample of water has a 10% chance of containing a particular organic pollutant. Assume that the samples are independent with regard to the presence of the pollutant. Find the probability that, in the next 18 samples, exactly 2 contain the pollutant.
Answer: Let denote the number of samples that contain the pollutant in the next 18 samples analyzed. Then is a binomial random variable with and .
Example 3.15b: Organic Pollution
Determine the probability that at least 4 samples contain the pollutant.
Answer: The problem calls for calculating but is easier to calculate the complementary event, , so that:
Example 3.15c: Organic Pollution
Determine the probability that 3 ≤ X < 7.
Answer:
P(3 ≤ X < 7) = \sum_{x=3}^{6} \binom{18}{x} (0.1)^x (0.9)^{18-x} = 0.168 + 0.070 + 0.022 + 0.005 = 0.265
Binomial Mean and Variance
If is a binomial random variable with parameters and ,
The mean of is:
The variance of is:
These quantities are derived by summing Bernoulli random variables and using the definitions of the mean and variance of discrete random variables.
Example 3.16: Binomial Mean and Variance
For the number of transmitted bits received in error in Example 3.13, and . Find the mean and variance of the binomial random variable.
Answer:
Geometric Distribution
Binomial distribution has
Fixed number of trials
Random number of successes
Geometric distribution has reversed roles
Random number of trials
Fixed number of successes, in this case 1
, for
Example 3.18: Wafer Contamination
The probability that a wafer contains a large particle of contamination is 0.01. Assume that the wafers are independent. What is the probability that exactly 125 wafers need to be analyzed before a particle is detected?
Let denote the number of samples analyzed until a large particle is detected. Then is a geometric random variable with parameter .
Geometric Mean and Variance
Mean:
Variance:
Example 3.19: Mean and Standard Deviation
Consider the transmission of bits in Example 3.17. The probability that a bit transmitted through a digital transmission channel is received in error is . Assume that the transmissions are independent events, and let the random variable denote the number of bits transmitted until the first error. Find the mean and standard deviation.
Mean:
Variance:
Standard deviation:
Lack of Memory Property
For a geometric random variable, the trials are independent
Count of the number of trials until the next success can be started at any trial without changing the probability distribution of the random variable.
Implication: the system presumably will not wear out.
For all transmissions the probability of an error remains constant.
Hence, the geometric distribution is said to lack any memory.
Example 3.20: Lack of Memory Property
In Example 3.17, the probability that a bit is transmitted in error is . Suppose 50 bits have been transmitted. What is the mean number of bits transmitted until the next error?
The mean number of bits transmitted until the next error, after 50 bits have already been transmitted, is , the same result as the mean number of bits until the first error.
Negative Binomial Distribution
A generalization of a geometric distribution in which the random variable is the number of Bernoulli trials required to obtain successes
, for
Mean & Variance of Negative Binomial
Mean:
Variance:
Example 3.22: Camera Flashes
The probability that a camera passes a particular test is 0.8, and the cameras perform independently. What is the probability that the third failure is obtained in five or fewer tests?
Let denote the number of cameras tested until three failures have been obtained. The requested probability is . Here has a negative binomial distribution with and .
Hypergeometric Distribution
Samples are selected from a finite population without replacement
, for
Example 3.23: Sampling without replacement
A day’s production of 850 manufactured parts contains 50 parts that do not conform to customer requirements Two parts are selected at random without replacement from the day’s production. Let A and B denote the events that the first and second parts are nonconforming, respectively. What is the probability that both parts conform, one part does not conform, and both parts do not conform?
Let denote the number of parts that do not conform. Therefore,
Example 3.24a: Parts from Suppliers
A batch of parts contains 100 parts from a local supplier of circuit boards and 200 parts from a supplier in the next state. If 4 parts are selected randomly, without replacement, what is the probability that they are all from the local supplier?
Example 3.24b: Parts from Suppliers
What is the probability that two or more parts in the sample are from the local supplier?
Example 3.24c: Parts from Suppliers
What is the probability that at least one part in the sample is from the local supplier?
Hypergeometric Mean & Variance
Mean:
Variance:
Poisson Distribution
The random variable that equals the number of events in a Poisson process is a Poisson random variable with parameter 0 < \lambda, and
for
Example 3.27a: Wire Flaws
Flaws occur at random along the length of a thin copper wire. Let denote the random variable that counts the number of flaws in a length of mm of wire and suppose that the average number of flaws is 2.3 per mm. Find the probability of exactly 10 flaws in 5 mm of wire.
Let denote the number of flaws in 5 mm of wire. Then has the Poisson distribution with
Example 3.27b: Wire Flaws
Find the probability of at least 1 flaw in 2 mm of wire.
Let denote the number of flaws in 2 mm of wire. Then has the Poisson distribution with
Poisson Mean & Variance
The mean and variance of the Poisson model are the same.
For example, if particle counts follow a Poisson distribution with a mean of 25 particles per square centimeter, the variance is also 25 and the standard deviation of the counts is 5 per square centimeter.
If the variance of a data is much greater than the mean, then the Poisson distribution would not be a good model for the distribution of the random variable.