Probability Functions for Continuous Random Variables and Special Distributions

Characterization of Continuous Random Variables

A continuous random variable (CRV) is defined as a variable capable of assuming infinitely many values within a given range. Unlike discrete variables, where data consists of distinct, countable points, CRV data represents measurements on a continuous scale. Standard examples include physical quantities such as pressure, height, mass, weight, density, volume, temperature, and distance.

A key conceptual distinction for CRVs is the probability of a single point. Consider a student waiting for a bus that arrives every 3030 minutes. While the variable XX (waiting time) can technically take the value of exactly 7.2119167.211916 minutes, the probability of an exact occurrence is mathematically zero (P(X=7.211916)=0P(X = 7.211916) = 0). Consequently, in continuous probability, we do not focus on the probability of a variable assuming a single particular value; instead, we determine the probability that the variable falls within a specific interval, such as P(5X10)P(5 \le X \le 10).

The Probability Density Function (PDF)

The probability distribution of a CRV is defined by a density function denoted as f(x)f(x). This function assigns probabilities to intervals of decimal numbers. Geometrically, the probability that XX assumes a value in the interval [a,b][a, b] is equal to the area under the curve y=f(x)y = f(x), bounded below by the xx-axis and on the sides by the vertical lines x=ax = a and x=bx = b.

For a function to qualify as a valid Probability Density Function (PDF), it must satisfy several rigorous conditions:

  1. Non-negativity: f(x)0f(x) \ge 0 for all values within the range of XX.

  2. Continuity: The function must be piecewise continuous.

  3. Normalization: The total area under the entire curve must equal 11, expressed as f(x)dx=1\int_{-\infty}^{\infty} f(x) dx = 1.

  4. Interval Probability: The probability of XX being between aa and bb is calculated via integration: P(aXb)=abf(x)dxP(a \le X \le b) = \int_{a}^{b} f(x) dx.

An important property to note is that for any CRV, the inclusion of endpoints does not change the probability. That is: P(a \le X \le b) = P(a < X \le b) = P(a \le X < b) = P(a < X < b).

Analytical Example: Flight Delay Distributions

Consider a scenario where XX represents the delay in hours of a PIA flight from the UK, modeled by the function: f(x)=0.20.02Xf(x) = 0.2 - 0.02X for 0x100 \le x \le 10

To verify if this is a valid PDF, we calculate the area under the function. Geometrically, at x=0x = 0, f(0)=0.2f(0) = 0.2, and at x=10x = 10, f(10)=0f(10) = 0. This forms a triangle. The area of this triangle is given by: Area=12×base×height=12×10×0.2=1.0\text{Area} = \frac{1}{2} \times \text{base} \times \text{height} = \frac{1}{2} \times 10 \times 0.2 = 1.0 Calculus verification confirms this: 010(0.20.02x)dx=[0.2x0.01x2]010=(21)0=1\int_{0}^{10} (0.2 - 0.02x) dx = [0.2x - 0.01x^2]_0^{10} = (2 - 1) - 0 = 1

Specific probability calculations using this PDF include:

  1. Delay less than 44 hours (P(X < 4)): Geometrically, this forms a trapezium with heights f(0)=0.2f(0) = 0.2 and f(4)=0.12f(4) = 0.12. Area=12(0.2+0.12)×4=0.64\text{Area} = \frac{1}{2}(0.2 + 0.12) \times 4 = 0.64

  2. Delay between 22 and 66 hours (P(2X6)P(2 \le X \le 6)): heights are f(2)=0.16f(2) = 0.16 and f(6)=0.08f(6) = 0.08. Area=12(0.16+0.08)×4=0.48\text{Area} = \frac{1}{2}(0.16 + 0.08) \times 4 = 0.48

  3. Delay more than 66 hours (P(X > 6)): heights are f(6)=0.08f(6) = 0.08 and f(10)=0f(10) = 0. Area=12(0.08+0)×4=0.16\text{Area} = \frac{1}{2}(0.08 + 0) \times 4 = 0.16

Advanced Density Function Examples

Consider the function f(x)=136x(6x)f(x) = \frac{1}{36}x(6 - x) for the interval 0x60 \le x \le 6. Integration is required to find specific probabilities:

  • For P(X5)P(X \ge 5), the integral is evaluated from 55 to 66: 56136(6xx2)dx=136[3x2x33]560.0741\int_{5}^{6} \frac{1}{36}(6x - x^2) dx = \frac{1}{36} [ 3x^2 - \frac{x^3}{3} ]_5^6 \approx 0.0741

  • For P(X3)P(X \le 3), the integral is evaluated from 00 to 33: 136[189]=0.5\frac{1}{36} [ 18 - 9 ] = 0.5

  • For P(3X5)P(3 \le X \le 5), the integral evaluates to 0.42590.4259.

In another example, find the constant kk for f(x)=kx2f(x) = kx^2 where 0x40 \le x \le 4. Applying the normalization property: 04kx2dx=1kx3304=1k643=1k=364\int_{0}^{4} kx^2 dx = 1 \rightarrow k \frac{x^3}{3} |_0^4 = 1 \rightarrow k \frac{64}{3} = 1 \rightarrow k = \frac{3}{64} Using this result, conditional probability can be assessed. For sets A={x:x2}A = \{x : x \ge 2\} and B={x:x3.5}B = \{x : x \le 3.5\}, find P(AB)P(A|B). First, calculate P(AB)=P(2X3.5)P(A \cap B) = P(2 \le X \le 3.5): 23.53x264dx=3.532364=0.5449\int_{2}^{3.5} \frac{3x^2}{64} dx = \frac{3.5^3 - 2^3}{64} = 0.5449 Then calculate P(B)=P(0X3.5)P(B) = P(0 \le X \le 3.5): 03.53x264dx=3.5364=0.6699\int_{0}^{3.5} \frac{3x^2}{64} dx = \frac{3.5^3}{64} = 0.6699 P(AB)=0.54490.6699=0.8134P(A|B) = \frac{0.5449}{0.6699} = 0.8134

The Exponential Distribution

The Exponential distribution is essential for modeling waiting times until specific events occur, assuming events happen independently at a constant average rate. Common applications include the time until a radioactive particle decays, time until hardware failure, or the arrival time of customers in a queue.

The PDF is defined as: f(x)=λeλxf(x) = \lambda e^{-\lambda x} for x0x \ge 0 Where λ\lambda represents the rate parameter, derived from the mean μ\mu as λ=1μ\lambda = \frac{1}{\mu}.

The Cumulative Distribution Function (CDF) is derived as: F(x)=P(Xx)=1eλxF(x) = P(X \le x) = 1 - e^{-\lambda x} Calculations follow these patterns:

  • Right-tail probability: P(X > x) = e^{-\lambda x}

  • Interval probability: P(aXb)=eλaeλbP(a \le X \le b) = e^{-\lambda a} - e^{-\lambda b}

If at a shopping mall, customers enter at an average of 22 per minute (μ=2\mu = 2), then λ=0.5\lambda = 0.5.

  • The probability a customer enters in less than one minute: P(X < 1) = 1 - e^{-0.5 \times 1} = 0.3935

  • The probability a customer enters after 22 minutes: P(X > 2) = e^{-0.5 \times 2} = 0.3679

The Memoryless Property of Exponential Distributions

A unique characteristic of the exponential distribution is the memoryless property. It posits that the probability of an event occurring in the future is independent of how much time has already elapsed. For instance, if a machine part has already lasted SS years, the probability it lasts an additional tt years is the same as the probability a brand-new part lasts tt years: P(X \ge S + t | X > S) = P(X > t)

As an application, if repair equipment has an average repair rate of 22 machines per hour (λ=0.5\lambda = 0.5), the probability that a repair takes at least 1010 hours given it has already exceeded 99 hours is calculated as: P(X \ge 10 | X > 9) = P(X > 1) = e^{-0.5 \times 1} = 0.6065

The Normal Probability Distribution

The Normal distribution is the most prominent continuous distribution, characterized by its symmetric bell-shaped curve centered around the mean (μ\mu). High-density data clusters near the mean, and the probability decreases as values move further away. The tails of the curve are asymptotic, meaning they approach the xx-axis but never intersect it.

Key properties include:

  • Symmetry: The left and right sides are mirror images.

  • Distribution: Exactly 50%50\% of values lie above the mean and 50%50\% below.

  • Parameters: Defined by the mean (μ\mu) and variance (σ2\sigma^2).

  • The PDF formula is: f(x)=1σ2πe12(xμσ)2f(x) = \frac{1}{\sigma \sqrt{2\pi}} e^{-\frac{1}{2}(\frac{x-\mu}{\sigma})^2}

Standardization allows us to transform any Normal distribution into the Standard Normal distribution (ZZ), where μ=0\mu = 0 and σ=1\sigma = 1. The transformation formula is: Z=XμσZ = \frac{X - \mu}{\sigma}

Normal Distribution Applications and Problem Solving

Practical applications of the Normal distribution range from measuring human height to machine error rates.

Example: Soft Drink Machine Regulation A machine fills cups with an average of 200ml200\,ml and a standard deviation of 15ml15\,ml.

  • Fraction of cups with less than 224ml224\,ml: Z = \frac{224 - 200}{15} = 1.6 \rightarrow P(Z < 1.6) = 0.9452

  • Probability between 191ml191\,ml and 209ml209\,ml: Z_1 = \frac{191-200}{15} = -0.6, Z_2 = \frac{209-200}{15} = 0.6 \rightarrow P(-0.6 < Z < 0.6) = P(0.6) - P(-0.6) = 0.7257 - 0.2743 = 0.4514

  • Overflow probability for 230ml230\,ml cups (P(X > 230)): Z = \frac{230 - 200}{15} = 2 \rightarrow P(Z > 2) = 0.0228. In 10001000 drinks, roughly 2323 cups will overflow.

Example: Manufacturing Bolts Bolts have a mean diameter of 0.25inches0.25\,inches and σ=0.02\sigma = 0.02. Defective bolts are those with diameter < 0.20 or > 0.28.

  • P(X < 0.20) = Z(-2.5) = 0.0062

  • P(X > 0.28) = Z(1.5) = 0.0669

  • Total defective probability = 0.07310.0731. Out of 10,00010,000 bolts, 731731 are defective.

Normal Approximation to the Binomial Distribution

The Binomial distribution can be approximated by the Normal distribution provided the sample size is large enough. The criteria for this are np > 5 and nq > 5. When using this approximation, a "continuity correction" is applied because we are using a continuous distribution to estimate discrete outcomes. This involves adjusting the discrete value by 0.50.5.

In a coin toss experiment (200200 times, p=0.5p = 0.5):

  • Mean: μ=np=100\mu = np = 100

  • Variance: σ2=npq=50σ=507.07\sigma^2 = npq = 50 \rightarrow \sigma = \sqrt{50} \approx 7.07

  • Probability of at least 9595 heads: P(X94.5)P(X \ge 94.5) using correction. Z=94.51007.07=0.7778P(Z0.7778)=0.7811Z = \frac{94.5 - 100}{7.07} = -0.7778 \rightarrow P(Z \ge -0.7778) = 0.7811

For a factory producing 2%2\% defective bolts, in a box of 10,00010,000:

  • Mean: μ=200\mu = 200, Variance: σ2=196σ=14\sigma^2 = 196 \rightarrow \sigma = 14

  • Probability of at most 190190 defective bolts: P(X190.5)P(X \le 190.5) Z=190.520014=0.6786P(Z0.6786)=0.2487Z = \frac{190.5 - 200}{14} = -0.6786 \rightarrow P(Z \le -0.6786) = 0.2487