Continuous Random Variables: PDFs, CDFs, Expected Value, and Variance
Continuous Random Variables and Probability Density Functions (PDFs)
Definition of a Continuous Random Variable: A random variable that can take on any value within a given range in the real number system, often an interval [a,b] (where a or b can be infinite).
Probability Density Function (PDF), denoted fX(x):
It's analogous to density in physics (e.g., mass density). If you integrate the density of an object, you get its total mass. In probability, integrating a PDF gives the total probability.
Properties:
The PDF must be non-negative: fX(x)≥0 for all x.
The total area under the curve of the PDF over its entire range of possible values must equal one: ∫<em>−∞∞f</em>X(x)dx=1.
For a defined range [a,b], this becomes ∫<em>abf</em>X(x)dx=1.
Computing Probability: To find the probability that a continuous random variable X falls within a certain subset (or interval) A of its range, you integrate the PDF over that subset: P(X∈A)=∫<em>Af</em>X(x)dx For an interval [c,d] within the range: P(c≤X≤d)=∫<em>cdf</em>X(x)dx
Cumulative Distribution Function (CDF)
Definition, denoted FX(x): The cumulative distribution function gives the probability that the random variable X takes on a value less than or equal to a specific value x.
It represents the area under the PDF curve up to a point x.
Formula:F<em>X(x)=P(X≤x)=∫</em>−∞xfX(t)dt (A dummy variable t is often used for the integration to avoid confusion with the upper limit x).
Relationship between PDF and CDF: By the Fundamental Theorem of Calculus, the derivative of the CDF gives the PDF: f<em>X(x)=dxdF</em>X(x)
Expected Value and Variance for Continuous Random Variables
Expected Value (Mean), denoted E[X] or μX:
It represents the long-term average value of the random variable.
Formula:E[X]=∫<em>−∞∞x⋅f</em>X(x)dx This is a weighted average where each value of x is weighted by its probability density.
Expected Value of a Function of X, denoted E[g(X)]:
For any function g(x), its expected value is: E[g(X)]=∫<em>−∞∞g(x)⋅f</em>X(x)dx
Variance, denoted Var(X) or σX2:
Measures the spread or dispersion of the data around the mean.
The variance indicates the spread of the data. A higher variance means greater spread.
Example: U-Substitution in Finding 'c'
Let fX(x)=cxcos(x2) for x∈[0,π/2] and 0 otherwise.
Finding the constant 'c'
Condition:∫0π/2cxcos(x2)dx=1
U-Substitution: Let u=x2. Then du=2xdx, so xdx=21du.
Change Limits of Integration:
If x=0, then u=02=0.
If x=π/2, then u=(π/2)2=π/2.
Substitute and Integrate: ∫<em>0π/2ccos(u)21du=12c∫</em>0π/2cos(u)du=1 2c[sin(u)]0π/2=1 2c(sin(π/2)−sin(0))=1 2c(1−0)=1 2c=1 c=2
Result: The PDF is fX(x)=2xcos(x2) for x∈[0,π/2] and 0 otherwise.
Note on Positivity: It's important to ensure the function is non-negative over its range. For x∈[0,π/2], x is positive, and x2∈[0,π/2]. In this range, cos(x2) is positive, so the entire PDF is positive.
This integral is complex and cannot be solved by simple u-substitution or elementary integration by parts directly as presented. It may require advanced techniques like power series expansion (Taylor series) and term-by-term integration for approximation, or might be a type of Fresnel integral. This demonstrates that not all integrals for expected values are straightforward to compute analytically.