Section 4.5–4.6: Continuous Random Variables & the Normal Distribution
Continuous Random Variables & Probability Density Functions (PDFs)
Review of the previous section (4.4)
Last week focused on discrete random variables (probability mass functions, sums of probabilities).
Key bridge: many discrete models (e.g. the binomial) can be approximated by continuous models (especially the normal) for large sample sizes.
Fundamental distinction
Continuous random variable X takes on every value in an interval of the real line.
Because values are uncountably infinite, P(X = x) = 0 for any single point; probabilities are only meaningful over intervals.
Probability Density Function (PDF) f(x)
Smooth curve drawn over the real-number axis; sometimes called frequency function or just probability distribution.
Area under the curve over an interval corresponds to probability:
P(a \le X \le b) = \int_a^b f(x)\,dx
Total area under the entire curve is 1.
Graphical intuition: shade region from a to b → shaded area gives the desired probability.
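The area-as-probability idea can be checked numerically without calculus. Below is a minimal Python sketch using a hypothetical density f(x) = 2x on [0, 1] (not from the notes; chosen because its areas are easy to verify by hand, since the exact integral is x²):

```python
# Probability as area under a PDF, approximated by the midpoint rule.
# Hypothetical illustrative density: f(x) = 2x on [0, 1]
# (a valid PDF: non-negative, total area 1).

def pdf(x: float) -> float:
    return 2 * x if 0 <= x <= 1 else 0.0

def prob(a: float, b: float, n: int = 100_000) -> float:
    """Approximate P(a <= X <= b) as the area under pdf from a to b."""
    h = (b - a) / n
    return sum(pdf(a + (i + 0.5) * h) for i in range(n)) * h

# Exact answer by calculus: x^2 from 0.25 to 0.5 = 0.25 - 0.0625 = 0.1875
print(round(prob(0.25, 0.5), 4))  # ≈ 0.1875
print(round(prob(0.0, 1.0), 4))   # ≈ 1.0 (total area under the curve)
```

The shaded-region picture in the notes corresponds exactly to what `prob(a, b)` sums up.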
The Normal Distribution $\mathcal N(\mu,\sigma^2)$
Central role
Describes numerous natural/industrial phenomena (heights, measurement error, IQ, bulb lifetime, etc.).
Serves as a limiting approximation for many discrete distributions (notably binomial via De Moivre–Laplace/Central Limit Theorem).
Forms bedrock of classical statistical inference (confidence intervals, hypothesis testing).
Shape & descriptive characteristics
Bell-shaped, symmetrical (a.k.a. mound-shaped).
Mean, median, and mode coincide at x = \mu (curve is perfectly balanced there).
Inflection points located one standard deviation on either side (at \mu \pm \sigma).
PDF formula f(x) = \frac{1}{\sigma\sqrt{2\pi}}\,e^{- \frac12\bigl(\frac{x-\mu}{\sigma}\bigr)^2}
Parameters:
\mu = mean (center)
\sigma = standard deviation (spread)
Transcendental constants e \approx 2.71828 and \pi \approx 3.14159 appear.
Memorization not required; recognition and parameter interpretation are essential.
Visual parameter effects (3 example curves)
\mu = 0,\; \sigma = 1 → standard curve (purple), the widest of those shown.
\mu = 3,\; \sigma = 1 → curve shifted right, same spread.
\mu = -4,\; \sigma = 0.5 → curve shifted left, narrower and taller because \sigma is smaller.
Area–probability equivalence holds: P(c \le X \le d) = \int_c^d f(x)\,dx.
If calculus unfamiliar, mentally replace integral by area of shaded strip.
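The "total area is 1" property can likewise be checked numerically for the normal PDF itself. A Python sketch (the parameter pairs match two of the example curves above; the ±8σ integration range is an assumption that captures essentially all of the mass):

```python
import math

def normal_pdf(x: float, mu: float, sigma: float) -> float:
    """The N(mu, sigma^2) density: exp(-0.5*((x-mu)/sigma)**2) / (sigma*sqrt(2*pi))."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2 * math.pi))

def area(mu: float, sigma: float, n: int = 200_000) -> float:
    """Midpoint-rule area under the curve over mu +/- 8 sigma."""
    a, b = mu - 8 * sigma, mu + 8 * sigma
    h = (b - a) / n
    return sum(normal_pdf(a + (i + 0.5) * h, mu, sigma) for i in range(n)) * h

print(round(area(0, 1), 4))     # ≈ 1.0  (standard curve)
print(round(area(-4, 0.5), 4))  # ≈ 1.0  (narrower sigma, same total area)
```

Changing σ reshapes the curve (taller and narrower for small σ) but never changes the total area.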
The Standard Normal Distribution $Z \sim \mathcal N(0,1)$
Special case with \mu = 0,\; \sigma = 1.
Variable usually denoted Z.
Historically, probabilities obtained from printed $Z$-tables:
E.g. P(0 \le Z \le 1.96) = 0.4750 from table (row 1.9, column 0.06).
Modern workflow: use calculators or software:
TI-84: 2nd → DIST (VARS) → normalcdf(lower, upper).
Example: normalcdf(-1.26, 1.26) returns 0.7923; the table gives the slightly less accurate 0.7922 (rounding).
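For readers without a TI-84, Python's standard-library `statistics.NormalDist` (available since Python 3.8) reproduces the same values; a minimal `normalcdf` analogue:

```python
from statistics import NormalDist  # standard library, Python 3.8+

Z = NormalDist(mu=0, sigma=1)  # the standard normal Z ~ N(0, 1)

def normalcdf(lower: float, upper: float) -> float:
    """P(lower <= Z <= upper), mirroring the TI-84 normalcdf(lower, upper)."""
    return Z.cdf(upper) - Z.cdf(lower)

print(f"{normalcdf(-1.26, 1.26):.4f}")  # 0.7923, matching the calculator
print(f"{normalcdf(0, 1.96):.4f}")      # 0.4750, matching the printed table
```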
Standardising a General Normal Variable
Any normal X \sim \mathcal N(\mu,\sigma^2) can be converted to Z via Z = \frac{X-\mu}{\sigma}
Called z-score; measures distance from mean in units of standard deviation.
Enables use of standard normal tables or functions.
Procedure for probability questions
Sketch bell curve; mark mean and shade desired area.
Convert interval bounds x1, x2 to z-scores:
z_1 = \frac{x_1-\mu}{\sigma},\quad z_2 = \frac{x_2-\mu}{\sigma}
Use normalcdf(z_1, z_2).
Translate the result back if interpreting in the X context.
Calculator shortcut: TI-84 allows normalcdf(lower, upper, μ, σ) directly, but understanding standardisation remains conceptually critical.
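Both routes — standardise first, or pass μ and σ directly — must agree. A Python sketch using a hypothetical IQ model X ~ N(100, 15²) (IQ is listed above as a normal phenomenon; the specific parameters are illustrative, not from the notes):

```python
from statistics import NormalDist

def z_score(x: float, mu: float, sigma: float) -> float:
    """Distance from the mean in standard-deviation units."""
    return (x - mu) / sigma

def normalcdf(lower: float, upper: float, mu: float = 0.0, sigma: float = 1.0) -> float:
    """P(lower <= X <= upper) for X ~ N(mu, sigma^2), like TI-84's 4-argument normalcdf."""
    d = NormalDist(mu, sigma)
    return d.cdf(upper) - d.cdf(lower)

# Hypothetical IQ model: X ~ N(100, 15^2). Find P(85 <= X <= 130) two ways.
z1, z2 = z_score(85, 100, 15), z_score(130, 100, 15)  # -1.0 and 2.0
via_z  = normalcdf(z1, z2)           # standardise, then use the Z scale
direct = normalcdf(85, 130, 100, 15)  # pass mu and sigma directly
print(f"{via_z:.4f} {direct:.4f}")    # identical values
```

The agreement is exactly why the calculator shortcut works: internally it performs the same standardisation.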
Worked Forward-Probability Examples
X \sim \mathcal N(5,10^2)
Find P(5 \le X \le 6.2).
z-scores: z1 = 0 (5 at mean), z2 = (6.2-5)/10 = 0.12.
normalcdf(0,0.12)→ 0.0478.
Same distribution, P(2.9 \le X \le 7.1).
z1 = (2.9-5)/10 = -0.21, z2 = +0.21.
normalcdf(-0.21,0.21)→ 0.1663.
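Both worked examples can be checked with `statistics.NormalDist`, passing μ and σ directly:

```python
from statistics import NormalDist

X = NormalDist(mu=5, sigma=10)  # X ~ N(5, 10^2)

# Example 1: P(5 <= X <= 6.2); z-scores 0 and 0.12.
print(f"{X.cdf(6.2) - X.cdf(5):.4f}")    # 0.0478

# Example 2: P(2.9 <= X <= 7.1), a symmetric interval about the mean,
# so the z-scores are -0.21 and +0.21.
print(f"{X.cdf(7.1) - X.cdf(2.9):.4f}")  # 0.1663
```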
Light-bulb lifetime
\mu = 2000 h, \sigma = 200 h (X \sim \mathcal N(2000,200^2)).
a) P(2000 \le X \le 2400).
z1 = 0, z2 = (2400-2000)/200 = 2.
Area:
normalcdf(0,2)→ 0.4772.
b) P(X < 1470).
z = (1470-2000)/200 = -2.65.
Two approaches:
Compute left-tail directly:
normalcdf(-1e99, -2.65) → 0.0040.
Compute the strip up to 0, then subtract from 0.5:
normalcdf(-2.65, 0) = 0.4960; 0.5 - 0.4960 = 0.0040.
Interpretation: Very unlikely (0.4 %) that a bulb fails before 1470 h.
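A sketch of part (a) and both approaches to part (b) in Python:

```python
from statistics import NormalDist

X = NormalDist(mu=2000, sigma=200)  # bulb lifetime in hours

# a) P(2000 <= X <= 2400): the strip from the mean out to z = 2.
print(f"{X.cdf(2400) - X.cdf(2000):.4f}")  # 0.4772

# b) P(X < 1470): left tail at z = (1470 - 2000)/200 = -2.65.
left_tail = X.cdf(1470)                # direct left tail, like normalcdf(-1e99, ...)
strip     = X.cdf(2000) - X.cdf(1470)  # strip from 1470 up to the mean
print(f"{left_tail:.4f}")              # 0.0040
print(f"{0.5 - strip:.4f}")            # 0.0040 (same answer via subtraction)
```

Note that `cdf` already gives area to the left, so the `-1e99` lower bound the calculator needs is unnecessary here.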
Inverse Normal Problems (Finding Quantiles)
Given a probability (area), find z such that P(Z \le z) = p.
TI-84:
invNorm(p) (or "inverse normal" in the menu). Input = area to the left of the desired point.
Output = quantile (z-value).
Examples
Determine a such that P(0 \le Z \le a) = 0.1217.
Total area to left of a: 0.5 + 0.1217 = 0.6217.
invNorm(0.6217)→ a \approx 0.3099 \approx 0.31.
Find z0 with P(Z \le z0) = 0.2858.
Left area smaller than 0.5 ⇒ quantile is negative.
invNorm(0.2858)→ z_0 \approx -0.57.
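An invNorm analogue via `NormalDist.inv_cdf` reproduces both quantiles:

```python
from statistics import NormalDist

Z = NormalDist()  # standard normal

def invNorm(p: float) -> float:
    """Quantile for a given left-tail area, mirroring TI-84 invNorm(p)."""
    return Z.inv_cdf(p)

# P(0 <= Z <= a) = 0.1217  =>  area to the left of a is 0.5 + 0.1217 = 0.6217
print(round(invNorm(0.6217), 2))  # 0.31

# P(Z <= z0) = 0.2858: left area below 0.5, so the quantile is negative.
print(round(invNorm(0.2858), 2))  # -0.57
```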
Symmetric central probability: find z0 s.t. P(-z0 \le Z \le z_0) = 0.7294.
Tail area combined: 1 - 0.7294 = 0.2706; each tail 0.1353.
Focus on left tail:
invNorm(0.1353) → -1.1.
Therefore z_0 = 1.1.
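The symmetric case in code, following the same tail-splitting reasoning:

```python
from statistics import NormalDist

Z = NormalDist()

# P(-z0 <= Z <= z0) = 0.7294: the two tails share 1 - 0.7294 = 0.2706,
# so each tail holds 0.1353. The left-tail quantile is -z0 by symmetry.
tail = (1 - 0.7294) / 2          # 0.1353
z0 = -Z.inv_cdf(tail)            # negate the left-tail quantile
print(round(z0, 1))  # 1.1
```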
Inverse for Non-standard Normals
If P(a \le X \le b) is given and X \sim \mathcal N(\mu,\sigma^2):
Convert problem to Z via standardisation.
Use invNorm to get the appropriate z-quantile(s).
Convert back: x = \mu + z\sigma.
Worked Example
X \sim \mathcal N(5,10^2); find a s.t. P(5 \le X \le a) = 0.1217.
Diagram: strip to right of mean with given area.
Corresponding Z problem identical to earlier 0 \le Z \le z with area 0.1217 → z = 0.3099.
Translate: a = \mu + z\sigma = 5 + 0.3099(10) \approx 8.1.
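The translate-back step in code:

```python
from statistics import NormalDist

mu, sigma = 5, 10  # X ~ N(5, 10^2)

# Want a with P(5 <= X <= a) = 0.1217. In Z terms the left area is 0.5 + 0.1217.
z = NormalDist().inv_cdf(0.5 + 0.1217)  # ≈ 0.3099
a = mu + z * sigma                      # translate the quantile back to X units
print(round(a, 1))  # 8.1
```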
Practical, Philosophical & Cross-Lecture Connections
Practical relevance: Engineers, scientists, and quality-control analysts rely on normal probabilities to predict defect rates (e.g., bulb life example) and design tolerance intervals.
Ethical dimension: Misinterpreting probabilities (e.g., ignoring tail risk) can lead to faulty decision-making in public policy or product safety.
Link to earlier material: The binomial distribution, treated in the discrete section, converges to the normal when the number of trials is large and p is not extreme (De Moivre–Laplace). Understanding standardisation prepares you for the Central Limit Theorem in later chapters.
Tool mindset: While technology replaces hand-integration, conceptual mastery (areas ↔ probabilities, standardisation, tails) is vital for correct model choice and critical evaluation of software outputs.
Summary Checklist
[ ] Recognise definition and properties of a PDF for continuous variables.
[ ] Memorise that probabilities = areas under the curve.
[ ] Identify & interpret \mu and \sigma in any normal PDF.
[ ] Convert X \sim \mathcal N(\mu,\sigma^2) ↔ Z \sim \mathcal N(0,1) via Z = (X-\mu)/\sigma.
[ ] Use normalcdf for forward probabilities and invNorm for quantiles.
[ ] Always sketch the curve, mark the mean and endpoints, and shade the area: this prevents sign mistakes and mis-ordering of bounds.