Notes: Page-by-Page Discrete Random Variables and Moments
Page 1: Expected value basics and Bernoulli/geometric setup
Expected value definition:
Bernoulli example (two outcomes 0 or 1): if and , then
Geometric setup (first success on trial X):
PMF:
Intuition: number of trials until first success; larger p → smaller expected number of trials.
Page 2
Expected value of geometric distribution via calculus:
Trick: differentiate the geometric series with respect to the parameter and interchange sum and derivative (under regularity conditions: differentiability and uniform convergence).
Result:
Quick intuition: if p=0.2, then .
Note on interchangeability: requires differentiability of the term and a justification (uniformly bounded sum); these are standard calculus conditions.
Reminder: geometric mean/expectation is not trivial despite its simple form.
Page 3
Not all distributions have a finite expected value.
Example: Cauchy distribution (heavy tails) can yield that does not exist (diverges).
If you try and the sum diverges, the expectation does not exist.
Practical takeaway: to guarantee an expectation exists, it is common to require \mathbb{E}[|X|] < \infty (absolute integrability).
Relevance: heavy-tailed distributions have large tail probability mass which can make the expectation undefined or infinite.
Page 4
Expectation of a function of a random variable:
Profit example (three computers):
Cost:
Sell price: per unit; unsold units repurchased at each.
If sold x units, revenue:
Profit:
Therefore:
If the function is linear, , then (linearity of expectation).
Practical use: compute (\mathbb{E}[X]) once and apply it to the linear form; for non-linear h, compute the full sum.
Page 5
Variance definition for a random variable:
Discrete distribution variance:
Relation to data variance: distribution variance uses probability weights (not dividing by n or n-1); data variance uses sample-based averaging with a denominator (n-1) for unbiased estimation.
Alternative form: which is often easier to compute.
Standard deviation: ; interprets dispersion in the same units as X.
Interpretation: larger variance/SD implies greater dispersion or volatility in the distribution.
Page 6
Summary and preview:
We covered means (expected value) and variance for discrete random variables, including linear and non-linear transformations.
Noted that some distributions do not have finite expectations; absolute integrability is a typical condition for existence.
Next steps: discrete random variables chapter continuation, well-known discrete distributions, and when to use them to model real-life systems; practice computing means/variances and understanding distribution shapes.