Notes: Page-by-Page Discrete Random Variables and Moments

Page 1: Expected value basics and Bernoulli/geometric setup

  • Expected value definition: E[X]=xxP(X=x)\mathbb{E}[X] = \sum_x x \cdot P(X=x)

  • Bernoulli example (two outcomes 0 or 1): if P(X=1)=p<em>1P(X=1)=p<em>1 and P(X=0)=p</em>0=1p<em>1P(X=0)=p</em>0=1-p<em>1, then E[X]=0p</em>0+1p<em>1=p</em>1\mathbb{E}[X] = 0\cdot p</em>0 + 1\cdot p<em>1 = p</em>1

  • Geometric setup (first success on trial X):

    • PMF: P(X=x)=(1p)x1p,x=1,2,3,P(X=x) = (1-p)^{x-1} p, \quad x=1,2,3,…

    • Intuition: number of trials until first success; larger p → smaller expected number of trials.

Page 2

  • Expected value of geometric distribution via calculus:

    • E[X]=x=1x(1p)x1p\mathbb{E}[X] = \sum_{x=1}^{\infty} x (1-p)^{x-1} p

    • Trick: differentiate the geometric series with respect to the parameter and interchange sum and derivative (under regularity conditions: differentiability and uniform convergence).

    • Result: E[X]=1p\mathbb{E}[X] = \frac{1}{p}

    • Quick intuition: if p=0.2, then E[X]=5\mathbb{E}[X] = 5.

  • Note on interchangeability: requires differentiability of the term and a justification (uniformly bounded sum); these are standard calculus conditions.

  • Reminder: geometric mean/expectation is not trivial despite its simple form.

Page 3

  • Not all distributions have a finite expected value.

    • Example: Cauchy distribution (heavy tails) can yield E[X]\mathbb{E}[X] that does not exist (diverges).

    • If you try E[X]=xxP(X=x)\mathbb{E}[X] = \sum_x x \cdot P(X=x) and the sum diverges, the expectation does not exist.

  • Practical takeaway: to guarantee an expectation exists, it is common to require \mathbb{E}[|X|] < \infty (absolute integrability).

  • Relevance: heavy-tailed distributions have large tail probability mass which can make the expectation undefined or infinite.

Page 4

  • Expectation of a function of a random variable: E[h(X)]=xh(x)P(X=x)\mathbb{E}[h(X)] = \sum_x h(x) \cdot P(X=x)

  • Profit example (three computers):

    • Cost: 3×500=15003\times 500 = 1500

    • Sell price: 10001000 per unit; unsold units repurchased at 200200 each.

    • If sold x units, revenue: 1000x+200(3x)=800x+6001000x + 200(3-x) = 800x + 600

    • Profit: Profit=RevenueCost=(800x+600)1500=800x900\text{Profit} = \text{Revenue} - \text{Cost} = (800x + 600) - 1500 = 800x - 900

    • Therefore: E[extProfit]=800E[X]900\mathbb{E}[ ext{Profit}] = 800\mathbb{E}[X] - 900

  • If the function is linear, h(x)=ax+bh(x)=a x + b, then E[h(X)]=aE[X]+b\mathbb{E}[h(X)] = a\mathbb{E}[X] + b (linearity of expectation).

  • Practical use: compute (\mathbb{E}[X]) once and apply it to the linear form; for non-linear h, compute the full sum.

Page 5

  • Variance definition for a random variable: Var(X)=E[(Xμ)2],μ=E[X]\operatorname{Var}(X) = \mathbb{E}[(X - \mu)^2],\quad \mu = \mathbb{E}[X]

  • Discrete distribution variance: Var(X)=x(xμ)2P(X=x)\operatorname{Var}(X) = \sum_x (x - \mu)^2 \cdot P(X=x)

  • Relation to data variance: distribution variance uses probability weights (not dividing by n or n-1); data variance uses sample-based averaging with a denominator (n-1) for unbiased estimation.

  • Alternative form: Var(X)=E[X2](E[X])2\operatorname{Var}(X) = \mathbb{E}[X^2] - (\mathbb{E}[X])^2 which is often easier to compute.

  • Standard deviation: SD(X)=Var(X)\operatorname{SD}(X) = \sqrt{\operatorname{Var}(X)}; interprets dispersion in the same units as X.

  • Interpretation: larger variance/SD implies greater dispersion or volatility in the distribution.

Page 6

  • Summary and preview:

    • We covered means (expected value) and variance for discrete random variables, including linear and non-linear transformations.

    • Noted that some distributions do not have finite expectations; absolute integrability is a typical condition for existence.

    • Next steps: discrete random variables chapter continuation, well-known discrete distributions, and when to use them to model real-life systems; practice computing means/variances and understanding distribution shapes.