Probabilistic Decision Making (1) - Comprehensive Study Notes

Random Variable

  • A random variable is the result of a chance event, that you can measure or count. It is denoted by a variable such as XX.

  • It maps outcomes of a random process to numerical values (discrete or continuous).

Uncertainty and Decision Making

  • Example: Pricing decisions under uncertainty.

  • Consumer’s Willingness To Pay (WTP):

    • WTP = 25,00025{,}000 with probability 0.50.5

    • WTP = 20,00020{,}000 with probability 0.50.5

  • This framing treats price sensitivity and demand as probabilistic, guiding expected profitability under uncertainty.

  • Implication: decisions depend on the distribution of WTP rather than a single point estimate.

Probability Distribution (illustrative examples)

  • Example: Number of times a U.S. adult has been married (random variable XX).

    • Possible outcomes: 0,1,2,3+0, 1, 2, 3+

    • Corresponding probabilities: P(X=0)=0.31,<br>nP(X=1)=0.52,<br>P(X=2)=0.13,<br>P(X=3+)=0.04P(X=0)=0.31,<br>n P(X=1)=0.52,<br>P(X=2)=0.13,<br>P(X=3+)=0.04

  • The sum of all probabilities equals 1: <br>P(X=0)+P(X=1)+P(X=2)+P(X=3+)=1.<br><br>P(X=0)+P(X=1)+P(X=2)+P(X=3+)=1.<br>

  • Fair coin example (data-generating process vs. inference):

    • True data-generating process (for a fair coin): P(extHead)=P(extTail)=0.5P( ext{Head})=P( ext{Tail})=0.5

    • Statistical inference (sampling): toss a coin n times and observe a realized sample; the sample proportion of heads varies across samples.

    • Law of Large Numbers (LLN): as non o \infty, the sample proportion converges to 0.50.5. Intuition: random fluctuations cancel out with more data; larger samples give better estimates of the truth.

Law of Large Numbers (LLN) – intuition and implications

  • With repeated sampling, sample statistics stabilize around the true population values.

  • Practical takeaway: confidence in estimates improves with larger sample sizes.

Statistical Inference: Sample Proportions in Bellwether Examples

  • Visual intuition (proportions of heads in coin tosses) shows that as sample size grows (e.g., n=1,n=50,n=100,n=1,000,000n=1, n=50, n=100, n=1{,}000{,}000), the observed proportion concentrates around 0.50.5.

  • Early samples can be far from the true probability, but deviations shrink with larger nn.

Expected Value (Mean)

  • Coin-toss game (X ∈ {Head, Tail}; Head payoff 30,Tailpayoff30, Tail payoff10):

    • Probabilities: P(extHead)=0.5,P(extTail)=0.5P( ext{Head})=0.5, P( ext{Tail})=0.5

    • Payoffs: Head = 30,extTail=1030, ext{ Tail} = 10

    • Expected reward: E[X]=30imes0.5+10imes0.5=20E[X] = 30 imes 0.5 + 10 imes 0.5 = 20

  • General definition: for a discrete random variable with outcomes xx and probabilities P(X=x)P(X=x),
    E[X]=xxP(X=x)E[X] = \sum_x x \, P(X=x)

  • Terminology:

    • Random variable: XX (realized value: xx)

    • Expectation/Expected value: E[X]E[X]

Pricing and Market Prediction

  • If a contract pays 11 with probability pp and 00 with probability 1p1-p, then its fair price is: 1imesp+0imes(1p)=p1 \, imes \, p + 0 \, imes \, (1-p) = p.

  • In markets: the price often reflects the probability of the event, i.e., the fair value aligns with the subjectively assigned probability.

Variance (Spread around the Mean)

  • Definition: Variance measures how far a set of numbers are spread out from their average value.

  • Example: Income by state (illustrative figures): State B has higher variance than State A; e.g., State A vs State B income distributions show greater dispersion in State B.

  • Notation: extVar(X)=E[(XE[X])2]=x(xE[X])2P(X=x)ext{Var}(X) = E[(X - E[X])^2] = \sum_x (x - E[X])^2 \, P(X=x)

  • A Coin-toss with Head payoff 4040 and Tail payoff 00 (probabilities 0.5,0.50.5, 0.5) has:

    • Expected value: E[X]=40(0.5)+0(0.5)=20E[X] = 40(0.5) + 0(0.5) = 20

    • Variance: extVar(X)=(4020)2(0.5)+(020)2(0.5)=400ext{Var}(X) = (40-20)^2(0.5) + (0-20)^2(0.5) = 400

  • Example: Two games with equal expected value but different variance

    • Game I: Head = 2525, Tail = 1515; P=0.5P=0.5 for each

    • E[X]=25(0.5)+15(0.5)=20E[X] = 25(0.5) + 15(0.5) = 20

    • extVar(X)=(2520)2(0.5)+(1520)2(0.5)=25ext{Var}(X) = (25-20)^2(0.5) + (15-20)^2(0.5) = 25

    • Game II: Head = 4040, Tail = 00; P=0.5P=0.5 for each

    • E[X]=20E[X] = 20

    • extVar(X)=(4020)2(0.5)+(020)2(0.5)=400ext{Var}(X) = (40-20)^2(0.5) + (0-20)^2(0.5) = 400

  • Practice cue: use extVar(X)=E[(XE[X])2]=x(xE[X])2P(X=x)ext{Var}(X) = E[(X - E[X])^2] = \sum_x (x - E[X])^2 P(X=x)

Median and Mean (Illustrative class example)

  • Mean income for this class (example): about 2,371,0002{,}371{,}000

  • This is surprisingly high relative to typical year-incomes in many age groups; typical range reported: between 28,00028{,}000 and 33,00033{,}000 per year.

  • Median income: 39,45039{,}450 (illustrative)

  • The distribution can be plotted on a log scale to show long tails (Mean vs Median can diverge when distributions are skewed).

Expected Value vs Average; Law of Large Numbers in practice

  • Focus in this course: the “average” is used to infer the expected value.

  • Examples: Did an advertisement increase sales? Is a pricing ending in 9 effective? Compare sales under conditions A vs B.

  • Key principle: the sample average converges to the expected value as sample size grows: Xˉn<br>ightarrowE[X]extasn<br>ightarrow\bar{X}_n <br>ightarrow E[X] ext{ as } n <br>ightarrow \infty.

Key Probability and Algebraic Properties

  • Linearity of expectation:

    • E[X+Y]=E[X]+E[Y]E[X+Y] = E[X] + E[Y]

    • E[X+a]=E[X]+aE[X + a] = E[X] + a, for any constant aa

    • E[aX]=aE[X]E[aX] = a \, E[X], for any constant aa

  • Variance under linear shifts and scalings:

    • extVar[X+a]=extVar[X]ext{Var}[X + a] = ext{Var}[X]

    • extVar[aX]=a2extVar[X]ext{Var}[aX] = a^2 \, ext{Var}[X]

Discrete vs Continuous Random Variables

  • Discrete random variable:

    • Countable set of possible values (e.g., number of customers in an hour, number of rainy days in a year).

  • Continuous random variable:

    • Can take infinitely many values (e.g., weight, height, or a continuous percentage).

Binomial Distribution

  • Binomial distribution is a discrete distribution.

  • Setup: each trial outcome is either success (1) or failure (0).

    • Random variable XX represents the sum of the outcomes after nn trials: X=<em>i=1nx</em>iX = \sum<em>{i=1}^n x</em>i with xi0,1x_i \in {0,1}

    • Probability of success on a single trial: pp

    • Number of trials: nn

  • Notation: XBinomial(n,p)X \sim \mathrm{Binomial}(n, p)

  • Expectation and variance:

    • E[X]=npE[X] = n p

    • extVar(X)=np(1p)ext{Var}(X) = n p (1 - p)

Binomial Distribution: Examples

  • Coin-toss game (first Binomial example):

    • Number of trials: n=100n = 100, probability of heads per trial: p=0.5p = 0.5

    • Payoff: Head = 11, Tail = 00 per trial; total payoff is the number of heads, i.e., X = \text{#heads}

    • Expected payoff: E[X]=np=100×0.5=50E[X] = n p = 100 \times 0.5 = 50

    • Interpretation: expected number of heads is 50; if payoff is per head, expected payoff is 50×1=5050\times 1 = 50 dollars.

  • Other binomial-setup examples include:

    • Taco-order: two taco types; probability of spicy pork ordering: p=0.30p = 0.30; with total orders nn; expected spicy pork sales: E[X]=npE[X] = n p.

    • Coffee-order: two types of coffee; probability of iced latte: p=0.40p = 0.40; with total customers nn; expected revenue or quantity depends on assigned payoff; variance of counts can be computed using binomial variance formula.

  • Greens in Regulation (GIR) example:

    • Greens in Regulation over 72 holes modeled with binomial distributions: field ~ 68%, Scottie ~ 70%

    • Visualized distributions show probabilities across the number of GIRs observed; e.g., 0, 20, 40, 60, 72 GIRs, etc.

Normal Distribution

  • Normal distribution is a continuous random variable.

  • Parameterization: XN(μ,σ2)X \sim N(\mu, \sigma^2) where μ\mu is the mean and σ2\sigma^2 is the variance.

  • Shape: bell-shaped, symmetric around the mean, single peak.

  • Notation examples: Φ<em>μ,σ</em>ε(X)\Phi<em>\mu,\sigma</em>\varepsilon (X) or simpler, just the standard normal when standardized.

  • Key properties:

    • Total area under the curve equals 1: f(x)dx=1\int_{-\infty}^{\infty} f(x) \, dx = 1.

    • About the empirical rule: P(\mu - \sigma < X < \mu + \sigma) = 0.6827\, (68.27\%) when X ~ N(\mu, \sigma^2).

  • Visual representations show how changing μ\mu and σ2\sigma^2 shifts and stretches the curve.

Practical Questions and Summary

  • When modeling uncertainty in marketing research, use probabilistic distributions to represent key quantities (e.g., demand, WTP, sales).

  • Compare expected values to observed sample averages; use LLN to justify using sample means for inference as sample size grows.

  • Distinguish between discrete vs continuous variables, and choose appropriate distributions (Binomial for counts of successes, Normal for measurement-like data with aggregation).

  • Remember the core equations:

    • Expected value: E[X]=<em>xxP(X=x)E[X] = \sum<em>x x \, P(X=x) (discrete) or E[X]=</em>xfX(x)dxE[X] = \int</em>{-\infty}^{\infty} x \, f_X(x) \, dx (continuous)

    • Variance: Var(X)=E[(XE[X])2]=x(xE[X])2P(X=x)\text{Var}(X) = E[(X - E[X])^2] = \sum_x (x - E[X])^2 P(X=x) (discrete) or Var(X)=E[X2](E[X])2\text{Var}(X) = E[X^2] - (E[X])^2

    • Linearity of expectation: E[X+Y]=E[X]+E[Y]E[X+Y] = E[X] + E[Y]; scaling: E[aX]=aE[X]E[aX] = a \, E[X]

    • Variance under linear transformations: Var(X+a)=Var(X)\text{Var}(X + a) = \text{Var}(X); Var(aX)=a2Var(X)\text{Var}(aX) = a^2 \text{Var}(X)

Connections to foundations and real-world relevance

  • These concepts support probabilistic decision making in marketing: pricing under uncertainty, evaluating promotional effects, and risk assessment.

  • The LLN justifies using large-sample averages to estimate population-level effects (e.g., effect of an advertisement on sales).

  • Binomial and Normal distributions underpin many marketing metrics: conversion counts, defect rates, and aggregate demand as sample sizes grow.

Quick references to formulas (summary)

  • Expected value (discrete): E[X]=xxP(X=x)E[X] = \sum_x x \, P(X=x)

  • Expected value (continuous): E[X]=<em>xf</em>X(x)dxE[X] = \int<em>{-\infty}^{\infty} x \, f</em>X(x) \, dx

  • Variance: Var(X)=E[(XE[X])2]=x(xE[X])2P(X=x)\text{Var}(X) = E[(X - E[X])^2] = \sum_x (x - E[X])^2 P(X=x)

  • Linear properties: E[X+Y]=E[X]+E[Y],  E[aX]=aE[X],  E[X+a]=E[X]+aE[X+Y] = E[X] + E[Y],\; E[aX] = a E[X],\; E[X+ a] = E[X] + a

  • Variance under scaling and shifts: Var(X+a)=Var(X),  Var(aX)=a2Var(X)\text{Var}(X + a) = \text{Var}(X),\; \text{Var}(aX) = a^2 \text{Var}(X)

  • Binomial: XBinomial(n,p),  E[X]=np,  Var(X)=np(1p)X \sim \mathrm{Binomial}(n, p),\; E[X] = n p,\; \text{Var}(X) = n p (1-p)

  • Normal: X \sim N(\mu, \sigma^2),\; P(\mu - \sigma < X < \mu + \sigma) = 0.6827\, (68.27\%)

"}