Probabilistic Decision Making (1) - Comprehensive Study Notes
Random Variable
A random variable is the result of a chance event, that you can measure or count. It is denoted by a variable such as .
It maps outcomes of a random process to numerical values (discrete or continuous).
Uncertainty and Decision Making
Example: Pricing decisions under uncertainty.
Consumer’s Willingness To Pay (WTP):
WTP = with probability
WTP = with probability
This framing treats price sensitivity and demand as probabilistic, guiding expected profitability under uncertainty.
Implication: decisions depend on the distribution of WTP rather than a single point estimate.
Probability Distribution (illustrative examples)
Example: Number of times a U.S. adult has been married (random variable ).
Possible outcomes:
Corresponding probabilities:
The sum of all probabilities equals 1:
Fair coin example (data-generating process vs. inference):
True data-generating process (for a fair coin):
Statistical inference (sampling): toss a coin n times and observe a realized sample; the sample proportion of heads varies across samples.
Law of Large Numbers (LLN): as , the sample proportion converges to . Intuition: random fluctuations cancel out with more data; larger samples give better estimates of the truth.
Law of Large Numbers (LLN) – intuition and implications
With repeated sampling, sample statistics stabilize around the true population values.
Practical takeaway: confidence in estimates improves with larger sample sizes.
Statistical Inference: Sample Proportions in Bellwether Examples
Visual intuition (proportions of heads in coin tosses) shows that as sample size grows (e.g., ), the observed proportion concentrates around .
Early samples can be far from the true probability, but deviations shrink with larger .
Expected Value (Mean)
Coin-toss game (X ∈ {Head, Tail}; Head payoff 10):
Probabilities:
Payoffs: Head =
Expected reward:
General definition: for a discrete random variable with outcomes and probabilities ,
Terminology:
Random variable: (realized value: )
Expectation/Expected value:
Pricing and Market Prediction
If a contract pays with probability and with probability , then its fair price is: .
In markets: the price often reflects the probability of the event, i.e., the fair value aligns with the subjectively assigned probability.
Variance (Spread around the Mean)
Definition: Variance measures how far a set of numbers are spread out from their average value.
Example: Income by state (illustrative figures): State B has higher variance than State A; e.g., State A vs State B income distributions show greater dispersion in State B.
Notation:
A Coin-toss with Head payoff and Tail payoff (probabilities ) has:
Expected value:
Variance:
Example: Two games with equal expected value but different variance
Game I: Head = , Tail = ; for each
Game II: Head = , Tail = ; for each
Practice cue: use
Median and Mean (Illustrative class example)
Mean income for this class (example): about
This is surprisingly high relative to typical year-incomes in many age groups; typical range reported: between and per year.
Median income: (illustrative)
The distribution can be plotted on a log scale to show long tails (Mean vs Median can diverge when distributions are skewed).
Expected Value vs Average; Law of Large Numbers in practice
Focus in this course: the “average” is used to infer the expected value.
Examples: Did an advertisement increase sales? Is a pricing ending in 9 effective? Compare sales under conditions A vs B.
Key principle: the sample average converges to the expected value as sample size grows: .
Key Probability and Algebraic Properties
Linearity of expectation:
, for any constant
, for any constant
Variance under linear shifts and scalings:
Discrete vs Continuous Random Variables
Discrete random variable:
Countable set of possible values (e.g., number of customers in an hour, number of rainy days in a year).
Continuous random variable:
Can take infinitely many values (e.g., weight, height, or a continuous percentage).
Binomial Distribution
Binomial distribution is a discrete distribution.
Setup: each trial outcome is either success (1) or failure (0).
Random variable represents the sum of the outcomes after trials: with
Probability of success on a single trial:
Number of trials:
Notation:
Expectation and variance:
Binomial Distribution: Examples
Coin-toss game (first Binomial example):
Number of trials: , probability of heads per trial:
Payoff: Head = , Tail = per trial; total payoff is the number of heads, i.e., X = \text{#heads}
Expected payoff:
Interpretation: expected number of heads is 50; if payoff is per head, expected payoff is dollars.
Other binomial-setup examples include:
Taco-order: two taco types; probability of spicy pork ordering: ; with total orders ; expected spicy pork sales: .
Coffee-order: two types of coffee; probability of iced latte: ; with total customers ; expected revenue or quantity depends on assigned payoff; variance of counts can be computed using binomial variance formula.
Greens in Regulation (GIR) example:
Greens in Regulation over 72 holes modeled with binomial distributions: field ~ 68%, Scottie ~ 70%
Visualized distributions show probabilities across the number of GIRs observed; e.g., 0, 20, 40, 60, 72 GIRs, etc.
Normal Distribution
Normal distribution is a continuous random variable.
Parameterization: where is the mean and is the variance.
Shape: bell-shaped, symmetric around the mean, single peak.
Notation examples: or simpler, just the standard normal when standardized.
Key properties:
Total area under the curve equals 1: .
About the empirical rule: P(\mu - \sigma < X < \mu + \sigma) = 0.6827\, (68.27\%) when X ~ N(\mu, \sigma^2).
Visual representations show how changing and shifts and stretches the curve.
Practical Questions and Summary
When modeling uncertainty in marketing research, use probabilistic distributions to represent key quantities (e.g., demand, WTP, sales).
Compare expected values to observed sample averages; use LLN to justify using sample means for inference as sample size grows.
Distinguish between discrete vs continuous variables, and choose appropriate distributions (Binomial for counts of successes, Normal for measurement-like data with aggregation).
Remember the core equations:
Expected value: (discrete) or (continuous)
Variance: (discrete) or
Linearity of expectation: ; scaling:
Variance under linear transformations: ;
Connections to foundations and real-world relevance
These concepts support probabilistic decision making in marketing: pricing under uncertainty, evaluating promotional effects, and risk assessment.
The LLN justifies using large-sample averages to estimate population-level effects (e.g., effect of an advertisement on sales).
Binomial and Normal distributions underpin many marketing metrics: conversion counts, defect rates, and aggregate demand as sample sizes grow.
Quick references to formulas (summary)
Expected value (discrete):
Expected value (continuous):
Variance:
Linear properties:
Variance under scaling and shifts:
Binomial:
Normal: X \sim N(\mu, \sigma^2),\; P(\mu - \sigma < X < \mu + \sigma) = 0.6827\, (68.27\%)
"}