Geog 2251 Lecture 5 W25 (1)
Probability Theory
Probability theory is the branch of mathematics that describes random behavior.
Originated from the study of games of chance.
Now widely used in various fields:
Stock markets
Human behavior
Population characteristics
Human mortality
Disease
Sports
Variability in Measurements
Natural variability exists in all measurements, making any computed statistic prone to differences from the true value.
Probability expresses this uncertainty.
E.g., a statistically significant result (95% confidence) implies less than a 5% chance that results are due to chance.
Random Phenomena
Individual outcomes are uncertain, yet outcomes display regular distributions over large repetitions.
Probability of a specific outcome = proportion of times that outcome occurs in extensive trials.
Definition of Probability
Probability measures the likelihood of an event occurring, calculated as:
Probability = Number of favorable cases / Total number of cases (Relative frequency)
Example: Probability of rolling a 6 on a 6-sided die = 1/6 = 0.17
Types of Probability
Empirical Probability
Determined through repeated experimentation.
E.g., flipping a coin 500 times with heads appearing 262 times:(P(\text{heads}) = \frac{262}{500} = 0.524)
Theoretical Probability
Based on possible outcomes and mathematical computations.
Example: Drawing a card from a standard deck where each card is equally likely.
Subjective Probabilities
Adjusted based on personal experience and intuition, e.g., weather forecasts.
Law of Large Numbers
As sample size increases, empirical and theoretical probabilities converge, e.g., more coin tosses result in empirical probability approaching 0.5.
Distinct symbols for empirical and theoretical distributions (mean, standard deviation, variance).
Odds
Odds relate probability to occurrences and non-occurrences, mathematically expressed as ratios.
E.g., Odds of drawing a red marble from a bag of colored marbles.
Probability Rules
Probability of an impossible event = 0.
Probability of a certain event = 1.
All probabilities lie between 0 and 1.
The sum of probabilities of all possible outcomes = 1.
Addition and Multiplication Rules
Addition Rule: For mutually exclusive events, add individual probabilities.E.g., probability of getting heads or tails = 0.5 + 0.5 = 1.
Multiplication Rule: Probability that two events (A and B) both occur = P(A) x P(B).
Example: Probability of winning a lottery: (P = \frac{1}{49} \times \frac{1}{48} \times \frac{1}{47} \times \ldots)
Probability Distributions
Represented by histograms that show event frequencies and curve shapes:
Types of distributions include:
Continuous: Normal, T, Chi-Squared.
Discrete: Binomial, Poisson.
Continuous vs. Discrete Distributions
Continuous Variables: Take infinite values (e.g., height, temperature).
Discrete Variables: Take finite values (e.g., number of stores).
Binomial and Poisson Distributions
Binomial Distribution: Outcomes of a success/failure experiment (Bernoulli Trials).
e.g. Lottery wins.
Poisson Distribution: Addresses the probability of rare events per unit time or space.
e.g., number of earthquakes in a year.
The Normal Distribution
The most common distribution (bell curve), applies to many natural phenomena (e.g., heights, weights).
Characterized as:
Bell-shaped
Unimodal
Symmetrical
Mode, median, and mean are equal.
Empirical Rule of Normal Distribution
Approximately:
68% within 1 standard deviation of the mean.
95% within 2 standard deviations.
99.7% within 3 standard deviations.
Standard Normal Distribution and Z-scores
Normal distributions can be standardized with Z-scores.
A Z-score indicates how many standard deviations an observation is from the mean.
E.g., IQ scores with mean 100 and std deviation 15.
68% of scores lie within (85, 115).
Calculating Probabilities and Using Z-tables
After standardizing Z-scores, probabilities of x values are determined.
Probabilities are calculated from Z-tables, represent areas under the curve.
Interpreting Z-table Values
Area under the curve to the left of Z indicates the probability:
For example, P(Z=0) = 0.5, meaning half the area lies below the mean.
Example Calculations
Example calculation for heights in a normal distribution:
Mean height 163cm, if looking for the probability of a student being less than 170cm:
Find corresponding Z-value, use Z-table.
Results indicate probabilities for ranges:
Area left of 170 provides the probability of getting a height under 170cm.
Area Between Two Values
To find the probability between two heights (e.g., 165 and 170cm):
Area left of 170 – area left of 165 gives the probability between.
Height Probability Calculations
For height < 150cm or other specific values, Z-calculations correspond to exact probabilities using respective Z-tables.
E.g., Z = -2.17 relates to height calculations on the left.
Final computations will show overall probabilities in context and can include probability above/below mean.