/
Traffic engineering requires collecting and analyzing large amounts of data to perform traffic studies.
Statistics is crucial for:
Determining required data quantities.
Making confident inferences from collected data.
Full data measurement for average characteristics is often impractical.
Example: Average speed of vehicles on a road cannot be directly observed for all vehicles due to logistics.
Engineers sample from the population to estimate characteristics and the necessary degree of certainty.
How many samples are required?
What is the confidence level in the estimate?
What statistical distribution best represents the observed data?
Did the traffic engineering design change population characteristics?
Not a substitute for a course in statistics; references provided for further reading.
Discrete Functions: Composed of discrete variables; whole numbers only (e.g., number of children in a family).
Continuous Functions: Composed of continuous variables; any value possible within a range (e.g., height of individuals).
Common Discrete Distributions: Bernoulli, binomial, Poisson.
Common Continuous Distributions: Normal, exponential, chi-square.
Some events are predictable (e.g., gas pedal pressure and vehicle speed), while others are random (e.g., radioactive particle emission).
Engineer's perspective: simplify modeling based on prediction feasibility.
Raw traffic data can be organized into classes for clarity.
Common distributions: normal, exponential, chi-square, Bernoulli, binomial, and Poisson.
Creating frequency distribution tables helps simplify data analysis.
Mean: Average of observations.
Median: Middle value when data is ordered.
Mode: Most frequent value. Can be bimodal (two modes).
Variance: Squared deviation from the mean.
Standard Deviation (STD): Square root of variance. Indicates data spread.
The coefficient of variation indicates the relative spread of outcomes versus the mean.
It has a bell-shaped curve and is defined by mean and variance.
Probability of outcomes lies under the curve.
Total area under the curve equals 1.
Commonly used for speed, travel time, and delays measurements.
Translates an arbitrary normal distribution to a standard normal distribution (mean of 0, variance of 1) using z-scores.
Calculating probabilities involves comparison against standard normal distribution tables.
Approx. 68.3% of values lie within ±1 standard deviation from the mean.
Approx. 95% lie within ±1.96 standard deviations.
Approx. 99.7% lie within ±3 standard deviations.
If 70 estimates of the mean are collected, they tend to be normally distributed.
Standard error of the mean shrinks the more samples are taken, calculated as the population standard deviation divided by the square root of sample size.
Older equations for computing necessary sample sizes to achieve specified tolerances and confidence.
Example calculations demonstrate how increased confidence can lead to significantly larger sample size requirements.
The summation leads to the expected value equal to sum of means.
Through examples, demonstrates how random variables are calculated including standard deviation and variance.
Simplest discrete distribution, two outcomes.
Probability function defined for fixed events and independent outcomes.
Binomial distribution derived from repeated Bernoulli trials.
Used for counting occurrences within a fixed interval.
Defined by a single parameter specifying mean and variance.
Relevant for modeling vehicle occurrences in traffic studies.
Formulate null and alternative hypotheses. Assess effectiveness with probability of Type I and Type II errors defined.
Errors quantified, with examples illustrating potential outcomes in real studies.
Statistics is integral to traffic studies and aids decision-making under uncertainty. A range of statistical techniques beyond presented can enhance study outcomes.
Suggestions for further reading and study are provided, alongside problems to reinforce learning.