Constructing Graphical and Tabular Displays of Data
Frequency and Relative Frequency
The frequency of a class is the total count of observations within that class.
The relative frequency of a class is the proportion of total observations belonging to that class.
For a numerical variable, the sum of all relative frequencies is equal to .
Constructing Histograms
To construct a frequency histogram, write lower class limits (e.g., , , , , , and ) equally spaced on the horizontal axis and frequencies (e.g., , , , , , ) on the vertical axis.
To construct a relative frequency histogram, the vertical axis typically displays proportions (e.g., , , , , ).
Histograms consist of rectangles where the width represents the class width and the height represents the frequency or relative frequency.
Density Histograms
In a density histogram, the area of each bar is equal to the relative frequency of that bar's class.
The total area of all bars in a density histogram must equal .
Proportions can be determined using bar areas:
In a study of Major League Baseball® (MLB) ticket prices, the proportion of stadiums with prices between and inclusive was found by summing bar areas: .
The proportion for prices less than was .
The proportion for prices "at least" can be found by summing all bars from upward () or by using the complement: .
Distribution Shapes and Modality
Unimodal: A distribution with one mound.
Bimodal: A distribution with two mounds.
Multimodal: A distribution with more than two mounds.
Skewed-left: The left tail is longer than the right tail.
Skewed-right: The right tail is longer than the left tail.
Symmetric: The left tail is approximately a mirror image of the right tail.
Analyzing Distributions of Numerical Variables
The four characteristics of a distribution should be determined in the following order:
Identify all outliers. Correct or remove those stemming from errors; consider separate studies for others.
Determine the shape. Evaluate if subgroups should be analyzed separately for bimodal or multimodal data.
Measure and interpret the center.
Describe the spread.
Mathematical Models
A model is defined as a mathematical description of an authentic situation.