Notes on Histograms and Other Quantitative Data Displays
Histograms
- Definition: A histogram is a graphical representation of a frequency distribution.
- The horizontal scale corresponds to classes of quantitative data values.
- The vertical scale corresponds to the frequency of each class.
- Benefits:
- Easy to interpret.
- Quickly see where most of the data lies.
- Determine the shape of the distribution (symmetric or skewed).
Example: Heart Rates of 50 Students
- Frequency distribution table:
- 57-66: 2 students
- 67-76: 10 students
- 77-86: 32 students
- 87-96: 5 students
- 97-106: 1 student
- Histogram construction:
- Each class is represented by a vertical bar.
- The height of the bar is the frequency for that class.
- Horizontal boundaries correspond to class endpoints.
- Interpretation: Most data is in the middle class (77-86), and the distribution is fairly symmetric.
Stem and Leaf Plot
- Definition: A graph representing quantitative data that separates each data value into two parts: the stem and the leaf.
- Similar to a histogram but retains all original data values.
- Construction:
- Stem: Tens digit (left column).
- Leaf: Ones digit (right column).
- Example:
- Data: 67, 69, 68, 65, 62
- Stem: 6
- Leaves: 7, 9, 8, 5, 2
- Interpretation: Quickly see that most data is in the 80s (based on the example in the transcript, but not explicitly shown).
Dot Plots
- Definition: Each data value is plotted as a dot along an axis.
- Usefulness:
- See where data is clustered.
- Identify values that occur most often.
- Example: Heart rate data.
- Horizontal axis ranges from 55 to 100.
- Dots above values (e.g., a dot above 62, last dot above 98).
- Interpretation: Data is clustered between 77 and 84, with 80 having the most occurrences (five dots).