Misleading Graphical Displays of Data

Misleading Class Widths

  • The selection of class width in a histogram can significantly alter the perceived shape of a data distribution.

  • In the case of President Obama’s 20092009 approval ratings (referenced in Exercise 1919 of Homework 3.43.4), a smaller class width of 22 percentage points reveals a bimodal distribution, indicating high ratings in the first half of the year and low ratings in the second half.

  • A larger class width of 55 percentage points is misleading because it suggests a unimodal distribution, obscuring the actual drop in approval ratings.

Vertical Axis Starting Values

  • Starting the vertical axis at 00 (e.g., 0%0\% or 00 thousand) emphasizes the absolute magnitude of values and de-emphasizes differences or increases.

  • Starting the vertical axis at a nonzero value (e.g., 35%35\% or 3838 thousand) emphasizes the relative differences or changes between data points.

  • Entities often choose an axis start to support a specific narrative: Northwestern University might use a zero start to de-emphasize tuition increases, whereas Facebook might use a nonzero start to highlight its lead over Snapchat.

  • If a company like Snap wanted to show their percentage is close to Facebook’s, they would display a graph starting at 0%0\% to make the bar heights appear similar.

Data Estimation Precision

  • Choosing a nonzero starting point for the vertical axis often allows for smaller scale increments.

  • This increased granularity improves the ability to estimate specific values accurately, such as determining that approximately 38%38\% of Instagram users visited the platform multiple times daily in 20182018.