Graphs

Graphs in Communication

  • Graphs: An essential tool for data representation; they offer the advantage of being interpretable but can also misrepresent data.

    • Often the most interpretable form but not necessarily the most accurate.

    • Can manipulate, distort, and misrepresent data through various means.

Overview of Content

  • Key topics to cover:

    • Review of Variable Types

    • Types of Graphs

    • Different Variables Require Different Graphs

    • Rules for Creating Graphs

    • Categorical Data Graphs

    • Continuous Data Graphs

    • Methods of Misleading Through Graphs

Different Types of Variables

  • Discrete Data and Variables:

    • Nominal or Categorical Level Data:

      • Finite categories/distinct groups.

      • Can count responses; must be whole numbers.

      • Examples:

      • US census regions

      • Biological sex

      • Class standing

      • Marital status

      • Payment methods

      • Movie genres

      • Streaming services

      • Grades.

  • Continuous Data and Variables:

    • Ordinal, Interval, Ratio Level Data:

      • Infinite number of values between any two values; can be any numerical value.

      • Examples:

      • Age

      • Income in dollars

      • Height

      • Weight

      • Grades in a class.

Types of Graphs

  • For Discrete Data (Categorical Variables):

    • Pie Charts:

      • Display the percentage of cases in each category.

    • Bar Charts:

      • Present percentage or number of cases; it’s possible to show multiple categories simultaneously.

    • Pictograms:

      • Use images to represent data, often used in magazines.

  • For Continuous Data:

    • Histograms (Frequency Curves):

      • Present continuous data and display data distributions.

      • X-Axis: Values (e.g., defects per hour).

      • Y-Axis: Number of cases within intervals.

    • Scatter Plots:

      • Show relationships between two continuous variables; each data point is represented by a dot.

    • Line Graphs:

      • Display continuous variable values over time, suitable for showing trends.

Rules for Graphs

  1. Efficient: Balance simplicity with the necessary information displayed.

  2. Accurate: Data must be represented without distortion.

  3. Aesthetically Pleasing: Good use of color, font, and size; the graph should be “user-friendly.”

  4. Accessible: Information should be easy to understand without excessive cognitive effort.

Categorical Data Representations

  • Pie Charts Characteristics:

    • Must sum to 100%.

    • Typically represent discrete categories such as fruits or political candidates.

  • Bar Charts Characteristics:

    • Show percentage or number of cases in categories; height indicates percentage.

    • Can be oriented vertically or horizontally.

  • Pictograms Characteristics:

    • Similar to bar charts but utilize images instead of bars for representation.

Continuous Data Representations

  • Histograms:

    • Similar to bar charts; display distributions of continuous data.

    • Example:

      • X-Axis: Defects per hour, Y-Axis: Number of cases.

  • Cumulative Frequency Curves:

    • Demonstrate frequency distribution; helpful for showing cumulative totals over intervals.

  • Scatter Plots:

    • Each data point represented as a dot, illustrating relationships between two continuous variables.

  • Line Graphs:

    • Useful for visualizing trends over time with continuous data.

How to Lie with Graphs

  • Omitting Labels:

    • No labels on axes can mislead viewers about the data’s context.

    • Example: A chart displaying membership growth without a Y-axis label.

  • Y-Axis Manipulation:

    • Not starting the Y-axis at zero leads to distortion of data representation; can exaggerate perceived changes.

    • Using small units or non-equal intervals can artificially inflate or minimize apparent changes.

  • Inaccurate Measurements:

    • Height of bars not consistently representing values on the Y-axis; misleading visual encoding of data.

    • Improper scale or units on the X-axis can misinform the audience (e.g., inflated growth rates).

  • Cumulative Curves:

    • Present data in a way that suggests misleading accumulation; often provides an inflated sense of growth or presence of a metric.

    • Example of misleading cumulative iPhone sales data: could show growth inaccurately over a period, especially if axes not labeled or improperly scaled.

Summary of Key Points

  • Graphs: Although they simplify data understanding, they can be misleading and manipulated.

  • Types of Graphs: Differences between discrete vs. continuous data must be recognized when selecting graph types.

  • Techniques for Manipulation: Important to recognize how graphical misrepresentation can occur through unlabeled axes, nonsensical scaling, and cumulative representations.