CH. 2

Graphical Summaries of Data

Key Definitions

  • Variability: Differences in values/categories among individuals.

  • Distribution: Reflects the values a variable takes and their frequencies.

  • Frequency Table: Summarizes data; applicable for categorical and quantitative variables.

Types of Graphical Summaries

Bar Graphs

  • Bars can be vertical/horizontal, with uniform width and spacing.

  • Represents frequency or percentage of categories.

  • Must include a title, labels, and vertical scale.

Pie Charts

  • Circle divided into sectors representing relative frequencies of categories.

  • Each sector's size corresponds to the category's frequency.

  • Requires a title and legend.

Contingency Tables

  • Displays distributions of two categorical variables together.

  • Cells indicate counts for combinations of variables.

Conditional Distributions

  • Shows distribution of one variable conditional on another.

  • Indicates association between variables if distributions differ.

Side-by-Side & Stacked Bar Graphs

  • Side-by-side: Compares categories of one variable with another.

  • Stacked: One categorical variable is displayed on top of another.

  • Both help visualize relationships between variables.

Quantitative Data Representation

Histograms

  • Bar graph representation of frequency distribution for quantitative variables.

  • No gaps between bars (unless class is empty).

  • Class intervals (bins) must cover all observed data values.

Stemplots

  • Displays data while maintaining original values, using stems and leaves for visualization.

Dotplots

  • Useful for small data sets to show value concentration and gaps.

Time-Series Plots

  • Graphs behavior of a variable over time; time on x-axis, variable on y-axis.

  • Looks for overall trends and seasonal patterns.