Chapter 1 Notes: Picturing Distributions with Graphs

Individuals and Variables

  • Individuals: Objects described by a set of data (people, animals, things).
    • Example: In the fuel economy dataset, individuals are car makes and models like Subaru Impreza.
  • Variable: Characteristic of an individual that can take different values.
    • Example: Vehicle Class, Transmission Type, Number of Cylinders, City mpg, Highway mpg, and Annual Fuel Cost.
  • Categorical Variable: Places individuals into groups or categories.
    • Example: Vehicle Class and Transmission Type.
  • Quantitative Variable: Takes numerical values where arithmetic operations make sense, usually recorded with a unit of measurement.
    • Example: Number of Cylinders, City mpg, Highway mpg, and Annual Fuel Cost.

Distribution of a Variable

  • Distribution: Tells us what values a variable takes and how often it takes those values.
  • For categorical variables, the distribution lists categories and the count or percentage of individuals in each category.
  • Distributions can be displayed using:
    • Bar Graph: Represents each category as a bar.
    • Steps to create a bar graph in excel are:
      • Enter the distribution data into an excel spreadsheet.
      • Highlight the distribution data.
      • Click insert.
      • Click recommended charts and select bar graph.
      • Click Ok.
    • Pie Chart: Shows the distribution of a categorical variable as slices of a pie.
    • Steps to create a pie chart in excel are:
      • Enter the distribution data into an excel spreadsheet.
      • Highlight the distribution data.
      • Click insert.
      • Click recommended charts and select pie chart.
      • Click Ok.
    • Pie charts must include all categories that make up a whole.
Example: Births by Day of the Week
  • Data shows the average number of babies born on each day of the week in 2014.
  • The table represents the distribution of births in the categorical variable