Categorical Data

Types of Variables
  • Categorical
  • Quantitative
    • Discrete
    • Continuous
Categorical Variables
  • Values that are category names or group variables
  • Show frequencies or relative frequencies that observations fall into each category
Quantitative
  • Numerical values for a measured or counted quantity
  • Can be used for mathematical operations
Categorical data can be in numbers
Numerical data can be in categories
Discrete Variable
  • A countable number of values
Continuous Variable
  • Can take on infinitely many values
Discrete values can be treated as continuous if there are a lot of those values
Graphs for Categorical Data
  • Pie (circle) charts- categories in relation to a whole
  • Bar graphs/charts- categories in relation to each other
  • Side-by-side bar graphs- bars are grouped together and placed side by side
  • Segmented bar graphs- displays variable distribution as segments in a rectangle
  • Mosiac plots- a three-way split of data structured like a segmented bar graph
Two-way Table
  • Two categorical variables can be summarized in a two-way table
  • Gives counts of observations for each combination of variables
Joint Relative Frequency
  • Each cells percentage of the total (in a table)
Marginal Relative Frequency
  • Focuses on only one categorical variable
  • Row and column totals for a two-way table
Conditional Relative Frequency
  • Relative frequency for specific row or column
Assosiation
  • When one variable helps to predict the other