Statistical Reasoning: Visual Displays of Data and Frequency Tables

Section 3.1: Frequency Tables and Learning Goals

  • The primary learning goal of this section is to develop the technical ability to create and interpret frequency tables.

  • This topic is situated within Chapter 3: Visual Displays of Data, of the Sixth Edition of Statistical Reasoning for Everyday Life.

Definitions and Fundamental Components

  • Frequency Table: A basic frequency table is a structured data display consisting of two primary columns:

    • Categories: One column lists all the distinct categories or classifications of the data.

    • Frequency: The other column lists the frequency of each specific category.

  • Frequency: Defined as the number of data values that fall within a specific category.

  • Relative Frequency: The proportion or percentage of the total data set that falls within a particular category. It is calculated using the formula:

    • Relative Frequency=Frequency of CategoryTotal Frequency\text{Relative Frequency} = \frac{\text{Frequency of Category}}{\text{Total Frequency}}

  • Cumulative Frequency: The total number of data values in a specific category combined with the values in all preceding categories.

Example 1: Taste Test Study

  • Background: The Rocky Mountain Beverage Company conducted a taste test for its new product, Coral Cola.

  • Data Set: The study included 2020 individuals who each rated the cola on a 55-point scale.

  • Variable of Interest: The taste rating, which is a qualitative variable at the ordinal level of measurement.

  • Frequency Table (Table 3.2):

    • Taste Scale 1: Frequency = 22

    • Taste Scale 2: Frequency = 33

    • Taste Scale 3: Frequency = 99

    • Taste Scale 4: Frequency = 44

    • Taste Scale 5: Frequency = 22

    • Total Frequency: 2020

Example 2: Relative and Cumulative Frequency Interpretation

  • This example expands on the Taste Test data by adding relative and cumulative frequency metrics to Table 3.4.

  • Calculating Relative Frequency: Each category's frequency was divided by the total sum (2020).

    • Example: For the highest rating (55), the frequency is 22. The relative frequency is 220=0.10\frac{2}{20} = 0.10, or 10%10\%.

  • Calculating Cumulative Frequency: The sum of values for a category and its predecessors.

    • Cumulative Frequency for rating 33: 2+3+9=142 + 3 + 9 = 14.

    • Interpretation: 1414 out of 2020 people (70%70\%) gave the cola a rating of 33 or lower.

  • Full Data (Table 3.4):

    • Taste Scale 1: Frequency = 22; Relative Frequency = 2/20=0.102/20 = 0.10; Cumulative Frequency = 22

    • Taste Scale 2: Frequency = 33; Relative Frequency = 3/20=0.153/20 = 0.15; Cumulative Frequency = 3+2=53 + 2 = 5

    • Taste Scale 3: Frequency = 99; Relative Frequency = 9/20=0.459/20 = 0.45; Cumulative Frequency = 9+3+2=149 + 3 + 2 = 14

    • Taste Scale 4: Frequency = 44; Relative Frequency = 4/20=0.204/20 = 0.20; Cumulative Frequency = 4+9+3+2=184 + 9 + 3 + 2 = 18

    • Taste Scale 5: Frequency = 22; Relative Frequency = 2/20=0.102/20 = 0.10; Cumulative Frequency = 2+4+9+3+2=182 + 4 + 9 + 3 + 2 = 18 (Note: Transcript displays this value as 1818)

    • Totals: Frequency = 2020; Relative Frequency = 11, or 100%100\%; Cumulative Frequency = 2020

Example 3: Binned Exam Scores

  • When data sets contain many unique values (like numerical exam scores), data is often grouped into "bins."

  • Dataset (20 Raw Scores): 7676, 8080, 7878, 7676, 9494, 7575, 9898, 7777, 8484, 8888, 8181, 7272, 9191, 7272, 7474, 8686, 7979, 8888, 7272, 7575

  • Bin Selection Strategy:

    • The scores range from a minimum of 7272 to a maximum of 9898.

    • Bin width was chosen as 55 points.

    • Bins are defined to avoid overlap and ensure consistent width (e.g., 9595 to 9999, 9090 to 9494, etc.).

  • Interpretation of Cumulative Frequency in Binned Data: In this specific case, cumulative frequency is interpreted as the total number of scores in or above that specific bin.

  • Frequency Table for Binned Exam Scores (Table 3.5):

    • 95 to 99: Frequency = 11; Relative Frequency = 1/20=0.051/20 = 0.05; Cumulative Frequency = 11

    • 90 to 94: Frequency = 22; Relative Frequency = 2/20=0.102/20 = 0.10; Cumulative Frequency = 2+1=32 + 1 = 3

    • 85 to 89: Frequency = 33; Relative Frequency = 3/20=0.153/20 = 0.15; Cumulative Frequency = 3+2+1=63 + 2 + 1 = 6

    • 80 to 84: Frequency = 33; Relative Frequency = 3/20=0.153/20 = 0.15; Cumulative Frequency = 3+3+2+1=93 + 3 + 2 + 1 = 9

    • 75 to 79: Frequency = 77; Relative Frequency = 7/20=0.357/20 = 0.35; Cumulative Frequency = 7+3+3+2+1=167 + 3 + 3 + 2 + 1 = 16

    • 70 to 74: Frequency = 44; Relative Frequency = 4/20=0.204/20 = 0.20; Cumulative Frequency = 4+7+3+3+2+1=204 + 7 + 3 + 3 + 2 + 1 = 20

    • Totals: Frequency = 2020; Relative Frequency = 11; Cumulative Frequency = 2020