Notes on Frequencies, Sample Size, and Relative Frequencies (Transcript)

Data counts and intervals

  • There are 5 numbers in the data set that lie between 10 and 26.

  • There are 2 numbers that lie between 26 and 41.

  • Intervals referenced: between 10 and 26; and between 26 and 41.

Sample size and relative frequency

  • If we add three more numbers to the data set, the sample size becomes n = 5 + 2 + 3 = 10.

  • Relative frequency for the first interval (10–26): ext{RF}_1 = rac{5}{10} = 0.5.

  • Relative frequency for the second interval (26–41): ext{RF}_2 = rac{2}{10} = 0.2.

  • Relative frequency for the third (new) interval: ext{RF}_3 = rac{3}{10} = 0.3.

  • In percentages, these are: ext{RF}1 = 50\%\, , \text{RF}2 = 20\%\, , \text{RF}_3 = 30\%.

  • Quick check: Sum of relative frequencies should equal 1: ext{RF}1 + ext{RF}2 + ext{RF}_3 = rac{5}{10} + rac{2}{10} + rac{3}{10} = 1.

  • Total counts and proportions can also be written as: N = 5 + 2 + 3 = 10 and rac{5}{N} = 0.5, \; \frac{2}{N} = 0.2, \; \frac{3}{N} = 0.3.

Presentation notes and audience considerations

  • The transcript notes a presentation difference: converting to a decimal or a percentage can change how the audience perceives the data.

  • To convey clearly to a broad audience, you might present multiple formats:

    • Fractions/counts: 5\text{ out of }10 or 5/10 (0.5)

    • Decimals: 0.5, 0.2, 0.3

    • Percentages: 50\%, 20\%, 30\% (these are equivalent to the decimals above)

  • The speaker suggests that a boss might prefer a simple representation like rac{2}{10} = 0.2 or just a percentage.

  • Practical takeaway: choose representation that minimizes misinterpretation, while maintaining accuracy.

Final notes on the transcript excerpt

  • The transcript ends with an incomplete sentence: "Note, if I say my" which indicates the speaker was about to make an additional point but it trails off.

  • Core concepts captured in the excerpt:

    • Distinguishing counts in different value ranges (bins).

    • Computing sample size from counts in all bins when adding observations.

    • Calculating relative frequencies from counts and total sample size.

    • Expressing results as fractions, decimals, and percentages for different audiences.

Key equations and expressions

  • Total sample size with added observations: n = 5 + 2 + 3 = 10

  • Relative frequencies: ext{RF}1 = rac{5}{10} = 0.5, \, ext{RF}2 = rac{2}{10} = 0.2,
    \, ext{RF}_3 = rac{3}{10} = 0.3

  • Percentage equivalents: 50\%,\;20\%,\;30\% (from the respective decimals)

  • Sum of relative frequencies: ext{RF}1 + ext{RF}2 + ext{RF}_3 = 1

  • Alternative compact form: N = 10, \;
    \; rac{5}{N} = 0.5, \; rac{2}{N} = 0.2, \; rac{3}{N} = 0.3

Connections to foundational ideas

  • Sample size (n) is the denominator for calculating relative frequencies.

  • Relative frequency is a basic form of probability estimate for a category.

  • Percentages provide intuitive interpretation for non-technical audiences.

  • Consistency check: The sum of relative frequencies across all categories equals 1.

Practical implications and ethics of presentation

  • Always report data truthfully and transparently; avoid manipulating formatting to mislead.

  • Provide multiple representations (counts, fractions, decimals, percentages) to accommodate different stakeholders.

  • When data size changes (adding observations), recalculate and clearly indicate the new sample size and updated frequencies.