Notes on Frequencies, Sample Size, and Relative Frequencies (Transcript)
Data counts and intervals
There are 5 numbers in the data set that lie between 10 and 26.
There are 2 numbers that lie between 26 and 41.
Intervals referenced: between 10 and 26; and between 26 and 41.
Sample size and relative frequency
If we add three more numbers to the data set, the sample size becomes n = 5 + 2 + 3 = 10.
Relative frequency for the first interval (10–26): ext{RF}_1 = rac{5}{10} = 0.5.
Relative frequency for the second interval (26–41): ext{RF}_2 = rac{2}{10} = 0.2.
Relative frequency for the third (new) interval: ext{RF}_3 = rac{3}{10} = 0.3.
In percentages, these are: ext{RF}1 = 50\%\, , \text{RF}2 = 20\%\, , \text{RF}_3 = 30\%.
Quick check: Sum of relative frequencies should equal 1: ext{RF}1 + ext{RF}2 + ext{RF}_3 = rac{5}{10} + rac{2}{10} + rac{3}{10} = 1.
Total counts and proportions can also be written as: N = 5 + 2 + 3 = 10 and rac{5}{N} = 0.5, \; \frac{2}{N} = 0.2, \; \frac{3}{N} = 0.3.
Presentation notes and audience considerations
The transcript notes a presentation difference: converting to a decimal or a percentage can change how the audience perceives the data.
To convey clearly to a broad audience, you might present multiple formats:
Fractions/counts: 5\text{ out of }10 or 5/10 (0.5)
Decimals: 0.5, 0.2, 0.3
Percentages: 50\%, 20\%, 30\% (these are equivalent to the decimals above)
The speaker suggests that a boss might prefer a simple representation like rac{2}{10} = 0.2 or just a percentage.
Practical takeaway: choose representation that minimizes misinterpretation, while maintaining accuracy.
Final notes on the transcript excerpt
The transcript ends with an incomplete sentence: "Note, if I say my" which indicates the speaker was about to make an additional point but it trails off.
Core concepts captured in the excerpt:
Distinguishing counts in different value ranges (bins).
Computing sample size from counts in all bins when adding observations.
Calculating relative frequencies from counts and total sample size.
Expressing results as fractions, decimals, and percentages for different audiences.
Key equations and expressions
Total sample size with added observations: n = 5 + 2 + 3 = 10
Relative frequencies: ext{RF}1 = rac{5}{10} = 0.5, \, ext{RF}2 = rac{2}{10} = 0.2,
\, ext{RF}_3 = rac{3}{10} = 0.3Percentage equivalents: 50\%,\;20\%,\;30\% (from the respective decimals)
Sum of relative frequencies: ext{RF}1 + ext{RF}2 + ext{RF}_3 = 1
Alternative compact form: N = 10, \;
\; rac{5}{N} = 0.5, \; rac{2}{N} = 0.2, \; rac{3}{N} = 0.3
Connections to foundational ideas
Sample size (n) is the denominator for calculating relative frequencies.
Relative frequency is a basic form of probability estimate for a category.
Percentages provide intuitive interpretation for non-technical audiences.
Consistency check: The sum of relative frequencies across all categories equals 1.
Practical implications and ethics of presentation
Always report data truthfully and transparently; avoid manipulating formatting to mislead.
Provide multiple representations (counts, fractions, decimals, percentages) to accommodate different stakeholders.
When data size changes (adding observations), recalculate and clearly indicate the new sample size and updated frequencies.