To represent simple categorical data using images or symbols.
Choose a symbol.
Assign a key (e.g. 1 symbol = 10 items).
Draw symbols in rows to represent frequencies.
Use half-symbols or partial symbols for non-multiples of the key.
Count the number of full and partial symbols per category.
Multiply by the value of the key to get the actual frequency.
Always include a key.
Keep symbols evenly spaced.
Make sure all symbols are the same size.
Look for proportionality – the length of a row should reflect the size of the value.
To show the frequency of different categories (discrete data).
Categories go on the x-axis.
Frequencies go on the y-axis.
Draw bars for each category with equal width and equal spacing.
The height of each bar shows the frequency.
Compare heights to find most/least common categories.
Bars must not touch (unlike histograms).
Label axes clearly with units.
Use a sensible scale on the y-axis.
To show how a total is divided between different categories proportionally.
Total frequency = 360°.
For each category:
Angle=(category frequency/total frequency)×360∘
Use a protractor to draw each sector.
Label sectors or provide a key.
Larger angles = greater proportions.
You can calculate the actual frequency from the angle:
Frequency=(angle/360)×total
Add up all angles to check they total 360°.
Label sectors clearly or use a colour key.
To show changes in data over time.
Time on the x-axis, values on the y-axis.
Plot data points for each time interval.
Join points with straight lines.
Look for trends (increasing, decreasing).
Identify sharp increases/decreases or plateaus.
Useful in showing rate of change.
Points should only be connected when the variable is continuous over time.
Use gridlines to align values accurately.
To represent frequency data (usually grouped) with a smooth line alternative to a bar chart.
Calculate midpoint of each class interval:
Midpoint=lower limit+upper limit/2
Plot midpoint against frequency.
Join points with straight lines.
Often used with or instead of a bar chart/histogram.
How to Interpret:
Compare shape of distribution: symmetric, skewed, bimodal.
Find modal group (highest peak).
Start and finish at the x-axis by adding dummy values with 0 frequency to enclose the shape.
To show how data changes over regular time intervals (e.g. months, years).
Time on x-axis, values on y-axis.
Plot and connect points in order.
Label axes clearly with units and dates.
Identify trends (e.g. upward, downward).
Spot seasonal variations or cyclical patterns.
Predict future values based on patterns.
Use a ruler for connecting points cleanly.
Time intervals should be evenly spaced.
To show raw data in a semi-graphical format, keeping individual values visible.
Split numbers into stem (e.g. tens) and leaf (e.g. units).
List all leaves in ascending order for each stem.
Include a key (e.g. 4 | 7 = 47).
Easy to find mode, median, and range.
Useful for comparing data sets (e.g. back-to-back diagrams).
Don’t forget the key.
Group all same-stem values in the same row.
Sort the leaves for clarity.
To show the relationship between two numerical variables.
Plot paired data as (x, y) coordinates.
Each point represents one data pair.
Look at overall direction:
Positive correlation: as x increases, y increases.
Negative correlation: as x increases, y decreases.
No correlation: no clear trend.
Estimate line of best fit (straight line that best follows the trend).
Use the line to make predictions (interpolation or extrapolation).
Correlation does not mean causation.
Line of best fit can be drawn by eye.
Label axes with both variable names and units.
To organize large sets of continuous data into class intervals.
Group raw data into intervals (e.g. 0–10, 10–20...).
Count how many values fall in each class → this is the frequency.
Interpretation:
Can't identify exact values, only how many fall within each range.
Used to estimate things like the mean, median, or mode.
Note:
Classes should be equal width (especially for histograms).
Avoid overlapping boundaries (e.g., use 10–19 not 10–20).
Because exact data values are unknown, you use midpoints of each class as estimates.
Find midpoints of each class (e.g. for 10–20 → midpoint = 15).
Multiply midpoint × frequency for each row.
Add these up and divide by total frequency.
Note: Use a working table to stay organized.
To show how the total frequency accumulates up to a certain value.
Use upper class boundaries (e.g. for 10–20, use 20).
Add up frequencies cumulatively.
Plot (upper boundary, cumulative frequency).
Join points with a smooth curve (not straight lines).
Use the curve to estimate:
Median (at the 50th percentile)
Quartiles (25% and 75%)
Interquartile range (IQR) = Q3 − Q1
Note: Read off values by drawing lines from the y-axis (frequency) across to the curve and down.
To give a quick visual summary of the distribution of data.
Minimum
Lower quartile (Q1)
Median
Upper quartile (Q3)
Maximum
Use a number line.
Mark the five values.
Draw a box from Q1 to Q3.
Draw a line at the median.
Add whiskers from the box to the min and max.
Width of the box = Interquartile Range (spread of middle 50%)
Long whiskers = data is more spread out
Skewed if box is not symmetric
Tip: Box plots are great for comparing two sets of data!
To show continuous data where class intervals are not necessarily equal.
x-axis = class intervals
y-axis = frequency density
Frequency Density=Frequency/Class Width
Bars touch (since data is continuous)
Area of each bar = actual frequency
Interpretation:
Taller bars = higher density, not necessarily higher frequency.
Be careful if class widths are different!
Note: Always label axes properly and calculate frequency density clearly.
Averages:
Mean, Median
Spread:
Range, Interquartile Range (IQR)
Standard deviation (Higher only, rarely tested)
Use two clear comparisons, eg: Set A has a higher median, so the typical value is greater. However, Set B has a smaller IQR, meaning it is more consistent.
Note: Use box plots, cumulative frequency curves, or summary tables to justify comparisons.
This is the expected chance of an event happening, assuming fairness.
Probability = Number of desired outcomes/Total number of outcomes
Based on real trials or data:
Relative frequency = Number of times event occurs/Total trials
Use this to predict future outcomes or to estimate theoretical probability.
To list all possible outcomes of one or more events.
For one event: list all options (e.g. die roll: 1–6)
For two events: create a grid or list all pairs
Use sample space to find probabilities:
P(event)=Number of favourable outcomes/Total outcomes
Already mentioned above, but to clarify:
Used when outcomes are recorded (e.g. flipping a coin 100 times).
Values may vary from theoretical probability due to random variation.
Note: As number of trials increases, relative frequency tends to approach theoretical probability.
A∪B: Union (either A or B)
A∩B: Intersection (A and B)
A′: Not A (complement of A)
n(A): Number of elements in A
ξ: Universal set
Fill the Venn diagram with values.
Use set operations to calculate probabilities.
Note: Be careful not to double-count the overlap.
To show all possible outcomes of two or more linked events.
First branches: outcomes of first event.
Second branches: outcomes of second event (based on each first branch).
Multiply along branches for joint probability.
Add across branches for either/or situations.
Example: Tossing a coin twice
P(H then H) = 12×12=144
One event does not affect the other.
P(A and B)=P(A)×P(B)
Example: Tossing a coin and rolling a die.
One event affects the next (e.g. without replacement).
P(A then B)=P(A)×P(B ∣ A)
Example: Drawing 2 cards from a deck without replacement.
P(1st red) = 26/52
P(2nd red | 1st was red) = 25/51
So: P(2 reds)=26/52×25/51