Displaying Data
Why Visualise Data?
• Generates insight that is often hidden in raw tables or spreadsheets.
• Uncovers underlying structure (e.g., clustering, skewness, multimodality).
• Pin-points important variables worth modelling further.
• Detects outliers or data-entry errors early.
• Helps researchers, practitioners, and lay readers make sense of patterns quickly.
• Bridges technical analysis and decision-making (policy, policing, social change).
Frequency Distributions
A frequency distribution table summarises how often each category occurs.
Parts of a Frequency Table
• Absolute Frequency (Count): raw tally in each category.
• Relative Frequency: proportion or percentage of the total.
• Cumulative Frequency: running total of the absolute frequencies.
• Cumulative Relative Frequency: running total of the percentages.
When to Use
• Data are categorical (nominal or ordinal) or discrete numerical.
• Goal: compare categories, identify dominant groups, or prepare for further visualisation.
Worked Example – Offence Type (600 + rows of data)
• Dataset: criminal incident records between – .
• Research question: “What is the distribution of the Offence Type variable?”
Absolute Frequencies (Pivot Table in Excel)
• Assault –
• Drug Offences –
• Fraud –
• Good Order Offences –
• Handling Stolen Goods –
• Liquor (excl. Drunkenness) –
• Miscellaneous Offences –
• … (total categories, grand total ).
Relative Frequencies (same order)
• Assault –
• Drug Offences –
• Fraud –
• Good Order Offences – ← most common
• Handling Stolen Goods –
• Liquor (excl. Drunkenness) –
• … (sums to ).
Cumulative Frequencies & Percentages (key checkpoints)
To figure out the cumulative frequency of each class, you simply add its frequency to the frequency of the previous class.
• By the time we include Good Order Offences we’ve reached cases (≈ of total).
• Including Other Theft raises the cumulative relative frequency to .
• Final running totals match the grand total and .
Significance
• Highlights a “long-tail” distribution: a few categories dominate, many are rare.
• Guides resource allocation (e.g., policing priorities) or merges rare categories for modelling.
Bar Charts
• Visual representation for categorical/discrete data.
• Each bar’s height = frequency (absolute or relative).
• Bars separated by gaps to signify categorical, non-continuous nature.
Example – Crime Counts by Offence Type
• Raw Excel plot first appeared unordered, cluttered.
• Improved version:
– Categories re-sorted from most to least frequent.
– Title contextualised: “Crime Counts by Offence Type between – ”.
– X-axis labels angled/abridged for readability.
• Presentation tips: order logically, keep axis units consistent, avoid 3-D effects.
Histograms
• Designed for continuous or large-range discrete variables.
• X-axis divided into contiguous bins; no gaps (data are numeric and continuous along the axis).
• Each rectangle’s area represents frequency or density.
• Reveals modality (one peak vs multiple), skew, and outliers.
Example – Respondent Age (World Values Survey 2018)
• ages; observed min , max (potential distinct values).
• Instead of thin bars, group into broader bins (e.g., ).
• Final histogram quickly shows:
– Mode cluster in – range.
– Tapering tail in + ages.
– Potential data-entry errors if extreme ages (e.g., >100) pop up.
Research Example – Violent Crime Counts per Neighbourhood
• Histogram across Brisbane suburbs (Figure 4.9).
• Highlights positively skewed distribution: many suburbs with low counts, few with very high counts.
• Assists analysts in targeting outlier suburbs for situational crime prevention.
Line Charts (Time-Series)
• Best for showing change over ordered time periods.
• X-axis = time (years, months, weeks); Y-axis = value of interest.
• Multiple lines may compare groups (male vs female), variables, or geographies.
Example – Queensland Imprisonment Rate –
• Data excerpt (rates per population):
– : Male , Female .
– : Male , Female .
– : Male , Female .
• Observations:
– Overall upward trend for both genders; steeper for males.
– Temporary dip in (male ) before sustained growth.
– Public policy relevance: prison crowding, gender-specific interventions.
Practical / Ethical / Philosophical Considerations
• Clarity vs deception: mis-scaled axes or truncated zeros can mislead.
• Accessibility: colour palettes should be colour-blind friendly.
• Privacy: granular maps or line charts can inadvertently re-identify individuals.
• Equity: focusing solely on dominant categories might obscure minority experiences (e.g., low-frequency offences that disproportionately affect vulnerable groups).