Intelligence Testing: Uses & Limits

Intelligence Testing: Uses & Limits

Origins & Development
  • Attempts to measure human intelligence in Western tradition date back to the mid-1800s.
  • Originally used for military purposes and by eugenicists.
  • Alfred Binet developed the first ‘IQ’ tests designed to identify school children struggling academically.
  • IQ tests quantify intellectual ability by comparing scores against a cohort's expected standards (e.g., children of the same age).
Original Formulation
  • The original IQ score was a standardized score indicating performance relative to cohort expectations.
  • A higher score indicates exceeding expected standards while a lower score indicates underperformance.
Initial Applications
  • Primarily used in education.
    • Identifying students with low ability for remedial help.
    • Identifying students with high ability for advanced learning opportunities.
  • Modern neuropsychological tests can target specific cognitive impairments.
What is Measured
  • Intelligence tests for adults do not depend on developmental benchmarks but rely on comparisons to cohorts.
  • IQ tests measure individual differences rather than an absolute amount of intelligence.
Standardisation & Ranking
  • Intelligence tests are viewed as tools to rank performance on intellectually demanding tasks.
  • Rankings are often expressed in percentile scores, indicating the percentage of individuals scoring higher or lower.
  • Example: University admissions scores (e.g., ATAR).
Standardised Scores
  • Standardized intelligence tests convert ranking into scores with a fixed reference point.
  • A common IQ scale sets average at 100, with a standard deviation of 15.
  • A Z-score sets the average at 0 and standard deviation at 1.
Standardised Distribution
  • Standardized intelligence scores typically follow a Gaussian (normal) distribution, also known as the Bell Curve.
  • Most values cluster around the average, with extreme scores being less common.
Application of Measurement
  • Intelligence is a useful but imperfect predictor of life outcomes, explaining about 60% of variations like academic performance.
  • Intelligence tests come with a degree of measurement error.
Limitations, Assumptions & Harms
  • Intelligence testing has several historical problems categorized into three areas:
    • Validity issues with tests.
    • Interpretation problems of test results.
    • Social and policy implications due to misinterpretation of differences.
Reliability & Validity
  • Quality of measurement tools is assessed via Reliability & Validity:
    • Reliability: Consistency in results across multiple tests.
    • Validity: Whether the test measures what it purports to measure.
  • An intelligence test is valid only when the design aligns with the characteristics of the test-takers.
Reliability & Validity Illustrated
  • A Reliable intelligence test yields similar scores for the same person across attempts.
  • A Valid intelligence test should measure intelligence, not cultural familiarity or socioeconomic status.
Contextual Assumptions
  • Assumptions in intelligence testing can undermine validity if mismatched with test-takers’ characteristics.
  • Many tests may reinforce existing social hierarchies based on race, gender, and culture, comparing averages from groups for whom the test was designed.
Common Assumptions
  • Intelligence tests usually assume:
    • Familiarity with the test language.
    • Knowledge of specific cultural references.
    • Experience with written problem-solving formats.
    • Motivation to score well and a suitable mental state during testing.
    • Knowledge of expected answer formats.
Assumptions Beyond Language
  • Even non-verbal tests carry assumptions about familiarity with test formats and cognitive patterns in processing questions.
Validity of Comparisons
  • Invalid IQ test comparisons reinforce social hierarchies, marginalizing groups that perform poorly on tests designed for mainstream cultures.
  • Meaningful IQ comparisons require inclusive standardization for the entire group under comparison, which is infrequently achieved.
Conclusion
  • IQ tests primarily rank individuals against an average standard.
  • Standardized scores are adjusted based on reference groups.
  • The assumptions underlying tests can lead to invalid scores or comparisons.