Lecture 9 Notes: Aptitude, Ability & Achievement Tests

Lecture 9: Tests of Aptitude, Ability & Achievement

Lecture Overview

Alternative tests are needed to address limitations of standard intelligence tests like Wechsler and Stanford-Binet.
Key topics include the distinction between achievement, aptitude, and intelligence, and various alternative tests and scales.
Examples of tests to be discussed: WIAT (Achievement), WRAT-4 (Achievement), Peabody Picture Vocabulary Test, Lindamood Auditory Conceptualisation Test (LAC-III).
Group testing in education, civil service, and the military will also be covered.

Achievement vs. Aptitude vs. Intelligence

Achievement Tests
- Assess what a person has learned from a specific course of instruction.
- Content is curriculum-based.
- Areas examined include reading, reading comprehension, writing, and math.
Aptitude Tests
- Evaluate a student’s potential for learning, rather than what has already been learned.
- Predict more specific abilities like math, science, or music.
Intelligence Tests
- Attempt to predict future performance.
- Predict abilities that are general and broad.

Ability

Achievement, aptitude, and intelligence are highly interrelated.
Implication: there is considerable overlap, all three concepts are encompassed by the term "human ability."
Achievement, aptitude, and intelligence are distinguished as different types of ability.

Why not just use the Wechsler or the Stanford-Binet?

Stanford-Binet-V: Suitable for ages 2 - 85+
WPPSI-III: Suitable for ages 4 - 6 years
WISC-IV: Suitable for ages 6 - 16 years
WAIS-IV: Suitable for ages 16+ years
Limitations of Binet/Wechsler Tests
- No tests for infants under 2 years.
- Not appropriate for individuals with physical, sensor, or language disabilities.
- Not good for assessing learning disabilities/Specific Learning Disorders.
- Not appropriate for non-members of the dominant culture.
- Correlate strongly with scholastic achievement and reading ability.

Types of Alternative Tests

Specialized tests limited to a specific:
- Age range (e.g., infancy or early childhood).
- Area of cognitive ability (e.g., nonverbal ability).
Often used for screening, follow-up evaluations, and assessing children with or without disabilities.
Many tests require considerable training (e.g., Bayley Scales of Infant Development –III).
Every assessment requires skill in establishing rapport, recognizing signs of developmental conditions, and interpretation of assessment findings.
These tests are often used in conjunction with other tests (e.g., standardized batteries).
Be ALWAYS mindful and ethical in your interpretation and recommendations.

Purpose of Academic Assessments

Can answer the following questions:
- Can their academic difficulties be better explained by their intellectual abilities?
- Are they under/over performing?
- Better understanding of client and their mix of strengths and weaknesses.
- Hypothesis generation / testing.
  - E.g. Factors contributing to “problem behavior” (acting out in class, not completing homework, disengaging from school).
- Intervention.
- Test for a learning disorder / other learning problems.

Achievement/Ability Tests – Individually Administered

Purposes of individually administered achievement/ability tests include:
- Assessing how well a student is learning.
- Assessing how well a class, grade, school, school system or state is learning.
- Identifying learning problems.
- Determining whether a child is ready to proceed to the next grade.
- Evaluating teacher effectiveness.
- Determining readiness/suitability for tertiary programs.
- Determining if an individual has mastered knowledge for professional advancement (bar exam).

Infant Scales: Achievement of Developmental Milestones

Scale	Brazelton Neonatal Assessment Scale	Gessell Developmental Scale	Baley Scales of Infant Development	Cattel Infant Intelligence Scale
Age	3 days – 4 weeks	2.3 mth to 6.3 yr	2 to 30 mths	2 to 30 mths
Purpose	Newborn Competence	Developmental Status	Cognitive & Motor Functions	Infant Intelligence
Scales	27 behavioural 20 elicited	5 areas	DQ Mental Motor	Age scale Mental Age (IQ)
Downward Binet Norms	None	Not adequate	Yes	Yes
Reliability	Poor	No evidence	Yes	No
Predict future Intelligence	No	No	No	No
Other	Research Tool	Scoring Issues	Predicts Intellectual Disability	Outdated

Bayley Scales of Infant Development, 4th Edition

Domains
- Cognitive
  - Example Questions: attention to familiar and unfamiliar objects, looking for a fallen object, and pretend play.
- Language
  - Example Questions: recognition of objects and people, following directions, and naming objects and pictures.
- Motor
  - Example Questions: grasping, sitting, stacking blocks, and climbing stairs.
- Socioemotional (Parent Report)
- Adaptive (Parent Report)
Scaled Scores
Composite Scores
Percentile ranks
Growth Scores
High reliability
Good construct validity
Publication date: 2020
Completion time: 30 to 70 minutes
Administration: Paper-and-pencil or online administration (Parent report)
Age range: 16 days to 42 months
Need specific training for use of test

Purpose of Bayley Scales of Infant Development, 4th Edition

Employed to determine children's developmental functioning and plan management for kids with developmental delays.
BSID helps in the following ways:
- Early identification of intellectual delay. Re-assessments can help to monitor progress over time.
- To individualize the management and to accommodate a child's developmental and learning needs.
- To assess individual domains of development: for example, cognitive delay.
- To help researchers as a research tool.
Early diagnosis of developmental delay or intellectual developmental delay is a critical responsibility of health and educational professionals (particularly psychologists).
BSID helps in detecting developmental delay early and in initiating early developmental intervention.

Children & Adolescents

Scale	McCarthy Scales of Children’s Abilities	Kaufman Assessment Battery for Children (2nd Edition Normative Update): 2018
Age	2 – 8 years	3 – 18 years
Purpose	Intelligence	Abilities (psychological, clinical, neuropsych, learning disability
Scales	18 Scales	18 Scales
	General Cognitive Ability	Sequential processing
	Verbal	Simultaneous processing
	Perceptual-Performance	Learning
	Quantitative	Planning
Additional	Memory	knowledge
	Motor
Reliability/Validity	Good validity and reliability	Good validity and reliability
Other	Outdated (Has not been updated)	Derived for neuropsychological theory (but seen as problematic by some)
		Consistent with CHC theory

Ability in Populations with a disability

Scale	Columbia Mental Maturity Scale	Peabody Picture Vocabulary Test -5	Leiter International Performance Test-3
Age	3 – 12 years	2-90 years	2 to 18 years
Type of disability	Sensory, physical, language disabilities	Physical, Language disabilities	Pervasive developmental disorders
What is assessed	General reasoning ability	Vocabulary (verbal IQ)	Gf, visualisation, memory & attention
Format	Picture format	Picture format	Performance tasks
	Multiple choice response	Multiple choice response
	No timed tasks
Norms	Excellent	Excellent	Good
Reliability/Validity	Good	Good	Good
Other	Fine motor and verbal skills not required	Outdated	Oral language not needed
		Limited in that it only measures one aspect of ability

Learning disability/Specific Learning Disorder

Scale	Illinois Test of Psycholinguistic Abilities-III	Woodcock Johnson 4th Edition
Age	5 – 12 years, 11 months	2 - 90+ years
What is assessed	Children’s spoken and written language	From Information Perspective:
		Input problems
		Information Analysis problems
		Output problems
		CHC perspective
		General intellectual abilities (g)
		Specific cognitive factors
		Scholastic Aptitude
		Oral language
		Achievement
Scales	10 composite scores	69 narrow abilities CHC factors General factors achievement
	General language, Spoken language, Written language, Semantic, Grammar, Phonology, Comprehension, Spelling, Sight-Symbol Processing, Sound-Symbol Processing
Norms	US norms: Gender, ethnicity addressed	Australasian Norms: Cognitive and achievement components co-normed
Reliability	Good	Good
Validity	Spoken and written language	Does not align well with CHC model of intelligence

Wechsler Individual Achievement Test-III (WIAT-III)

WIAT-III: Australian Language Adapted Edition but US norms
A&NZ norm referenced for Pre-School to 18 years
19 to 50 years 11 months = Adult norms are from USA
WIAT-II Australian:
- Modifications of Subtests (reflect curriculum standards & classroom practices).
- Strengthening the link between assessment and intervention.
- Inclusion of Ability-Achievement Discrepancy Tables (WISC-IV FSIQ and Factor Scores).
Administration time: 45 minutes to 2 hours.

WIAT-III Application

Used for:
- Assessment of reading, spelling, mathematics, written expression, comprehension – can identify academic strengths and weaknesses.
- Provide information to assist with diagnosis of specific learning disorder or eligibility for educational service.
- Can provide information to assist with planning interventions.

WIAT-III

16 subtests
Can use subtests by themselves, or obtain a total achievement score, and/or seven composite scores
Subtests used for total achievement score differs across age groups
Seven domain composites:
- Oral Language, Total Reading, Basic Reading, Reading comprehension and Fluency, Written Expression, Mathematics, Maths Fluency

WIAT-III Composite and Subtest

Composite	Subtest
Oral Language (ages 4 - 17+)	Listening Comprehension
	Oral Expression
Total Reading (ages 6 – 17+)	Word Reading
	Pseudoword decoding
	Reading comprehension
	Oral reading fluency
Basic Reading (ages 6-17+)	Word reading
	Pseudoword decoding
Reading comprehension and fluency (ages 7- 17+)	Reading comprehension
	Oral reading fluency
Written Expression (ages 5-17+)	Alphabet writing fluency
	Spelling
	Sentence Composition
	Essay Composition
Mathematics (ages 5-17+)	Maths Problem Solving
	Numerical Operation
Maths Fluency (ages 6 –17+)	Math fluency – addition
	Math fluency – subtraction
	Math fluency - multiplication

WIAT-III Composite and Subtest Scores

Status Scores
- Standard Score
- Composite Scores
- Percentile Rank
- Normal Curve Equivalent
- Stanines
- Growth Scores
  - Grade Equivalent
  - Age Equivalent
  - Growth Scale Values
- Statistical Significance of Score Differences
- Frequency of score differences
- Qualitative evaluation of subtest component score differences
- Out-of-level item set performance.

Descriptive Classifications of Scores Ranges

Standard Score Range	Descriptive Classification
130 and above	Extremely High
120 – 129	Very High
110 – 119	High Average
90 – 109	Average
90 – 89	Low Average
70 – 79	Very Low
69 and below	Extremely low

WIAT-III Subtests

Composite	WIAT-III Subtest	Measures
TOTAL READING	Word Reading	• Letter identification • Phonological awareness • Letter-sound awareness • Accuracy of word recognition • Automaticity of word
	Reading Comprehension	• Literary comprehension • Inferential comprehension • Lexical comprehension •Oral reading comprehension • Word recognition in context
	Oral Reading Fluency	• Oral reading rate • Oral reading accuracy • Oral reading fluency
	Pseudoword Decoding	• Phonological decoding • Accuracy of word attack

WIAT-III Subtests

Composite	WIAT-III Subtest	Measures
Written Expression	Spelling	• Sound-letter awareness • Written spelling of regular & irregular words • Written spelling of homonyms (e.g., their vs. they’re; write vs. right)
	Written Expression	• Timed alphabet writing • Word fluency (written) • Sentence combining • Sentence Generation • Written responses to verbal & visual cues • Descriptive writing • Persuasive writing • Writing fluency (word count)
Oral Language	Listening Comprehension	• Receptive vocabulary • Expressive vocabulary • Listening-inferential comprehension
	Oral Expression	• Word fluency (oral) • Auditory short-term recall for contextual information • Story generation • Giving directions • Explaining steps in sequential tasks

WIAT-III Subtests

Composite	WIAT-II Subtest	Measures
Mathematics	Numerical Operations	• Counting • One-to-one correspondence • Numerical identification & writing • Calculation (addition, subtraction, multiplication, division) • Fractions, decimals, algebra
	Math Problem Solving	• Quantitative concepts • Multi-step problem solving • Money, time, and measurement • Geometry • Reading and interpreting charts and graphs • Statistics and probability • Estimation • Identifying patterns

Achievement tests WRAT-4


Age	5+ years
What is assessed	Grade Level Functioning
Scales	• Word Reading • Spelling • Math computation • Sentence Comprehension
Norms	Level 1: 5 to 11 years Level 2: 12 years +
Reliability	Good
Validity	Questionable
	• Intelligence tests measure potential abilities
	• Achievement tests what has actually been acquired
	• Discrepancies between the two are important

Psychometric properties

Extensive information in the manual.
Reliability: alternate forms (immediate & delayed), internal consistency.
Validity: Content, Concurrent (WIAT-II, Woodcock- Johnson III Tests of Achievement, Kaufman Test of Educational Achievement II)
Correlation with Test of General Cognitive Ability (WISC-IV, WAIS-III).

LINDAMOOD AUDITORY CONCEPTUALISATION TEST (LAC-3)

Assessment of reading-related abilities.
Assessment of phoneme awareness/auditory conceptualization skills.
- Essential for adequate reading ability.
- Phoneme awareness/auditory conceptualization play a central role in specific reading disabilities (dyslexia).
Purposes:
- Identification
- Quantification of strengths & weaknesses
- Tracking progress
- Research

Model of Language and Literacy Skills

Comprehension
- Auditory
  - Phonetic processing
- Visual
  - Sight words
- Language
  - Oral Vocabulary
  - Use of context

LAC-3 Components

Precheck
- Discontinue testing only if the person is unable to do any of the items.
- Sameness vs. different, number concept to 4, left-to-right order, first vs. last.
Category 1: Isolated Phoneme Patterns
- Develop familiarity with colored blocks to represent speech sounds.
- Requires discrimination of number of phonemes in a sequence, same or different, order of same and different.
- Persons 8 yrs and older usually have no difficulties
Category 2: Tracking Phonemes
- Ability to discriminate the identity and number of phonemes, as well as their order within a syllable.
- Single phoneme is added, substituted, omitted, shifted, or repeated.

LAC-3 Components

Category 3: Counting Syllables (NEW)
- Ability to perceive the number of syllables within pseudowords.
- Use of felt squares instead of blocks.
Category 4: Tracking Syllables (Multisyllables) (NEW)
- Ability to perceive a change in the identity, number, or order of syllables in a spoken pseudoword.
- Use of felt squares.
Category 5: Tracking Syllables and Phonemes (Multisyllables) (NEW)
- Ability to discriminate syllables and phonemes and perceive their identity, number, and order within a pseudoword.
- Use of felt squares AND blocks.

Psychometric Properties

Norms (N = 1003) for persons between 5.0 to 18 years 11 months.
Reliability
- Internal consistency .91 to .95
- Test-retest (2 weeks) .96 to .97
Validity
- Criterion prediction: correlated well with other tests of phonological awareness.
- Measures of Reading and Spelling: WRAT-3

LAC-3 Standard Scores

Standard Score	Descriptive Rating	Percentage Included
> 130	Very Superior	2.34
121 – 130	Superior	6.87
111 – 120	Above Average	16.12
90 – 110	Average	49.51
80 – 89	Below Average	16.12
70 – 79	Poor	6.87
< 70	Very Poor	2.34
	Mean = $100 \pm 15$

Benefits of Achievement tests

Allow the identification of children, classrooms, schools, and systems that are performing poorly – implement interventions.
Facilitate the diagnosis of learning difficulties – implement interventions.
Allows accurate placement of students in appropriate learning environment – beneficial for all students.
Identifies areas of strength and weakness – target weak areas.

Revisiting: Individual or group testing

Individual Tests	Group Tests
Characteristics	Characteristics
• One subject tested at a time	• Many subjects tested at a time
• Examiner records responses	• Participant records their own responses
• Scoring requires skill	• Scoring is simple and objective
• Examiner flexibility – if permitted by standardization	• No safeguards
Advantages	Advantages
• Provide information beyond the test score	• Cost-efficient
• Allow the examiner to observe behavior in a standard setting	• Minimize professional time for administration and scoring
• Allow individualized interpretation of test score	• Require less examiner skill and training
	• Have more objective and more reliable scoring procedures
	• Have broad application

Using group tests

Group Tests have good reliability & good norms
Validity can be scant, weak, or contradictory
Tests should be used with other data
Use the tests with caution & seek confirming alternative sources of information
Be wary of low scores – there are a multiplicity of alternative factors that produce low scores
Discrepancies among test scores and other data are a warning signal
When in doubt, use individual testing

Recap: Achievement versus Aptitude Tests

To summarize, achievement tests assess what you have learned, while aptitude tests assess potential for learning.
Achievement tests
- Easier and more reliable administration and scoring
- Validity determined by content-related evidence
Aptitude tests
- Assess a wide range of experiences in a variety of ways
- Evaluate effects of unknown/uncontrolled experiences
- Validity is based on ability to predict future performance

Recap: Achievement vs Aptitude Tests

Achievement tests	Aptitude tests
1. Evaluate the effects of a known or controlled set of experiences	1. Evaluate the effects of an unknown, uncontrolled set of experiences
2. Evaluate the product of a course of training	2. Evaluate the potential to profit from a course of training
3. Rely heavily on content validation procedures	3. Rely heavily on predictive criterion validation procedures