Lecture 9 Notes: Aptitude, Ability & Achievement Tests
Lecture 9: Tests of Aptitude, Ability & Achievement
Lecture Overview
- Alternative tests are needed to address limitations of standard intelligence tests like Wechsler and Stanford-Binet.
- Key topics include the distinction between achievement, aptitude, and intelligence, and various alternative tests and scales.
- Examples of tests to be discussed: WIAT (Achievement), WRAT-4 (Achievement), Peabody Picture Vocabulary Test, Lindamood Auditory Conceptualisation Test (LAC-III).
- Group testing in education, civil service, and the military will also be covered.
Achievement vs. Aptitude vs. Intelligence
- Achievement Tests
- Assess what a person has learned from a specific course of instruction.
- Content is curriculum-based.
- Areas examined include reading, reading comprehension, writing, and math.
- Aptitude Tests
- Evaluate a student’s potential for learning, rather than what has already been learned.
- Predict more specific abilities like math, science, or music.
- Intelligence Tests
- Attempt to predict future performance.
- Predict abilities that are general and broad.
Ability
- Achievement, aptitude, and intelligence are highly interrelated.
- Implication: there is considerable overlap, all three concepts are encompassed by the term "human ability."
- Achievement, aptitude, and intelligence are distinguished as different types of ability.
Why not just use the Wechsler or the Stanford-Binet?
- Stanford-Binet-V: Suitable for ages 2 - 85+
- WPPSI-III: Suitable for ages 4 - 6 years
- WISC-IV: Suitable for ages 6 - 16 years
- WAIS-IV: Suitable for ages 16+ years
- Limitations of Binet/Wechsler Tests
- No tests for infants under 2 years.
- Not appropriate for individuals with physical, sensor, or language disabilities.
- Not good for assessing learning disabilities/Specific Learning Disorders.
- Not appropriate for non-members of the dominant culture.
- Correlate strongly with scholastic achievement and reading ability.
Types of Alternative Tests
- Specialized tests limited to a specific:
- Age range (e.g., infancy or early childhood).
- Area of cognitive ability (e.g., nonverbal ability).
- Often used for screening, follow-up evaluations, and assessing children with or without disabilities.
- Many tests require considerable training (e.g., Bayley Scales of Infant Development –III).
- Every assessment requires skill in establishing rapport, recognizing signs of developmental conditions, and interpretation of assessment findings.
- These tests are often used in conjunction with other tests (e.g., standardized batteries).
- Be ALWAYS mindful and ethical in your interpretation and recommendations.
Purpose of Academic Assessments
- Can answer the following questions:
- Can their academic difficulties be better explained by their intellectual abilities?
- Are they under/over performing?
- Better understanding of client and their mix of strengths and weaknesses.
- Hypothesis generation / testing.
- E.g. Factors contributing to “problem behavior” (acting out in class, not completing homework, disengaging from school).
- Intervention.
- Test for a learning disorder / other learning problems.
Achievement/Ability Tests – Individually Administered
- Purposes of individually administered achievement/ability tests include:
- Assessing how well a student is learning.
- Assessing how well a class, grade, school, school system or state is learning.
- Identifying learning problems.
- Determining whether a child is ready to proceed to the next grade.
- Evaluating teacher effectiveness.
- Determining readiness/suitability for tertiary programs.
- Determining if an individual has mastered knowledge for professional advancement (bar exam).
Infant Scales: Achievement of Developmental Milestones
| Scale | Brazelton Neonatal Assessment Scale | Gessell Developmental Scale | Baley Scales of Infant Development | Cattel Infant Intelligence Scale |
|---|
| Age | 3 days – 4 weeks | 2.3 mth to 6.3 yr | 2 to 30 mths | 2 to 30 mths |
| Purpose | Newborn Competence | Developmental Status | Cognitive & Motor Functions | Infant Intelligence |
| Scales | 27 behavioural 20 elicited | 5 areas | DQ Mental Motor | Age scale Mental Age (IQ) |
| Downward Binet Norms | None | Not adequate | Yes | Yes |
| Reliability | Poor | No evidence | Yes | No |
| Predict future Intelligence | No | No | No | No |
| Other | Research Tool | Scoring Issues | Predicts Intellectual Disability | Outdated |
Bayley Scales of Infant Development, 4th Edition
- Domains
- Cognitive
- Example Questions: attention to familiar and unfamiliar objects, looking for a fallen object, and pretend play.
- Language
- Example Questions: recognition of objects and people, following directions, and naming objects and pictures.
- Motor
- Example Questions: grasping, sitting, stacking blocks, and climbing stairs.
- Socioemotional (Parent Report)
- Adaptive (Parent Report)
- Scaled Scores
- Composite Scores
- Percentile ranks
- Growth Scores
- High reliability
- Good construct validity
- Publication date: 2020
- Completion time: 30 to 70 minutes
- Administration: Paper-and-pencil or online administration (Parent report)
- Age range: 16 days to 42 months
- Need specific training for use of test
Purpose of Bayley Scales of Infant Development, 4th Edition
- Employed to determine children's developmental functioning and plan management for kids with developmental delays.
- BSID helps in the following ways:
- Early identification of intellectual delay. Re-assessments can help to monitor progress over time.
- To individualize the management and to accommodate a child's developmental and learning needs.
- To assess individual domains of development: for example, cognitive delay.
- To help researchers as a research tool.
- Early diagnosis of developmental delay or intellectual developmental delay is a critical responsibility of health and educational professionals (particularly psychologists).
- BSID helps in detecting developmental delay early and in initiating early developmental intervention.
Children & Adolescents
| Scale | McCarthy Scales of Children’s Abilities | Kaufman Assessment Battery for Children (2nd Edition Normative Update): 2018 |
|---|
| Age | 2 – 8 years | 3 – 18 years |
| Purpose | Intelligence | Abilities (psychological, clinical, neuropsych, learning disability |
| Scales | 18 Scales | 18 Scales |
| General Cognitive Ability | Sequential processing |
| Verbal | Simultaneous processing |
| Perceptual-Performance | Learning |
| Quantitative | Planning |
| Additional | Memory | knowledge |
| Motor | |
| Reliability/Validity | Good validity and reliability | Good validity and reliability |
| Other | Outdated (Has not been updated) | Derived for neuropsychological theory (but seen as problematic by some) |
| | Consistent with CHC theory |
Ability in Populations with a disability
| Scale | Columbia Mental Maturity Scale | Peabody Picture Vocabulary Test -5 | Leiter International Performance Test-3 |
|---|
| Age | 3 – 12 years | 2-90 years | 2 to 18 years |
| Type of disability | Sensory, physical, language disabilities | Physical, Language disabilities | Pervasive developmental disorders |
| What is assessed | General reasoning ability | Vocabulary (verbal IQ) | Gf, visualisation, memory & attention |
| Format | Picture format | Picture format | Performance tasks |
| Multiple choice response | Multiple choice response | |
| No timed tasks | | |
| Norms | Excellent | Excellent | Good |
| Reliability/Validity | Good | Good | Good |
| Other | Fine motor and verbal skills not required | Outdated | Oral language not needed |
| | Limited in that it only measures one aspect of ability | |
Learning disability/Specific Learning Disorder
| Scale | Illinois Test of Psycholinguistic Abilities-III | Woodcock Johnson 4th Edition |
|---|
| Age | 5 – 12 years, 11 months | 2 - 90+ years |
| What is assessed | Children’s spoken and written language | From Information Perspective: |
| | Input problems |
| | Information Analysis problems |
| | Output problems |
| | CHC perspective |
| | General intellectual abilities (g) |
| | Specific cognitive factors |
| | Scholastic Aptitude |
| | Oral language |
| | Achievement |
| Scales | 10 composite scores | 69 narrow abilities CHC factors General factors achievement |
| General language, Spoken language, Written language, Semantic, Grammar, Phonology, Comprehension, Spelling, Sight-Symbol Processing, Sound-Symbol Processing | |
| Norms | US norms: Gender, ethnicity addressed | Australasian Norms: Cognitive and achievement components co-normed |
| Reliability | Good | Good |
| Validity | Spoken and written language | Does not align well with CHC model of intelligence |
Wechsler Individual Achievement Test-III (WIAT-III)
- WIAT-III: Australian Language Adapted Edition but US norms
- A&NZ norm referenced for Pre-School to 18 years
- 19 to 50 years 11 months = Adult norms are from USA
- WIAT-II Australian:
- Modifications of Subtests (reflect curriculum standards & classroom practices).
- Strengthening the link between assessment and intervention.
- Inclusion of Ability-Achievement Discrepancy Tables (WISC-IV FSIQ and Factor Scores).
- Administration time: 45 minutes to 2 hours.
WIAT-III Application
- Used for:
- Assessment of reading, spelling, mathematics, written expression, comprehension – can identify academic strengths and weaknesses.
- Provide information to assist with diagnosis of specific learning disorder or eligibility for educational service.
- Can provide information to assist with planning interventions.
WIAT-III
- 16 subtests
- Can use subtests by themselves, or obtain a total achievement score, and/or seven composite scores
- Subtests used for total achievement score differs across age groups
- Seven domain composites:
- Oral Language, Total Reading, Basic Reading, Reading comprehension and Fluency, Written Expression, Mathematics, Maths Fluency
WIAT-III Composite and Subtest
| Composite | Subtest |
|---|
| Oral Language (ages 4 - 17+) | Listening Comprehension |
| Oral Expression |
| Total Reading (ages 6 – 17+) | Word Reading |
| Pseudoword decoding |
| Reading comprehension |
| Oral reading fluency |
| Basic Reading (ages 6-17+) | Word reading |
| Pseudoword decoding |
| Reading comprehension and fluency (ages 7- 17+) | Reading comprehension |
| Oral reading fluency |
| Written Expression (ages 5-17+) | Alphabet writing fluency |
| Spelling |
| Sentence Composition |
| Essay Composition |
| Mathematics (ages 5-17+) | Maths Problem Solving |
| Numerical Operation |
| Maths Fluency (ages 6 –17+) | Math fluency – addition |
| Math fluency – subtraction |
| Math fluency - multiplication |
WIAT-III Composite and Subtest Scores
- Status Scores
- Standard Score
- Composite Scores
- Percentile Rank
- Normal Curve Equivalent
- Stanines
- Growth Scores
- Grade Equivalent
- Age Equivalent
- Growth Scale Values
- Statistical Significance of Score Differences
- Frequency of score differences
- Qualitative evaluation of subtest component score differences
- Out-of-level item set performance.
Descriptive Classifications of Scores Ranges
| Standard Score Range | Descriptive Classification |
|---|
| 130 and above | Extremely High |
| 120 – 129 | Very High |
| 110 – 119 | High Average |
| 90 – 109 | Average |
| 90 – 89 | Low Average |
| 70 – 79 | Very Low |
| 69 and below | Extremely low |
WIAT-III Subtests
| Composite | WIAT-III Subtest | Measures |
|---|
| TOTAL READING | Word Reading | • Letter identification • Phonological awareness • Letter-sound awareness • Accuracy of word recognition • Automaticity of word |
| Reading Comprehension | • Literary comprehension • Inferential comprehension • Lexical comprehension •Oral reading comprehension • Word recognition in context |
| Oral Reading Fluency | • Oral reading rate • Oral reading accuracy • Oral reading fluency |
| Pseudoword Decoding | • Phonological decoding • Accuracy of word attack |
WIAT-III Subtests
| Composite | WIAT-III Subtest | Measures |
|---|
| Written Expression | Spelling | • Sound-letter awareness • Written spelling of regular & irregular words • Written spelling of homonyms (e.g., their vs. they’re; write vs. right) |
| Written Expression | • Timed alphabet writing • Word fluency (written) • Sentence combining • Sentence Generation • Written responses to verbal & visual cues • Descriptive writing • Persuasive writing • Writing fluency (word count) |
| Oral Language | Listening Comprehension | • Receptive vocabulary • Expressive vocabulary • Listening-inferential comprehension |
| Oral Expression | • Word fluency (oral) • Auditory short-term recall for contextual information • Story generation • Giving directions • Explaining steps in sequential tasks |
WIAT-III Subtests
| Composite | WIAT-II Subtest | Measures |
|---|
| Mathematics | Numerical Operations | • Counting • One-to-one correspondence • Numerical identification & writing • Calculation (addition, subtraction, multiplication, division) • Fractions, decimals, algebra |
| Math Problem Solving | • Quantitative concepts • Multi-step problem solving • Money, time, and measurement • Geometry • Reading and interpreting charts and graphs • Statistics and probability • Estimation • Identifying patterns |
Achievement tests WRAT-4
| |
|---|
| Age | 5+ years |
| What is assessed | Grade Level Functioning |
| Scales | • Word Reading • Spelling • Math computation • Sentence Comprehension |
| Norms | Level 1: 5 to 11 years Level 2: 12 years + |
| Reliability | Good |
| Validity | Questionable |
| • Intelligence tests measure potential abilities |
| • Achievement tests what has actually been acquired |
| • Discrepancies between the two are important |
Psychometric properties
- Extensive information in the manual.
- Reliability: alternate forms (immediate & delayed), internal consistency.
- Validity: Content, Concurrent (WIAT-II, Woodcock- Johnson III Tests of Achievement, Kaufman Test of Educational Achievement II)
- Correlation with Test of General Cognitive Ability (WISC-IV, WAIS-III).
LINDAMOOD AUDITORY CONCEPTUALISATION TEST (LAC-3)
- Assessment of reading-related abilities.
- Assessment of phoneme awareness/auditory conceptualization skills.
- Essential for adequate reading ability.
- Phoneme awareness/auditory conceptualization play a central role in specific reading disabilities (dyslexia).
- Purposes:
- Identification
- Quantification of strengths & weaknesses
- Tracking progress
- Research
Model of Language and Literacy Skills
- Comprehension
- Auditory
- Visual
- Language
- Oral Vocabulary
- Use of context
LAC-3 Components
- Precheck
- Discontinue testing only if the person is unable to do any of the items.
- Sameness vs. different, number concept to 4, left-to-right order, first vs. last.
- Category 1: Isolated Phoneme Patterns
- Develop familiarity with colored blocks to represent speech sounds.
- Requires discrimination of number of phonemes in a sequence, same or different, order of same and different.
- Persons 8 yrs and older usually have no difficulties
- Category 2: Tracking Phonemes
- Ability to discriminate the identity and number of phonemes, as well as their order within a syllable.
- Single phoneme is added, substituted, omitted, shifted, or repeated.
LAC-3 Components
- Category 3: Counting Syllables (NEW)
- Ability to perceive the number of syllables within pseudowords.
- Use of felt squares instead of blocks.
- Category 4: Tracking Syllables (Multisyllables) (NEW)
- Ability to perceive a change in the identity, number, or order of syllables in a spoken pseudoword.
- Use of felt squares.
- Category 5: Tracking Syllables and Phonemes (Multisyllables) (NEW)
- Ability to discriminate syllables and phonemes and perceive their identity, number, and order within a pseudoword.
- Use of felt squares AND blocks.
Psychometric Properties
- Norms (N = 1003) for persons between 5.0 to 18 years 11 months.
- Reliability
- Internal consistency .91 to .95
- Test-retest (2 weeks) .96 to .97
- Validity
- Criterion prediction: correlated well with other tests of phonological awareness.
- Measures of Reading and Spelling: WRAT-3
LAC-3 Standard Scores
| Standard Score | Descriptive Rating | Percentage Included |
|---|
| > 130 | Very Superior | 2.34 |
| 121 – 130 | Superior | 6.87 |
| 111 – 120 | Above Average | 16.12 |
| 90 – 110 | Average | 49.51 |
| 80 – 89 | Below Average | 16.12 |
| 70 – 79 | Poor | 6.87 |
| < 70 | Very Poor | 2.34 |
| Mean = 100 \pm 15 | |
Benefits of Achievement tests
- Allow the identification of children, classrooms, schools, and systems that are performing poorly – implement interventions.
- Facilitate the diagnosis of learning difficulties – implement interventions.
- Allows accurate placement of students in appropriate learning environment – beneficial for all students.
- Identifies areas of strength and weakness – target weak areas.
Revisiting: Individual or group testing
| Individual Tests | Group Tests |
|---|
| Characteristics | Characteristics |
| • One subject tested at a time | • Many subjects tested at a time |
| • Examiner records responses | • Participant records their own responses |
| • Scoring requires skill | • Scoring is simple and objective |
| • Examiner flexibility – if permitted by standardization | • No safeguards |
| Advantages | Advantages |
| • Provide information beyond the test score | • Cost-efficient |
| • Allow the examiner to observe behavior in a standard setting | • Minimize professional time for administration and scoring |
| • Allow individualized interpretation of test score | • Require less examiner skill and training |
| • Have more objective and more reliable scoring procedures |
| • Have broad application |
Using group tests
- Group Tests have good reliability & good norms
- Validity can be scant, weak, or contradictory
- Tests should be used with other data
- Use the tests with caution & seek confirming alternative sources of information
- Be wary of low scores – there are a multiplicity of alternative factors that produce low scores
- Discrepancies among test scores and other data are a warning signal
- When in doubt, use individual testing
Recap: Achievement versus Aptitude Tests
- To summarize, achievement tests assess what you have learned, while aptitude tests assess potential for learning.
- Achievement tests
- Easier and more reliable administration and scoring
- Validity determined by content-related evidence
- Aptitude tests
- Assess a wide range of experiences in a variety of ways
- Evaluate effects of unknown/uncontrolled experiences
- Validity is based on ability to predict future performance
Recap: Achievement vs Aptitude Tests
| Achievement tests | Aptitude tests |
|---|
| 1. Evaluate the effects of a known or controlled set of experiences | 1. Evaluate the effects of an unknown, uncontrolled set of experiences |
| 2. Evaluate the product of a course of training | 2. Evaluate the potential to profit from a course of training |
| 3. Rely heavily on content validation procedures | 3. Rely heavily on predictive criterion validation procedures |