Sections from Castles, Rastle & Nation (2018): Ending the Reading Wars: Reading Acquisition from Novice to Expert,
Chapter 6: Studying Eye Movements (pp. 168–170)
Why study eye movements?
Eye movements provide online, moment-by-moment evidence of language processing.
Unlike reaction times, they show where and when processing difficulty occurs during reading.
Based on the assumption that eye position closely reflects attention.
Key components of eye movements in reading
Fixations
Brief pauses (≈200–250 ms) where visual information is extracted.
Most linguistic processing occurs during fixations.
Saccades
Rapid eye movements between fixations.
Vision is largely suppressed during saccades.
Regressions
Backward saccades to earlier text.
Often indicate processing difficulty, ambiguity, or integration problems.
Measures used in eye-tracking studies
First-fixation duration
Sensitive to early lexical processing (e.g., word frequency).
Gaze duration
Sum of all fixations on a word before moving on.
Reflects lexical access + early integration.
Regression path duration
Time from first fixating a word until the eye moves past it to the right.
Sensitive to syntactic/semantic difficulty.
Total reading time
Includes re-reading; reflects later, integrative processes.
What eye movements reveal
Readers do not process all words equally:
Short, frequent, and predictable words are often skipped.
Processing difficulty shows up immediately, not only after sentence completion.
Strong evidence against strictly serial, post-lexical models of reading.
Chapter 6: Models of Visual Word Recognition (pp. 192–198)
Core problem
How does the reader:
Identify a visual pattern
Activate the correct word representation
Do so rapidly and accurately, despite noise and ambiguity?
1. The Logogen Model (Morton)
Each word has a logogen (a word detector).
Logogens accumulate evidence from:
Visual input
Context
When activation exceeds a threshold, the word is recognised.
Key strengths
Explains:
Word frequency effects (lower thresholds for frequent words)
Context effects
Limitations
Vague about:
Letter-level processing
Competition between words
Largely descriptive, not computational.
2. Interactive Activation Model (IAM) (McClelland & Rumelhart)
Architecture
Three levels:
Feature level
Letter level
Word level
Excitatory connections between compatible units.
Inhibitory connections between competitors at the same level.
Key principles
Processing is parallel.
Information flows bottom-up and top-down.
Word recognition emerges from competition.
Explains
Word superiority effect (letters recognised better in words than isolation).
Frequency effects.
Contextual facilitation.
Importance
One of the most influential models in psycholinguistics.
Basis for later models of spoken word recognition (e.g., TRACE).
3. Dual-Route Cascaded (DRC) Model
Two routes for reading aloud:
Lexical route: whole-word recognition
Non-lexical route: grapheme-to-phoneme conversion
Explains
Reading of:
Irregular words (e.g., yacht)
Non-words (e.g., blint)
Criticism
Less successful at explaining:
Semantic effects
Graded frequency effects
More modular than interactive models.
Chapter 6: Effects of Meaning Frequency & Prior Context (pp. 202–205)
⚠ This section contains more detail than required — key take-home points are highlighted.
Word frequency effects
High-frequency words:
Shorter fixation durations
More likely to be skipped
Frequency effects occur:
Early (first fixation)
Persist even with strong contextual support
Meaning (sense) frequency
Words with multiple meanings:
Dominant meaning accessed faster than subordinate meanings.
Subordinate meanings cause:
Longer fixations
Increased regressions
Prior context effects
Context facilitates word recognition but does not eliminate frequency effects.
Strong context:
Speeds recognition
Reduces competition
Evidence supports interactive models, not purely bottom-up ones.
Theoretical implications
Lexical access is:
Probabilistic
Influenced by prior knowledge
Sensitive to distributional properties of language
Appendix: Connectionism (pp. 481–483)
What is connectionism?
A class of models where cognition emerges from:
Many simple processing units
Operating in parallel
Connected by weighted links
Also known as:
Parallel Distributed Processing (PDP)
Neural network models
Core principles
Simple units
Units pass activation, nothing more.
Weighted connections
Positive weights excite
Negative weights inhibit
Emergent behaviour
Complex cognition arises from simple interactions.
Interactive Activation & Competition (IAC) models
No learning (weights are hand-coded).
Used extensively in word recognition.
Include:
Excitation between compatible units
Inhibition between competitors
Processing
Activation spreads through cycles.
Network settles into a stable pattern.
Best-matching word remains active.
Learning vs non-learning models
IAC models: explain how processing works
Learning models (e.g., back-propagation): explain how knowledge is acquired
Back-propagation details are beyond the required scope
Why connectionism matters
Explains:
Frequency effects
Generalisation to novel items
Graded, probabilistic behaviour
Avoids explicit symbolic rules.
Strongly influenced modern psycholinguistic theory.
I