Chapter 3: Giving Meaning to Scores

Chapter 3: Giving Meaning to Scores
Nature of a Score
- A score by itself lacks interpretative meaning.
- For context, it is important to know whether a score is good or bad, e.g., a score of 44 could either be out of 44 (perfect) or 100 (poor).
- Different tests measure different skills, e.g., a score of 15 on a simple spelling test is not comparable to a score of 15 on a difficult one (e.g., Test A vs Test B in Table 3-1).
Frames of Reference
- The meaning derived from scores depends on the frame of reference used for comparison.
- Three dimensions define this frame of reference:
1. Temporal Dimension: Current skills vs. future abilities.
2. Performance Type: Maximum performance (correct answers) vs. typical performance (how often tasks are completed).
3. Comparison Standard: Test content, personal past performance, or group average.
Types of Evaluations
- Summative Evaluation: Measures accomplishments at the end of a period (e.g., report cards).
- Formative Evaluation: Assess skills to guide ffuture instruction.
Achievement Tests
- Measure what has been learned.
- Types include domain-referenced and criterion-referenced tests.
- Domain-referenced tests focus on mastery of specific knowledge.
Norm-Referenced Evaluation
- Evaluates performance based on comparison with normative data or groups.
- Types of norms include Grade Norms, Age Norms, Percentile Norms, and Standard Score Norms.
Cautions in Using Norms
- Norms do not indicate absolute achievement.
- An individual can be high relative to a group but still low in absolute terms.
- Educational performance should not solely depend on student comparisons; factors such as learning environment should also be considered.
Item Response Theory (IRT)
- Focuses on the characteristics of individual items on a test to provide information about a person’s abilities.
- Uses curves to indicate probability of a correct response.
- Promotes computer adaptive testing (CAT) for personalized assessment.
Summary
- Scores are understood through context and comparison, either criterion-based or norm-based.
- Diverse scoring methods (e.g., age norms, grade equivalents) allow differentiated analysis for setting educational targets and standards.
Key Terms:
- Criterion-Referenced Evaluation: Focuses on performance against a standard.
- Norm-Referenced Evaluation: Focuses on performance in relation to others.
- Developmental Standard Scores: Interval scores that provide estimates of ability.
- Percentile Ranks: Indicates percentage of the norm group that scores below a certain score.
- Z-scores: Standard deviations from the mean indicating individual performance.
Examples and Tables referenced include:
- Table 3-1: Two spelling tests differences.
- Table 3-4: Determining Percentile Ranks.
- Table 3-6: Standard Scores and Norms.
- Figures showcasing grade equivalent comparisons and profiles.