Test Development Flashcards

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/41

flashcard set

Earn XP

Description and Tags

Vocabulary flashcards about test development, covering concepts from initial progress to qualitative item analysis.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

42 Terms

1
New cards

Conceptualization (Test Development)

The process of defining what a test is designed to measure, including how the test developer defines the construct and how it differs from similar tests.

2
New cards

Stimulus for Test Development

An emerging social phenomenon or pattern of behavior that might inspire the development of a new test.

3
New cards

Pilot Work

Preliminary research surrounding the creation of a test prototype, involving evaluating test items for inclusion in the final instrument.

4
New cards

Scaling

The process of setting rules for assigning numbers in measurement; the design and calibration of a measuring device.

5
New cards

Age-based Scale

A scale measuring a testtaker’s performance as a function of age.

6
New cards

Grade-based Scale

A scale measuring a testtaker’s performance as a function of their grade level.

7
New cards

Stanine Scale

A scale used to transform raw test scores into scores ranging from 1 to 9.

8
New cards

Unidimensional Test

A test where only one dimension is presumed to underlie the ratings.

9
New cards

Multidimensional Test

A test where more than one dimension is thought to guide the testtaker’s responses.

10
New cards

Comparative Scaling

Entails judgments of a stimulus in comparison with every other stimulus on the scale.

11
New cards

Categorical Scaling

Stimuli are placed into one of two or more alternative categories that differ quantitatively with respect to some continuum.

12
New cards

Rating Scale

A grouping of words, statements, or symbols on which judgments of the strength of a trait, attitude, or emotion are indicated.

13
New cards

Summative Scale

A scale in which the final test score is obtained by summing the ratings across all items.

14
New cards

Likert Scale

A scale where each item presents testtakers with five to seven alternate responses, usually on an agree-disagree continuum.

15
New cards

Method of Paired Comparisons

Testtakers are presented with pairs of stimuli and must select one according to a rule.

16
New cards

Guttman Scale

Items range sequentially from weaker to stronger expressions of attitude, belief, or feeling being measured.

17
New cards

Method of Equal-Appearing Intervals

A scaling method used to obtain data presumed to be interval in nature, involving collecting and evaluating statements reflecting positive and negative attitudes.

18
New cards

Item Pool

The reservoir from which items will or will not be drawn for the final version of a test.

19
New cards

Selected-Response Format

Requires testtakers to select a response from a set of alternative responses.

20
New cards

Matching Item

Testtakers are presented with two columns: premises and responses, and the task is to associate each response with the correct premise.

21
New cards

Binary-Choice Item

Usually takes the form of a sentence that requires the testtaker to indicate whether the statement is or is not a fact.

22
New cards

Constructed-Response Format

Require testtakers to supply or create the correct answer, not merely select it.

23
New cards

Completion Item

Requires the examinee to provide a word or phrase that completes a sentence.

24
New cards

Short-Answer Item

Another form of completion item that is more on identification rather than sentence completion

25
New cards

Essay Item

A test item requiring the testtaker to respond to a question by writing a composition, typically demonstrating recall, understanding, analysis, or interpretation.

26
New cards

Item Bank

A large and easily accessible collection of test questions classified by subject area or item statistics.

27
New cards

Computerized Adaptive Testing (CAT)

An interactive, computer-administered test-taking process where items presented are based on the testtaker’s performance on previous items.

28
New cards

Item Branching

Ability of the computer to tailor the content and order of test items based on responses to previous items.

29
New cards

Cumulative Model

The higher the score on the test, the higher the testtaker is on the characteristic being measured.

30
New cards

Class Scoring/Category Scoring

Testtaker responses earn credit toward placement in a class or category with others showing similar response patterns.

31
New cards

Ipsative Scoring

Comparing a testtaker’s score on one scale within a test to another scale within that same test.

32
New cards

Phantom Factors

Factors that are just artifacts of the small sample size

33
New cards

Item-Difficulty Index (p)

The proportion of testtakers who answered the item correctly.

34
New cards

Item-Reliability Index

Indicates the internal consistency of a test, equal to the product of the item-score standard deviation and the correlation between the item score and the total test score.

35
New cards

Item-Validity Index

Statistic designed to indicate the degree to which a test measures what it purports to measure.

36
New cards

Item-Discrimination Index (d)

Indicates how adequately an item separates or discriminates between high scorers and low scorers on an entire test.

37
New cards

Qualitative Item Analysis

Nonstatistical procedures designed to explore how individual test items work, comparing them to each other and to the test as a whole.

38
New cards

Think Aloud Test Administration

Designed to shed light on the testtaker’s thought processes during the administration of a test.

39
New cards

Sensitivity Review

Study of test items, typically during test development, examining fairness and offensive content.

40
New cards

Cross-validation

Revalidation of the test on a sample of testtakers other than those on whom test performance was originally found to be a valid predictor of some criterion

41
New cards

Validity Shrinkage

Decrease in item validities that inevitably occurs after cross-validation of findings.

42
New cards

Co-validation

Test validation process conducted on two or more tests using the same sample of testtakers