Chapter 4 – Of Tests and Testing: Key Concepts

0.0(0)
linked notesView linked note
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/37

flashcard set

Earn XP

Description and Tags

Fill-in-the-blank flashcards covering Assumptions of Testing, reliability, validity, types of norms, sampling methods, norm- vs criterion-referenced evaluation, error variance, cultural considerations, and key terminology from Chapter 4.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

38 Terms

1

A psychological __ is a relatively enduring way one individual differs from another.

trait

2

A psychological __ is a relatively temporary condition that can vary within an individual.

state

3

Traits exist as scientific __ that are inferred from observable behaviour.

constructs

4

Observable actions or their products used to infer traits are called __ behaviour.

overt

5

Thorndike’s famous phrase, “Whatever exists, exists in some __,” underlies the assumption that traits can be measured.

amount

6

In __ scoring, item responses (e.g., correct = 1, incorrect = 0) are summed to indicate trait magnitude.

cumulative

7

The assumption that test behaviour forecasts real-world behaviour states that test-related behaviour can __ non-test behaviour.

predict

8

Sources of variation in scores not related to the trait being measured are called __ variance.

error

9

A test that consistently measures in the same way is said to have high __.

reliability

10

A test that actually measures what it purports to measure possesses __.

validity

11

Norm-referenced interpretation compares an individual’s score to the scores of a __ group.

reference (normative)

12

Criterion-referenced interpretation compares performance to a predefined __ or standard.

criterion

13

Test performance data of a particular group used for comparison are known as __.

norms

14

A __ sample is the group whose test performance becomes the basis for norms.

normative (or standardization)

15

Sampling that mirrors subgroups in the population in correct proportions is called __ sampling.

stratified

16

Selecting subjects because they are easily available yields a(n) __ or convenience sample.

incidental

17

Norms broken down by chronological years (e.g., 8-year-olds) are called __ norms.

age

18

Norms expressed in school grade units (e.g., 4.5) are called __ norms.

grade

19

Norms based on a sample representative of an entire country are termed __ norms.

national

20

Tables that equate scores from two different national tests are called national __ norms.

anchor

21

Locally gathered performance data used instead of published norms are referred to as __ norms.

local

22

Using the 1990 SAT sample to calibrate all later SAT scores exemplifies a __ reference group scoring system.

fixed

23

The statistical point below which a given percentage of scores fall is a __.

percentile

24

‘75% correct’ refers to raw score performance, whereas the ‘75th __’ refers to ranking among test-takers.

percentile

25

Giving different cutoff scores for hiring based on race was known as race __ and was outlawed in 1991.

norming

26

Factors such as illness, assessor bias, or faulty instruments illustrate different __ of error.

sources

27

The practice of evaluating tests for cultural fairness recognises that tests can contain __.

bias

28

Testing and assessment are assumed to provide powerful __ to society by informing critical decisions.

benefits

29

A __ test provides detailed, uniform procedures for administration, scoring, and (usually) norms.

standardized

30

When the tasks on a test resemble the actual behaviour being predicted, the test is said to be __-based.

performance

31

Comparing one dancer’s ‘shyness’ score to other male exotic dancers illustrates how the chosen __ group affects interpretation.

reference

32

In domain (content) sampling, items are selected to represent the entire __ of behaviours relevant to a construct.

domain

33

Random situational factors like weather can influence self-ratings, reminding us that some error is simply due to __.

chance

34

Using multiple assessment tools to balance limitations of any single test reflects the assumption that all tests have __.

imperfections

35

Assessors are encouraged to follow culturally __ assessment guidelines to ensure fair use.

informed

36

A measure that yields consistent but systematically off-target results (e.g., always 1.3 lb on a 1 lb bar) is reliable but not __.

accurate (valid)

37

Tests intended to verify mastery of specific knowledge or skills are often called __ tests.

mastery (criterion-referenced)

38

The equipercentile method equates test scores by matching equal __ ranks on two tests.

percentile