Psychology Licensing Exam: Test Construction Vocabulary

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/41

flashcard set

Earn XP

Description and Tags

Flashcards covering the essential vocabulary of test construction, including reliability types, validity methods, and statistical measurements from the lecture notes.

Last updated 2:04 AM on 5/23/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

42 Terms

1
New cards

Test

A systematic procedure for measuring a sample of an individual's behavior.

2
New cards

Standardized Administration

A systematic procedure where the test developer provides specific guidelines for administering the test.

3
New cards

Systematic Scoring

A systematic procedure where the test developer specifies clear rules or steps for evaluating an examinee's responses.

4
New cards

Validity

The degree to which a test accurately and adequately measures what it was designed to measure.

5
New cards

Reliability

The degree of consistency with which the test measures whatever it is measuring, ensuring an examinee's score is their true score.

6
New cards

True Score

The score a person would get on a test if it were perfectly reliable and performance was not affected by error.

7
New cards

Measurement Error

Random or unpredictable factors irrelevant to the behavior measured that affect examinee scores on a test.

8
New cards

X=T+EX = T + E

The equation where XX is the obtained score, TT is the variability due to true scores, and EE is the variability due to measurement error.

9
New cards

Reliability Coefficient

Symbolized as rXXr_{XX} or rYYr_{YY}, it is a squared number ranging from 00 to +1+1 interpreted as the proportion of variability in obtained scores due to true score variability.

10
New cards

Test-Retest Reliability

Evaluated by administering the test to the same group on two occasions; also known as the coefficient of stability.

11
New cards

Alternate Forms Reliability

Evaluated by administering two equivalent forms of the test to the same sample; also known as the coefficient of equivalence.

12
New cards

Internal Consistency Reliability

A measure of how consistent examinees' responses are to different items within a single test administration.

13
New cards

Split-Half Method

A method of assessing internal consistency by dividing a test into two halves and correlating the scores.

14
New cards

Spearman-Brown Prophecy Formula

A formula used to correct the split-half reliability coefficient or to estimate the effect of lengthening or shortening a test on reliability.

15
New cards

Cronbach's Coefficient Alpha

An internal consistency measure conceptualized as the average of all possible split-half reliability coefficients corrected by the Spearman-Brown formula.

16
New cards

KR20KR-20 (Kuder-Richardson Formula 20)

An internal consistency method used specifically when test items are scored dichotomously, such as right or wrong.

17
New cards

Inter-Rater Reliability

The degree of consistency across different raters, assessed when tests are subjectively scored.

18
New cards

Kappa Statistic

A correlation coefficient used to assess inter-rater reliability for two or more raters when the data are nominal.

19
New cards

Coefficient of Concordance

A correlation coefficient used to evaluate inter-rater reliability for two or more raters when data are in the form of ranks.

20
New cards

Standard Error of Measurement (SEMSEM)

The standard deviation of a distribution of obtained scores that would result from testing an individual an infinite number of times; it measures variability due to error.

21
New cards

Confidence Interval

The range within which an examinee's true score is likely to fall given their obtained score, constructed using the standard error of measurement.

22
New cards

Content Validity

The degree to which test items adequately represent the content or behavior domain the test was designed to measure.

23
New cards

Construct Validity

The degree to which a test measures a hypothetical or intangible trait, such as intelligence or self-esteem, evaluated through the accumulation of evidence.

24
New cards

Multi-Trait Multi-Method Matrix

A table of correlation coefficients used to evaluate construct validity by measuring two or more traits using two or more methods.

25
New cards

Convergent Validity

Evidence of construct validity shown when scores on a test correlate highly with scores on measures of the same or similar traits.

26
New cards

Divergent Validity

Evidence of construct validity shown when scores on a test do not correlate highly with scores on measures of unrelated traits; also called discriminant validity.

27
New cards

Monotrait-Monomethod Coefficient

A reliability coefficient in the multi-trait multi-method matrix indicating the correlation of a test with itself.

28
New cards

Monotrait-Heteromethod Coefficient

A correlation between two different measures of the same trait, providing information on convergent validity.

29
New cards

Heterotrait-Monomethod Coefficient

A correlation between two different traits measured by the same method, providing information on divergent validity.

30
New cards

Heterotrait-Heteromethod Coefficient

A correlation between two different traits measured by two different methods, providing information on divergent validity.

31
New cards

Criterion-Related Validity

The degree of association between a predictor (the test) and an external measure of performance (the criterion).

32
New cards

Predictive Validity

A type of criterion-related validity where the test is used to predict an examinee's future performance on the criterion.

33
New cards

Concurrent Validity

A type of criterion-related validity where the test is used to estimate an examinee's current status on the criterion.

34
New cards

Coefficient of Determination

The squared criterion-related validity coefficient (r2r^2) indicating the amount of variability in the criterion explained by the predictor.

35
New cards

Standard Error of Estimate

A value used to construct a confidence interval around an examinee's predicted criterion score.

36
New cards

Incremental Validity

The increase in decision-making accuracy achieved by using a predictor to make selection decisions.

37
New cards

Base Rate

The proportion of people who score above the criterion cutoff without using the new predictor tool.

38
New cards

Positive Hit Rate

The proportion of people who would have been successful if the predictor test had been used as a selection tool.

39
New cards

True Positives

Individuals who scored high on both the predictor and the criterion.

40
New cards

False Positives

Individuals who scored high on the predictor but low on the criterion.

41
New cards

False Negatives

Individuals who scored low on the predictor but high on the criterion.

42
New cards

True Negatives

Individuals who scored low on both the predictor and the criterion.