Measurement in Psychology & Intelligence Testing

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/24

flashcard set

Earn XP

Description and Tags

25 vocabulary flashcards summarizing key terms on measurement principles—reliability, validity, standardization, and bias—in psychological testing.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

25 Terms

1
New cards

Measurement in Psychology

The process of quantitatively assessing invisible constructs such as thoughts, feelings, and behaviours to predict outcomes and diagnose disorders.

2
New cards

Individual Differences

Continuous variations among people on physical, sensory-motor, cognitive, personality, and social functioning dimensions.

3
New cards

Reliability

The consistency of a test; its ability to yield the same score for the same person under identical conditions.

How do we know if a questionnaire measure is reliable ?

 

Methods to quantify reliability

Several methods available:

 

Test retest reliability

 

Alternate parallel forms

 

Split half reliability

 

Inter-rater reliability

 

4
New cards

Validity

The accuracy of a test; the degree to which it actually measures the construct it is intended to measure.

Major types of validity:

 

  • Construct validity
    measures X, related to past measures of X

 

  • Content validity
    captures all aspects of X

 

  • Criterion validity
    predicts what X should theoretically predict

 

5
New cards

Standardization

The process of administering a test to a large, representative sample to establish norms that give meaning to individual raw scores.

STANDARDISATION

 

  • Involves giving test to a large random sample to find out:

  • average performance

  • variation in performance around the average

  • Tells us how well someone has performed

  • Important to know what population the test is standardized on

6
New cards

Bias-freeness

The extent to which a test is fair and does not advantage or disadvantage particular groups unrelated to the construct being measured.

  • Differences between groups on other factors may explain differences in test performance, not lack of ability

7
New cards

Test-retest Reliability

Reliability evidence obtained by administering the same test to the same group on two occasions and correlating the scores.

8
New cards

Alternate (Parallel) Forms Reliability

Reliability obtained by creating two equivalent versions of a test and correlating scores between the versions.

9
New cards

Split-half Reliability

Reliability determined by dividing a test into two halves (e.g., odd vs. even items) and correlating the two sets of scores.

10
New cards

Inter-rater Reliability

Consistency between independent observers’ ratings of the same behaviour, shown by high agreement or correlation.

11
New cards

Construct Validity

Evidence that a test relates highly to other accepted measures of the same construct.

12
New cards

Content Validity

The extent to which a test samples all facets of the construct it aims to measure.

13
New cards

Criterion Validity

The extent to which a test predicts outcomes (criteria) that theory says it should predict.

14
New cards

Norm Group

The large, randomly selected sample whose scores establish the mean and standard deviation used for interpreting individual results.

15
New cards

Mean (µ)

The average score of the norm group, used as the reference point in standardization.

16
New cards

Standard Deviation (σ)

A statistic that describes the spread of scores around the mean in the norm group.

17
New cards

Normal (Gaussian) Distribution

A bell-shaped distribution in which about 68% of scores fall within ±1σ, 95% within ±2σ, and 99% within ±3σ of the mean.

18
New cards

IQ Norms

Standard values for intelligence tests, typically set at µ = 100 and σ = 15.

19
New cards

Stanford–Binet Intelligence Scale

A widely used standardized test for measuring human intelligence, often used as a benchmark for construct validity.

20
New cards

Wechsler Scales

A family of standardized intelligence tests (e.g., WAIS, WISC) frequently used to assess cognitive ability across the lifespan.

21
New cards

Social Desirability Bias

The tendency of respondents to answer questions in a manner that will be viewed favourably by others, potentially distorting self-report data.

22
New cards

Observable Behaviour Coding

Recording behaviours from video or live observation for later analysis, often requiring high inter-rater reliability.

23
New cards

Predictive Use of Tests

Employing psychological measures to forecast behaviours or life outcomes such as academic performance or job success.

24
New cards

Continuous Dimension

A variable on which individuals differ by degree (e.g., height, reaction time) rather than by discrete categories.

25
New cards

Measurement Bias

Systematic factors causing certain groups to score higher or lower on a test for reasons unrelated to the construct being assessed.