Measurement in Psychology & Intelligence Testing

0.0(0)

Studied by 0 people

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/24

Earn XP

Description and Tags

25 vocabulary flashcards summarizing key terms on measurement principles—reliability, validity, standardization, and bias—in psychological testing.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

25 Terms

New cards

Measurement in Psychology

The process of quantitatively assessing invisible constructs such as thoughts, feelings, and behaviours to predict outcomes and diagnose disorders.

New cards

Individual Differences

Continuous variations among people on physical, sensory-motor, cognitive, personality, and social functioning dimensions.

New cards

Reliability

The consistency of a test; its ability to yield the same score for the same person under identical conditions.

How do we know if a questionnaire measure is reliable ?

Methods to quantify reliability

Several methods available:

Test retest reliability

Alternate parallel forms

Split half reliability

Inter-rater reliability

New cards

Validity

The accuracy of a test; the degree to which it actually measures the construct it is intended to measure.

Major types of validity:

Construct validity
measures X, related to past measures of X

Content validity
captures all aspects of X

Criterion validity
predicts what X should theoretically predict

New cards

Standardization

The process of administering a test to a large, representative sample to establish norms that give meaning to individual raw scores.

STANDARDISATION

Involves giving test to a large random sample to find out:
average performance
variation in performance around the average
Tells us how well someone has performed
Important to know what population the test is standardized on

New cards

Bias-freeness

The extent to which a test is fair and does not advantage or disadvantage particular groups unrelated to the construct being measured.

Differences between groups on other factors may explain differences in test performance, not lack of ability

New cards

Test-retest Reliability

Reliability evidence obtained by administering the same test to the same group on two occasions and correlating the scores.

New cards

Alternate (Parallel) Forms Reliability

Reliability obtained by creating two equivalent versions of a test and correlating scores between the versions.

New cards

Split-half Reliability

Reliability determined by dividing a test into two halves (e.g., odd vs. even items) and correlating the two sets of scores.

New cards

Inter-rater Reliability

Consistency between independent observers’ ratings of the same behaviour, shown by high agreement or correlation.

New cards

Construct Validity

Evidence that a test relates highly to other accepted measures of the same construct.

New cards

Content Validity

The extent to which a test samples all facets of the construct it aims to measure.

New cards

Criterion Validity

The extent to which a test predicts outcomes (criteria) that theory says it should predict.

New cards

Norm Group

The large, randomly selected sample whose scores establish the mean and standard deviation used for interpreting individual results.

New cards

Mean (µ)

The average score of the norm group, used as the reference point in standardization.

New cards

Standard Deviation (σ)

A statistic that describes the spread of scores around the mean in the norm group.

New cards

Normal (Gaussian) Distribution

A bell-shaped distribution in which about 68% of scores fall within ±1σ, 95% within ±2σ, and 99% within ±3σ of the mean.

New cards

IQ Norms

Standard values for intelligence tests, typically set at µ = 100 and σ = 15.

New cards

Stanford–Binet Intelligence Scale

A widely used standardized test for measuring human intelligence, often used as a benchmark for construct validity.

New cards

Wechsler Scales

A family of standardized intelligence tests (e.g., WAIS, WISC) frequently used to assess cognitive ability across the lifespan.

New cards

Social Desirability Bias

The tendency of respondents to answer questions in a manner that will be viewed favourably by others, potentially distorting self-report data.

New cards

Observable Behaviour Coding

Recording behaviours from video or live observation for later analysis, often requiring high inter-rater reliability.

New cards

Predictive Use of Tests

Employing psychological measures to forecast behaviours or life outcomes such as academic performance or job success.

New cards

Continuous Dimension

A variable on which individuals differ by degree (e.g., height, reaction time) rather than by discrete categories.

New cards

Measurement Bias

Systematic factors causing certain groups to score higher or lower on a test for reasons unrelated to the construct being assessed.