Module 3: Reliability and Validity

0.0(0)
Studied by 0 people
call kaiCall Kai
Locked
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/55

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 1:50 AM on 6/15/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai
Chat

No analytics yet

Send a link to your students to track their progress

56 Terms

1
New cards

Assumptions in Test Development

Serves as a guide for psychometricians when developing psychological tests

2
New cards

1st Assumption

Traits and States Exist

3
New cards

2nd Assumption

Traits and States Can Be Measured

4
New cards

3rd Assumption

Test-Related Behavior predicts Non-Test Related Behavior

5
New cards

4th Assumption

Test and Other Measurement Techniques Have Strengths and Weaknesses

6
New cards

5th Assumption

Various Sources of Error Are Part of the Assessment Process

7
New cards

6th Assumption

Testing and Assessment Can Be Conducted in a Fair and Unbiased Manner

8
New cards

7th Assumption

Testing and Assessment Benefit Society

9
New cards

Trait

Enduring, stable personality characteristic that persists across time and situations (e.g., extroverted)

10
New cards

State

Temporary emotional or behavioral condition triggered by a specific situation or environment (e.g., emotions)

11
New cards

Constructs

Psychological Concepts (e.g., shyness, self-esteem)

12
New cards

Measurement Error

Errors within the test itself; errors that affect test reliability

13
New cards

Extraneous Variables

Any factor outside of your independent variable that has the potential to influence the results of your study or experiment (e.g., noise)

14
New cards

What is a good test?

A good test is reliable and valid

15
New cards

Reliability

Refers to the consistency of the test across different contexts, situations, cultures, and time

16
New cards

Validity

Refers to the judgment of how well a test measures what it intends to measure (accuracy)

17
New cards

Types of Measurement Error

Random and Systematic

18
New cards

Random Errors

External errors; unavoidable, unpredictable fluctuations in measurements or processes that occur by chance that affects the test’s reliability.

19
New cards

Test Administration Random Error

How a test is administered is a frequent source of random error; Gender of administrator, overall mood, temperature, situation/s

20
New cards

Systematic Errors

A consistent, predictable shift in measurement or data that deviates from the true value in one specific direction. Also known as bias.

21
New cards

Sources of Systematic Errors

Test Construction & Test Scoring and Interpretation

22
New cards

Test Construction

How items are made and its variations with another test item; Ex: phrasing, wording, sentence construction

23
New cards

Test Scoring and Interpretation

Qualifications of the test administrator, how scores are computed: Manual vs. Computerized?; format of test: Objective vs. Projective?

24
New cards

Assessing Test Reliability

  1. Test Retest Reliability Estimates

  2. Parallel-Forms and Alternate Forms Reliability Estimates

  3. Split Half Reliability Estimates

  4. Internal Consistency

  5. Inter-scorer Reliability

25
New cards

Test Retest Reliability Estimates

The test is conducted on a pool of respondents at one point. After some time, they are tasked to answer the test again.Scores between the two tests are evaluated if they produced consistent results. More appropriate with static constructs.

26
New cards

Internal Consistency

Chronbach’s α (alpha). Most used method in measuring reliability by determining the tests reliability coefficient by using software and statistics.

27
New cards

Alpha Value 0.90++

Excellent Reliability Level

28
New cards

Alpha Value 0.80 - 0.89

Good Reliability Level

29
New cards

Alpha Value 0.70 - 0.79

Acceptable Reliability Level

30
New cards

Alpha Value 0.60 - 0.69

Questionable Reliability Level

31
New cards

Alpha Value 0.59 and below

Poor Reliability Level

32
New cards

Inter-Scorer Reliability

Test is shown to SMEs or subject matter experts. Then, we evaluate the consistency of scores between different experts. Usually, three (3) experts are chosen to prevent a “tie”

33
New cards

Homogeneous Test

A reliable test is homogeneous. This means that the test items are uniform in though.

34
New cards

Heterogeneous Test

Items are not uniform in thought and are varied

35
New cards

Dynamic Construct

Construct is changing rapidly overtime

36
New cards

Static Construct

Construct is consistent and stable

37
New cards

Assessing Test Validity

Test Validation, Face Validity, Content Validity, Criterion-related Validity

38
New cards

Test Validation

Process of gathering evidence about one’s validity

39
New cards

Face Validity

The degree to which a test, survey, or assessment appears to measure what it claims to measure at "face value”

40
New cards

3 Categories of Validity

Content Validity, Criterion-related Validity, Construct Validity

41
New cards

Content Validity

The extent to which a measurement instrument (like a test or survey) thoroughly covers all relevant facets of the theoretical concept or construct it aims to measure

42
New cards

Criterion-related Validity

Evaluates how accurately a test or measurement predicts an outcome by comparing it to an established external benchmark

43
New cards

Types of Criterion-Related Validity

Concurrent Validity and Predictive Validity

44
New cards

Concurrent Validity

How a test correlates to a “gold standard” test at the same point in time; We use an existing and psychometrically sound test and correlate it with our own existing test at the same time. (correlated = high in concurrent validity)

45
New cards

Predictive Validity

Refers to how well does your test predict future behavior; “Can this test predict something that will happen in the future?”

46
New cards

Construct Validity

Refers to how the test truly measures the theoretical construct of framework of the test; ensures a test accurately measures

47
New cards

Types of Construct Validity

Convergent Validity and Discriminant Validity

48
New cards

Convergent Validity

Your construct is positively correlated with another closely related construct; “Does this test agree with other similar measures?”

49
New cards

Discriminant Validity

This correlation indicates that the construct in your test is negatively correlated with another construct. This proves that your construct is distinct

50
New cards

Test Bias

Inherent factors in a test that interferes with the accuracy of the results (in the context of psychometrics)

51
New cards

Types of Biases in Psychological tests

Leniency Error, Severity Error, Central Tendency, Halo Effect, Horn Effect

52
New cards

Leniency Error

Type of error in which the rater has a tendency to be lenient in scoring; “Okay na ‘to”, “medyo tama naman”

53
New cards

Severity Error

Type of error in which the rater has a tendency to scrutinize the individual too much; “Nitpicking”

54
New cards

Central Tendency Error

Type of error in which the rater has a tendency to stay in the neutral or “safe” zone

55
New cards

Halo Effect

Tendency to give a particular ratee a higher score because they appear “nice”, attractive, pleasant

56
New cards

Horn Effect

Tendency to give a particular ratee a lower score because they appear unpleasant, unattractive, etc.