Module 3: Reliability and Validity

Last updated 1:50 AM on 6/15/26

56 Terms

New cards

Assumptions in Test Development

Serves as a guide for psychometricians when developing psychological tests

New cards

1st Assumption

Traits and States Exist

New cards

2nd Assumption

Traits and States Can Be Measured

New cards

3rd Assumption

Test-Related Behavior Predicts Non-Test-Related Behavior

New cards

4th Assumption

Tests and Other Measurement Techniques Have Strengths and Weaknesses

New cards

5th Assumption

Various Sources of Error Are Part of the Assessment Process

New cards

6th Assumption

Testing and Assessment Can Be Conducted in a Fair and Unbiased Manner

New cards

7th Assumption

Testing and Assessment Benefit Society

New cards

Trait

Enduring, stable personality characteristic that persists across time and situations (e.g., extroverted)

New cards

State

Temporary emotional or behavioral condition triggered by a specific situation or environment (e.g., emotions)

New cards

Constructs

Psychological Concepts (e.g., shyness, self-esteem)

New cards

Measurement Error

Errors in the measurement process, whether from the test itself or from how it is administered and scored; errors that affect test reliability

New cards

Extraneous Variables

Any factor outside of your independent variable that has the potential to influence the results of your study or experiment (e.g., noise)

New cards

What is a good test?

A good test is reliable and valid

New cards

Reliability

Refers to the consistency of the test across different contexts, situations, cultures, and time

New cards

Validity

Refers to the judgment of how well a test measures what it intends to measure (accuracy)

New cards

Types of Measurement Error

Random and Systematic

New cards

Random Errors

External errors; unavoidable, unpredictable fluctuations in measurements or processes that occur by chance and affect the test’s reliability.

New cards

Test Administration Random Error

How a test is administered is a frequent source of random error (e.g., gender of the administrator, overall mood, room temperature, situation)

New cards

Systematic Errors

A consistent, predictable shift in measurement or data that deviates from the true value in one specific direction. Also known as bias.
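The contrast between random and systematic error can be illustrated with a small sketch. The scores below are made up for illustration, and the constant +3 shift is an assumed example of bias, not a value from the module.

```python
from statistics import mean

# Hypothetical data: one examinee's true score and five repeated measurements.
true_score = 50
random_only = [48, 52, 49, 51, 50]        # fluctuates around the true score
with_bias = [x + 3 for x in random_only]  # same fluctuations plus a constant +3 shift

def average_error(observed, true_value):
    """Mean signed deviation of the observed scores from the true value."""
    return mean(x - true_value for x in observed)

# Random fluctuations go in both directions and cancel out on average.
random_error_avg = average_error(random_only, true_score)

# A systematic shift pushes every score the same way, so it does not cancel.
biased_error_avg = average_error(with_bias, true_score)

print(random_error_avg, biased_error_avg)
```

In this toy data the random errors average out to zero, while the biased measurements stay off by the size of the shift no matter how many times the measurement is repeated.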

New cards

Sources of Systematic Errors

Test Construction; Test Scoring and Interpretation

New cards

Test Construction

How items are written and how they vary from one test item to another (e.g., phrasing, wording, sentence construction)

New cards

Test Scoring and Interpretation

Qualifications of the test administrator; how scores are computed (manual vs. computerized); format of the test (objective vs. projective)

New cards

Assessing Test Reliability

Test-Retest Reliability Estimates
Parallel-Forms and Alternate-Forms Reliability Estimates
Split-Half Reliability Estimates
Internal Consistency
Inter-Scorer Reliability

New cards

Test-Retest Reliability Estimates

The test is administered to a pool of respondents at one point in time. After some time, the same respondents take the test again. Scores from the two administrations are compared to see whether they are consistent. More appropriate for static constructs.
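One common way to compare the two sets of scores is a Pearson correlation between the first and second administration. The sketch below assumes that approach; the function name `pearson_r` and the scores are hypothetical.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation between two equally long lists of scores."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sums of squared deviations from each mean.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

# Hypothetical scores of the same five respondents at two points in time.
time_1 = [10, 12, 15, 18, 20]
time_2 = [11, 12, 16, 17, 21]

test_retest_r = pearson_r(time_1, time_2)
print(round(test_retest_r, 2))
```

A coefficient close to 1 suggests the respondents kept roughly the same rank order across the two administrations, which is what consistent results look like for a static construct.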

New cards

Internal Consistency

Cronbach’s α (alpha). The most widely used method for measuring reliability; the test’s reliability coefficient is computed statistically, usually with software.
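As a rough illustration of what the software computes, the sketch below uses the standard formula α = k/(k−1) × (1 − Σ item variances / variance of total scores), where k is the number of items. The function name `cronbach_alpha` and the response data are hypothetical.

```python
from statistics import variance

def cronbach_alpha(item_scores):
    """item_scores: one row per respondent, one column per test item."""
    k = len(item_scores[0])                      # number of items
    columns = list(zip(*item_scores))            # scores grouped by item
    item_variances = [variance(col) for col in columns]
    total_variance = variance([sum(row) for row in item_scores])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)

# Hypothetical answers of five respondents to a three-item scale (1-5 ratings).
responses = [
    [4, 5, 4],
    [3, 3, 2],
    [5, 5, 5],
    [2, 3, 2],
    [4, 4, 3],
]

alpha = cronbach_alpha(responses)
print(round(alpha, 2))
```

Alpha rises when the items vary together, so respondents who score high on one item also tend to score high on the others.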

New cards

Alpha Value 0.90 and above

Excellent Reliability Level

New cards

Alpha Value 0.80 - 0.89

Good Reliability Level

New cards

Alpha Value 0.70 - 0.79

Acceptable Reliability Level

New cards

Alpha Value 0.60 - 0.69

Questionable Reliability Level

New cards

Alpha Value 0.59 and below

Poor Reliability Level
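The alpha cutoffs on these cards can be collected into a single lookup. This is only a convenience sketch; the function name `reliability_level` is hypothetical, and the cutoffs are the ones listed on the cards.

```python
def reliability_level(alpha):
    """Map an alpha coefficient to the reliability label used on these cards."""
    if alpha >= 0.90:
        return "Excellent"
    if alpha >= 0.80:
        return "Good"
    if alpha >= 0.70:
        return "Acceptable"
    if alpha >= 0.60:
        return "Questionable"
    return "Poor"

for value in (0.93, 0.84, 0.71, 0.65, 0.42):
    print(value, reliability_level(value))
```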

New cards

Inter-Scorer Reliability

The test is shown to subject matter experts (SMEs), and the consistency of scores across the different experts is evaluated. Usually, three (3) experts are chosen to prevent a “tie”.

New cards

Homogeneous Test

A reliable test is homogeneous. This means that the test items are uniform in thought.

New cards

Heterogeneous Test

Items are not uniform in thought and are varied

New cards

Dynamic Construct

Construct changes rapidly over time

New cards

Static Construct

Construct is consistent and stable

New cards

Assessing Test Validity

Test Validation, Face Validity, Content Validity, Criterion-related Validity

New cards

Test Validation

Process of gathering evidence about a test’s validity

New cards

Face Validity

The degree to which a test, survey, or assessment appears to measure what it claims to measure at “face value”

New cards

3 Categories of Validity

Content Validity, Criterion-related Validity, Construct Validity

New cards

Content Validity

The extent to which a measurement instrument (like a test or survey) thoroughly covers all relevant facets of the theoretical concept or construct it aims to measure

New cards

Criterion-related Validity

Evaluates how accurately a test or measurement predicts an outcome by comparing it to an established external benchmark

New cards

Types of Criterion-Related Validity

Concurrent Validity and Predictive Validity

New cards

Concurrent Validity

How well a test correlates with a “gold standard” test at the same point in time. We take an existing, psychometrically sound test and correlate it with our own test, administered at the same time (strongly correlated = high in concurrent validity).

New cards

Predictive Validity

Refers to how well your test predicts future behavior; “Can this test predict something that will happen in the future?”

New cards

Construct Validity

Refers to how well the test truly measures the theoretical construct or framework behind the test; ensures a test accurately measures the construct it is intended to measure

New cards

Types of Construct Validity

Convergent Validity and Discriminant Validity

New cards

Convergent Validity

Your construct is positively correlated with another closely related construct; “Does this test agree with other similar measures?”

New cards

Discriminant Validity

The construct in your test shows a weak or near-zero correlation with a theoretically unrelated construct. This is evidence that your construct is distinct.

New cards

Test Bias

Inherent factors in a test that interfere with the accuracy of the results (in the context of psychometrics)

New cards

Types of Biases in Psychological Tests

Leniency Error, Severity Error, Central Tendency Error, Halo Effect, Horn Effect

New cards

Leniency Error

Type of error in which the rater has a tendency to be lenient in scoring; “This is good enough”, “It’s more or less correct”

New cards

Severity Error

Type of error in which the rater has a tendency to scrutinize the individual too much; “Nitpicking”

New cards

Central Tendency Error

Type of error in which the rater has a tendency to stay in the neutral or “safe” zone
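Leniency, severity, and central tendency errors each leave a trace in a rater's scores: a high average, a low average, or very little spread around the midpoint. The sketch below screens for these patterns. The ratings, the function name `describe_rater`, and the thresholds (`shift`, `min_spread`) are all hypothetical choices for illustration, not established cutoffs.

```python
from statistics import mean, pstdev

# Hypothetical ratings on a 1-5 scale given by three raters to the same six ratees.
ratings = {
    "rater_a": [5, 5, 4, 5, 4, 5],   # mostly high scores
    "rater_b": [1, 2, 1, 2, 1, 2],   # mostly low scores
    "rater_c": [3, 3, 3, 3, 3, 3],   # always the midpoint
}

SCALE_MIDPOINT = 3

def describe_rater(scores, midpoint=SCALE_MIDPOINT, shift=1.0, min_spread=0.3):
    """Flag a rating pattern using assumed, illustrative thresholds."""
    avg = mean(scores)
    spread = pstdev(scores)
    if avg - midpoint >= shift:
        return "possible leniency error"
    if midpoint - avg >= shift:
        return "possible severity error"
    if spread < min_spread:
        return "possible central tendency error"
    return "no obvious pattern"

flags = {name: describe_rater(scores) for name, scores in ratings.items()}
for name, flag in flags.items():
    print(name, flag)
```

A flag here is only a prompt to look closer: a high average may simply mean the ratees performed well, so the pattern should be compared against other raters who scored the same people.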

New cards

Halo Effect

Tendency to give a particular ratee a higher score because they appear “nice”, attractive, pleasant

New cards

Horn Effect

Tendency to give a particular ratee a lower score because they appear unpleasant, unattractive, etc.