CA1 - Final Term
Reliability
Consistency, accuracy, and dependability of test results.
Classical Test Score Theory
Assumes that each person has a true score that would be obtained if there were no errors in measurement.
Classical Test Score Theory
A person’s observed score is made up of:
True score → their actual ability or knowledge
Error score → random influences like guessing or mistakes
Formula: Observed Score = True Score + Error
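In symbols (X, T, and E are the conventional labels; the numbers below are invented purely for illustration):

```latex
X = T + E \qquad \text{e.g. } 85 = 82 + 3 \quad \text{(true score 82, random error } +3\text{)}
```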
Systematic error
A consistent, predictable influence on test scores that can usually be identified and corrected.
Random error
An unpredictable fluctuation in the measurement process that is difficult to detect or remove, making it harder to estimate the true score.
Domain Sampling Method
Considers the problem created by using only a limited sample of items from the larger domain of all possible items.
What is the mantra on reliability?
The more items, the higher the reliability.
Item Response Theory
Focuses on how items across a range of difficulty can be used to assess an individual’s ability.
Individual’s ability
Refers to how skilled or knowledgeable an individual is.
Item difficulty
Refers to how hard a test question is, usually measured by the proportion of people who answered it correctly.
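A common way to formalize the link between ability and difficulty is the one-parameter (Rasch) IRT model, where θ is the person’s ability and b is the item’s difficulty:

```latex
P(\text{correct} \mid \theta, b) = \frac{1}{1 + e^{-(\theta - b)}}
```

When ability exactly matches difficulty (θ = b), the probability of a correct answer is 0.50.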
Item branching
A way of administering test questions in which the next item depends on your previous answer, making the test adaptive.
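A minimal Python sketch of item branching, assuming a simple rule (harder item after a correct answer, easier after a wrong one); the item pool and rule are invented for illustration:

```python
# Minimal adaptive-testing sketch: move up the difficulty ladder after a
# correct answer, down after a wrong one. All item content is made up.
ITEMS = [
    ("2 + 2 = ?", "4"),      # easy
    ("12 * 12 = ?", "144"),  # medium
    ("17 * 23 = ?", "391"),  # hard
]

def run_adaptive_test(responses, start_level=1):
    """Walk the difficulty ladder based on each response; return levels seen."""
    level, path = start_level, []
    for response in responses:
        _question, answer = ITEMS[level]
        path.append(level)
        # Branch: harder item if correct, easier if not (clamped to the pool).
        if response == answer:
            level = min(level + 1, len(ITEMS) - 1)
        else:
            level = max(level - 1, 0)
    return path

print(run_adaptive_test(["144", "391", "400"]))  # prints [1, 2, 2]
```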
Test-Retest Reliability
Refers to the consistency of test results when the same test is given to the same group of people at two different times.
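In practice this is usually estimated as the Pearson correlation between the two administrations; a minimal sketch with invented scores:

```python
# Test-retest reliability as the Pearson correlation between two
# administrations of the same test (scores below are invented).
import numpy as np

time1 = np.array([80, 75, 90, 60, 85, 70])  # first administration
time2 = np.array([82, 73, 88, 65, 84, 72])  # same people, retested later

r = np.corrcoef(time1, time2)[0, 1]  # off-diagonal entry is the correlation
print(f"test-retest reliability: r = {r:.2f}")
```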
Parallel Forms Reliability
Compares two equivalent forms of a test that measure the same attributes.
Internal Consistency
Refers to how well the items (questions) on a test measure the same idea or skill.
Split‑Half Reliability
The test is split into two halves.
Reliability is estimated by comparing scores from each half.
The Spearman-Brown formula is used to adjust reliability for the reduced number of items.
Without this correction, the estimate runs low because each half contains only half the items.
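The correction itself, where r_half is the correlation between the two halves (the second form generalizes to lengthening a test by a factor of n):

```latex
r_{\text{full}} = \frac{2\,r_{\text{half}}}{1 + r_{\text{half}}}
\qquad
r_{\text{new}} = \frac{n\,r}{1 + (n - 1)\,r}
```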
Kuder-Richardson 20
Used for dichotomous items (questions scored as simply right or wrong, e.g., true/false).
Assumes items vary in difficulty (easy, medium, hard).
Most tests naturally have items of varying difficulty, so KR-20 is the default unless equal difficulty can be justified.
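The formula, where k is the number of items, p_i the proportion answering item i correctly, q_i = 1 − p_i, and σ² the variance of total scores:

```latex
KR_{20} = \frac{k}{k - 1}\left(1 - \frac{\sum_{i=1}^{k} p_i q_i}{\sigma^2}\right)
```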
Kuder-Richardson 21
Also for dichotomous items.
Assumes all items have the same level of difficulty (must be justified).
Simpler to compute but less precise.
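The formula replaces the item-by-item sum in KR-20 with the mean total score x̄, which is what makes it simpler but less precise:

```latex
KR_{21} = \frac{k}{k - 1}\left(1 - \frac{\bar{x}\,(k - \bar{x})}{k\,\sigma^2}\right)
```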
Cronbach’s Coefficient Alpha
Used for polytomous items (questions with more than two possible response values, not just right/wrong).
Commonly applied to Likert‑scale items.
Estimates how consistently items measure the same construct when responses can vary in degree.
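The formula, where σ_i² is the variance of item i and σ_T² the variance of total scores; with dichotomous items it reduces to KR-20:

```latex
\alpha = \frac{k}{k - 1}\left(1 - \frac{\sum_{i=1}^{k} \sigma_i^2}{\sigma_T^2}\right)
```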
Interrater Reliability
Consistency of judges/raters evaluating the same behavior.
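Not named in these cards, but a standard way to quantify interrater agreement is Cohen’s kappa, where p_o is observed agreement and p_e is agreement expected by chance:

```latex
\kappa = \frac{p_o - p_e}{1 - p_e}
```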
Validity
The degree to which a test measures what it purports to measure.
Criterion Validity
How well test scores correspond to a particular criterion (an external standard or outcome).
Criterion Test
A well‑established psychological test that is already known to measure the construct correctly.
Used as a benchmark when developing new tests (e.g., comparing a new intelligence test to an existing one).
If both tests give similar results, it shows they measure the same thing.
Criterion Data
Any readily available information that can serve as a standard for comparison.
Predictive Validity
Refers to how well a test can forecast future performance or outcomes.
There is a time gap between taking the test and observing the results.
Using entrance exam scores to predict a student’s GPA in their fourth year.
Concurrent Validity
Refers to how well a test’s results agree with a criterion test or criterion data that measure the same construct at the same time.
Unlike predictive validity, no time gap between the test and the criterion is needed.
Shows that the test and the criterion are related and produce similar results.
Content Validity
Adequacy of representation of the conceptual domain the test is designed to cover.
Experts judge the validity of test items.
Construct Validity
Refers to how well a test truly measures the abstract concept it claims to measure.
Needed when measuring intangible traits (e.g., intelligence, anxiety, motivation).
Strongly based on theoretical frameworks and psychological models.
Harder to establish because it requires proof that the theory holds through research and evidence.
Convergent Validity
Refers to how well your test relates to measures of the same construct, as predicted by existing theory.
Shows that your test is measuring the same concept as other established measures.
If two tests measure the same construct, their results should be strongly related.
Divergent Validity
Refers to how well your test is not related to a different, theoretically distinct construct (also called discriminant validity).
Proves that your test is measuring something unique, not overlapping with unrelated traits.
If two constructs are theoretically different, your test should not correlate with measures of the other.
Face Validity
Refers to whether a test appears to measure what it is supposed to measure, just by looking at it.
It’s about appearance and impression, not statistical proof.
Utility
Practicality or usefulness of the test.
Not a psychometric property.
It is relative and subjective, depending on the situation and the people involved.
What is the mantra of psychometric properties?
A test can be reliable but not valid, but a test cannot be valid unless it is reliable.
What is the minimum reliability standard for basic research?
A reliability coefficient of at least 0.70.
What is the minimum reliability standard for clinical research?
A reliability coefficient of at least 0.90.