Flashcards for KIN 3313 - Assessment and Evaluation lecture review.
Reliability
The consistency or repeatability of scores resulting from a testing procedure.
Other terms to describe reliability
Consistency, dependability, stability, and precision.
Validity
The degree to which a test measures what it purports to measure (the truthfulness of a test score); validity depends on both reliability and relevance.
Relevance
The degree to which a test pertains to its objectives.
Objectivity
A special kind of reliability: consistency between two or more raters (interrater) or within the same rater over time (intrarater).
Interrater reliability
Consistency between independent judgments of the same performance made by two or more raters.
Intrarater reliability
Consistency in scoring when a rater scores the same test or performance two or more times.
Pearson product-moment (PPM) correlation coefficient
A correlation coefficient used to provide evidence of a test's reliability.
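For reference, the PPM correlation between paired scores x and y (for example, the same participants' scores on two trials of a test) is computed as:

$$
r = \frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{\sqrt{\sum_{i=1}^{n}(x_i-\bar{x})^{2}}\;\sqrt{\sum_{i=1}^{n}(y_i-\bar{y})^{2}}}
$$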
Observed score
True score + error score
True Score
Theoretically exists but is impossible to measure; the theoretical average of an infinite number of administrations of the test.
Error Score
Results from anything that causes the observed score to differ from the true score.
Theoretical Conclusions of Reliability
Observed score variance = true score variance + error score variance
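In classical test theory notation (observed score X, true score T, error score E), these relationships are commonly written as follows, with the reliability coefficient interpreted as the proportion of observed-score variance that is true-score variance (this last ratio is a standard classical-test-theory result, not stated in the card above):

$$
X = T + E, \qquad \sigma_X^{2} = \sigma_T^{2} + \sigma_E^{2}, \qquad r_{XX'} = \frac{\sigma_T^{2}}{\sigma_X^{2}} = 1 - \frac{\sigma_E^{2}}{\sigma_X^{2}}
$$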
Interclass coefficients
Reliability coefficients based on the Pearson product-moment (PPM) correlation coefficient.
Intraclass coefficients
Reliability coefficients based on analysis of variance (ANOVA).
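As an illustration only, here is a minimal Python sketch of an intraclass coefficient computed from ANOVA mean squares for an n-participants-by-k-trials score matrix. The choice of the one-way form ICC(1,1), the function name `icc_one_way`, and the example scores are assumptions, not taken from the lecture.

```python
import numpy as np

def icc_one_way(scores: np.ndarray) -> float:
    """One-way intraclass correlation, ICC(1,1), for an n x k matrix
    of scores (n participants, k trials), from ANOVA mean squares."""
    n, k = scores.shape
    grand_mean = scores.mean()
    subject_means = scores.mean(axis=1)

    # Between-subjects and within-subjects sums of squares
    ss_between = k * np.sum((subject_means - grand_mean) ** 2)
    ss_within = np.sum((scores - subject_means[:, None]) ** 2)

    # Mean squares: df = n - 1 between subjects, n(k - 1) within subjects
    msb = ss_between / (n - 1)
    msw = ss_within / (n * (k - 1))

    return (msb - msw) / (msb + (k - 1) * msw)

# Hypothetical data: 5 participants, 3 trials of the same test
trials = np.array([
    [10, 11, 10],
    [14, 15, 15],
    [ 9, 10,  9],
    [12, 12, 13],
    [16, 17, 16],
])
print(f"ICC(1,1) = {icc_one_way(trials):.3f}")
```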
Interclass Reliability
Three types: test-retest reliability, equivalence reliability, and split-halves reliability.
Test-Retest Reliability
Administering a single test twice to participants and correlating the scores.
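A minimal sketch of the test-retest procedure, using the PPM correlation between the two administrations as the reliability estimate; the day-1 and day-2 score arrays are hypothetical.

```python
import numpy as np

# Hypothetical scores for the same participants on two administrations
day1 = np.array([52, 48, 61, 55, 49, 58, 63, 50])
day2 = np.array([54, 47, 60, 57, 50, 59, 61, 52])

# PPM correlation between administrations = test-retest reliability estimate
r = np.corrcoef(day1, day2)[0, 1]
print(f"Test-retest reliability (r) = {r:.3f}")
```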
Equivalence Reliability
Two parallel (equivalent) forms of an exam are given to the same participants, and the scores on the two forms are correlated.
Split-Halves Reliability
A single test is split into two halves, the halves are scored separately for the same participants, and the half-test scores are correlated.
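A hedged sketch of a split-halves estimate: the item matrix and the odd/even split are hypothetical, and the Spearman-Brown step-up shown at the end is a common correction to full-test length rather than something stated in the card above.

```python
import numpy as np

# Hypothetical item-level data: rows = participants, columns = items (1 = correct)
items = np.array([
    [1, 1, 0, 1, 1, 0, 1, 1],
    [0, 1, 0, 0, 1, 0, 1, 0],
    [1, 1, 1, 1, 1, 1, 1, 1],
    [1, 0, 0, 1, 0, 0, 1, 0],
    [1, 1, 1, 0, 1, 1, 0, 1],
])

# Split the single test into two halves (odd vs. even items) and total each half
odd_half = items[:, 0::2].sum(axis=1)
even_half = items[:, 1::2].sum(axis=1)

# Correlate the half-test scores
r_half = np.corrcoef(odd_half, even_half)[0, 1]

# Spearman-Brown step-up to estimate full-length-test reliability
r_full = 2 * r_half / (1 + r_half)
print(f"Half-test r = {r_half:.3f}, Spearman-Brown corrected = {r_full:.3f}")
```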
Intraclass Reliability
Estimates reliability when scores from more than two trials are available and can account for systematic (constant) differences between trial means.
Factors Affecting Reliability
Participant variability, time between testing, circumstances surrounding the testing periods, precision of measurement, environmental conditions, fatigue, practice, and an appropriate level of test difficulty for the participants.
Five Sources of Validity Evidence
Evidence based on test content, relations to other variables, internal structure, response processes, and consequences of testing.
Test content
The themes, wording, and format of the items, tasks, or questions on a test.
Evidence Based on Relations to Other Variables
Analyzing the relationship between test scores and variables external to the test.
Convergent evidence
Relationships between test scores and other measures intended to assess similar constructs.
Discriminant evidence
Relationships between test scores and other measures intended to assess different constructs.
Evidence Based on Internal Structure
The degree to which the relationships among test items and test components conform to the construct on which the proposed test score interpretations are based.
Evidence Based on Response Processes
The degree to which the processes of test takers or scorers are consistent with the intended interpretation of scores.
Evidence Based on Consequences of Testing
Focuses on the meaning of scores and on the intended and unintended consequences of assessment use.
Important considerations in reliability and validity
Reliability and validity results are specific to the group tested, the testing environment, and the testing procedure; they are not typically generalizable.