1.2 Reliability Estimates

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/36

There's no tags or description

Looks like no tags are added yet.

Last updated 6:09 AM on 7/22/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai	Chat

No analytics yet

Send a link to your students to track their progress

37 Terms

New cards

Test-Retest Reliability

Reliability obtained by correlating pairs of scores from the sample people on two different administrations

New cards

Coefficient of Stability

In test-retest reliability, the longer the time that passess, the greater the likelihood that the reliability coefficient will be lower

New cards

Pearson Product-Moment Correlation Coefficient (r)

Typically calculated between the scores from the two administrations in test-retest reliability

New cards

Carryover effects

Happens when test-retest interval is short, wherein the second test is influenced by the first test because the content of the first test was remembered

New cards

Practice Effect

Happens in test-retest wherein scores on the second session are higher due to their experience of the first session of testing

New cards

Test Sophistication

In test-retest, items are remembered by the test takers, especially the difficult ones/items that got highlighted as confused

New cards

Test Wiseness

In test-retest, it might inflate the abilities of test takers

New cards

Mortality

Problems in absence in second session oof the test-retest session (just remove the first test of the absents)

New cards

Parallel Forms Reliability

Reliability in which item sampling and other errors have affected scores on versions of the same test

New cards

Parallel Forms

Each form of the test means that the variance of observed test scores is equal

New cards

Alternate Forms Reliability

Reliability that is an estimate of the extent to which these different forms of the same test have been affected by item sampling error, or other errors

New cards

Alternate Forms

Simple, different versions of the test that have been constructed so as to be parallel

New cards

Counterbalancing

A technique to avoid carryover effects for parallel forms, by using different sequence for groups

New cards

Pearson product-moment correlation coefficient (r)

In parallel forms, what is coefficient is calculated using the scores on the two parallel forms?

New cards

Internal Consistency (Inter-Item Consistency)

Degree of correlation among all the items on a scale

New cards

Homogeneity

Single factor test measure

New cards

Heterogeneity

Multiple factor test measure

New cards

Kuder-Richardson Formula 20

Used exclusively for tests where items are dichotomously scored (i.e., items with only two possible responses, such as correct/incorrect, true/false, or yes/no) with different degrees of difficulty

New cards

Kuder-Richardson Formula 21

Used exclusively for tests where items are dichotomously scored (i.e., items with only two possible responses, such as correct/incorrect, true/false, or yes/no) with equal degrees of difficulty

New cards

Cronbach’s Alpha (α)

Arguably the most widely used and reported measure of internal consistency, especially for tests with items that are scored on a continuous scale (e.g., Likert-scale items, multiple-choice items with partial credit)

New cards

McDonald’s Omega (ω)

A measure of internal consistency reliability, similar to Cronbach's Alpha, but often preferred for its ability to handle more complex factor structures

New cards

McDonald’s Omega (ω)

It is a way to assess how well the items on a questionnaire or scale consistently measure a single underlying construct

New cards

Average Proportional Distance

A measure used to evaluate internal consistency of a test that focuses on the degree of difference that exist between item scores

New cards

Split-Half Reliability

Reliability that correlates two pairs of scores obtained from equivalent halves of a single test administered once

New cards

Spearman-Brown Formula

Estimates internal consistency reliability from a correlation of two halves of a test

New cards

Spearman-Brown Prophecy Formula

Estimates how many more items are needed in order to achieve the target reliability

New cards

Steps of the Spearman-Brown Formula

Divide test into equivalent halves 2. Calculate the correlation coefficient (Spearman’s rank correlation or Pearson correlation if the data is interval/ratio) between the scores of the two halves 3. Adjust the half-test reliability using Spearman-Brown formula

New cards

Rulon’s Formula (Flanagan-Rulon formula)

Counterpart of the Spearman-Brown formula, a method for assessing the consistency of a test by comparing the scores on two halves of the test

New cards

Steps of Rulon's Formula

Split the test: Divide the test into two equivalent halves (e.g., odd and even questions) 2. Calculate variances Calculate the variance of the differences between each person's scores on the two halves Calculate the variance of the total scores for each person 3. Apply the formula

New cards

Odd-Even Reliability

Reliability that assigns odd-numbered items to one half of the test and even-numbered items to the other half

New cards

Interrater Reliability

Reliability that involves the degree of agreement or consistency between two or more scorers with regards to particular measure

New cards

Coefficient of Interrater Reliability

Coefficient that is the way of determining the degree of consistency among scorers in the scoring of a test is to calculate a coefficient of correlation

New cards

Kappa Statistics

Used formula for nominal data

New cards

Fleiss & Cohen's

Types of Kappa Statistics

New cards

Fleiss Kappa

Agreement between multiple raters (three or more)

New cards

Cohen’s Kappa

Agreement between two raters

New cards

Kendal’s W

Used for rankings or ordinal data