1.2 Reliability Estimates

0.0(0)
Studied by 0 people
call kaiCall Kai
Locked
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/36

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 2:00 PM on 6/29/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai
Chat

No analytics yet

Send a link to your students to track their progress

37 Terms

1
New cards

Test-Retest Reliability

Reliability obtained by correlating pairs of scores from the sample people on two different administrations

2
New cards

Coefficient of Stability

In test-retest reliability, the longer the time that passess, the greater the likelihood that the reliability coefficient will be lower

3
New cards

Pearson Product-Moment Correlation Coefficient (r)

Typically calculated between the scores from the two administrations in test-retest reliability

4
New cards

Carryover effects

Happens when test-retest interval is short, wherein the second test is influenced by the first test because the content of the first test was remembered

5
New cards

Practice Effect

Happens in test-retest wherein scores on the second session are higher due to their experience of the first session of testing

6
New cards

Test Sophistication

In test-retest, items are remembered by the test takers, especially the difficult ones/items that got highlighted as confused

7
New cards

Test Wiseness

In test-retest, it might inflate the abilities of test takers

8
New cards

Mortality

Problems in absence in second session oof the test-retest session (just remove the first test of the absents)

9
New cards

Parallel Forms Reliability

Reliability in which item sampling and other errors have affected scores on versions of the same test

10
New cards

Parallel Forms

Each form of the test means that the variance of observed test scores is equal

11
New cards

Alternate Forms Reliability

Reliability that is an estimate of the extent to which these different forms of the same test have been affected by item sampling error, or other errors

12
New cards

Alternate Forms

Simple, different versions of the test that have been constructed so as to be parallel

13
New cards

Counterbalancing

A technique to avoid carryover effects for parallel forms, by using different sequence for groups

14
New cards

Pearson product-moment correlation coefficient (r)

In parallel forms, what is coefficient is calculated using the scores on the two parallel forms?

15
New cards

Internal Consistency (Inter-Item Consistency)

Degree of correlation among all the items on a scale

16
New cards

Homogeneity

Single factor test measure

17
New cards

Heterogeneity

Multiple factor test measure

18
New cards

Kuder-Richardson Formula 20

Used exclusively for tests where items are dichotomously scored (i.e., items with only two possible responses, such as correct/incorrect, true/false, or yes/no) with different degrees of difficulty

19
New cards

KR21

Dichotomous items

20
New cards

Cronbach’s Alpha (α)

Arguably the most widely used and reported measure of internal consistency, especially for tests with items that are scored on a continuous scale (e.g., Likert-scale items, multiple-choice items with partial credit) ●

21
New cards

McDonald’s Omega (ω)

A measure of internal consistency reliability, similar to Cronbach's Alpha, but often preferred for its ability to handle more complex factor structures

22
New cards

McDonald’s Omega (ω)

It is a way to assess how well the items on a questionnaire or scale consistently measure a single underlying construct

23
New cards

Average Proportional Distance

A measure used to evaluate internal consistency of a test that focuses on the degree of difference that exist between item scores

24
New cards

Split-Half Reliability

Reliability that correlates two pairs of scores obtained from equivalent halves of a single test administered once

25
New cards

Spearman-Brown Formula

Estimates internal consistency reliability from a correlation of two halves of a test

26
New cards

Spearman-Brown Prophecy Formula

Estimates how many more items are needed in order to achieve the target reliability

27
New cards

Steps of the Spearman-Brown Formula

  1. Divide test into equivalent halves 2. Calculate the correlation coefficient (Spearman’s rank correlation or Pearson correlation if the data is interval/ratio) between the scores of the two halves 3. Adjust the half-test reliability using Spearman-Brown formula
28
New cards

Rulon’s Formula (Flanagan-Rulon formula)

Counterpart of the Spearman-Brown formula, a method for assessing the consistency of a test by comparing the scores on two halves of the test

29
New cards

Steps of Rulon's Formula

  1. Split the test: Divide the test into two equivalent halves (e.g., odd and even questions) 2. Calculate variances Calculate the variance of the differences between each person's scores on the two halves Calculate the variance of the total scores for each person 3. Apply the formula
30
New cards

Odd-Even Reliability

Reliability that assigns odd-numbered items to one half of the test and even-numbered items to the other half

31
New cards

Interrater Reliability

Reliability that involves the degree of agreement or consistency between two or more scorers with regards to particular measure

32
New cards

Coefficient of Interrater Reliability

Coefficient that is the way of determining the degree of consistency among scorers in the scoring of a test is to calculate a coefficient of correlation

33
New cards

Kappa Statistics

Used formula for nominal data

34
New cards

Fleiss & Cohen's

Types of Kappa Statistics

35
New cards

Fleiss Kappa

Agreement between multiple raters (three or more)

36
New cards

Cohen’s Kappa

Agreement between two raters

37
New cards

Kendal’s W

Used for rankings or ordinal data