Lecture Notes on Reliability and Validity

0.0(0)

Studied by 0 people

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/37

Earn XP

Description and Tags

Flashcards on Reliability and Validity

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

38 Terms

New cards

Reliability

The consistency in measurement.

New cards

Reliability coefficient

A statistic that quantifies reliability (0-1), where 0 is not reliable at all and 1 is perfectly reliable.

New cards

Measurement error

The inherent uncertainty associated with any measurement, even after care has been taken to minimize preventable mistakes.

New cards

Random error

Unpredictable fluctuations and inconsistencies of other variables in the measurement process.

New cards

Systematic error

Typically proportionate to what is presumed to be the true value of the variable being measured.

New cards

True score

The measurement of quantity if there were no measurement error at all.

New cards

Construct score

Persons standing on a theoretical variable that is independent of any particular measurement.

New cards

Observed score

Actual measurement obtained when assessing a person.

New cards

True score

Long-term average of measurement scores.

New cards

Test construction

Variation may exist within items in a test or between tests (item sampling or content sampling).

New cards

Test administration

Test taker variables include: pressing emotional problems, physical discomfort, lack of sleep, effects of drugs or medication. Examiner-related variables include: physical appearance and demeanor may play a role.

New cards

Test scoring and interpretation

Computer testing reduces error in test scoring, but many tests still require expert interpretation. Example: tests of personality, tests of creativity, various behavioral measures, etc.

New cards

Sampling error

The extent to which the population of voters in the study actually was representative of voters in the election.

New cards

Methodological error

Result of interviewers not been trained properly, ambiguous wording in the questionnaire, or the presence of biased in the questionnaire.

New cards

Test-retest reliability

An estimate of reliability obtained by correlating pairs of scores from the same people on two different administrations of the same test. Only useful when assessing traits that do not change overtime.

New cards

COEFFICIENT OF STABILITY

With intervals greater than 6 months, the estimate of test-retest reliability.

New cards

Parallel forms reliability

Degree of the relationship between various forms of a test can be evaluated by means of a parallel-forms coefficient of reliability.

New cards

Parallel-forms coefficient of reliability

Is often termed COEFFICIENTS OF EQUIVALENCE.

New cards

Parallel forms reliability

Comparing scores on two different measures of the same quality.

New cards

Split-half reliability

An estimate of internal consistency of a test obtained by correlating two pairs of scores obtained from equivalent halves of a singular test administered once.

New cards

Inter-item consistency

To the degree of correlation among all the items on a scale.

New cards

Inter-rater reliability

Degree of agreement or consistency between two or more scorers (or rates) with regard to a particular measure.

New cards

Coefficient of inter-scorer reliability

Degree of consistency among scorers in a scoring of a test.

New cards

Kappa statistic

Statistical measure of inter-rater reliability. Used for more than 2 raters. Used in nominal (categorical) data.

New cards

Face validity

The extent to which a test appears to measure to the person being tested, as compared to what the test actually measures.

New cards

Content validity

Judgement of how adequately a test samples behavior representative of the universe of behavior that the test was designed to sample.

New cards

Test blueprint

A plan regarding the types of information to be covered by the items, the number of items tapping each component, and the organisation of the test.

New cards

Criterion

The standard against which a test or a test score is evaluated. Example: psychiatric diagnosis, index of alcohol intoxication.

New cards

Criterion validity

Judgement of how adequately a test score can be used to infer an individuals most probable standing on a criterion.

New cards

Concurrent validity

An index of the degree to which a test score is related to some criterion measure obtained at the same time (concurrently).

New cards

Predictive validity

The extent to which a given test can anticipate future occurrences.

New cards

Validity coefficient

A correlation coefficient that provides a measure of the relationship between test scores and the scores on teh criterion measure. It allows one to tell the extent to which the test is valid for making statements about the criterion.

New cards

Construct validity

Judgement about the appropriateness of inferences drawn from test scores regarding individual standings on a construct.

New cards

Construct

An informed, scientific idea developed or hypothesized to describe or explain behavior.

New cards

Evidence of HOMOGENEITY

How uniform a test is in measuring a single concept.

New cards

Evidence from DISTINCT GROUPS

Scores on a test vary in a predictable way as a function of membership in some group.

New cards

Convergent validity

Scores on the test undergoing construct validation tend to correlate highly in the predicted direction with scores on older, more established tests designed to measure the same (or similar) construct.

New cards

Discriminant evidence

Validity coefficient showing little relationship between test scores and/or other variables with which scores on the test should not theoretically be correlated.