Week 3 (Validity)

72 Terms

New cards

Test Validity

The extent to which a test measures what it claims to measure.

The most fundamental consideration when evaluating psychological tests.

New cards

A test may yield reliable scores _____

yet might not be a valid indicator of what it claims to measure

New cards

Validity is context-dependent

A test may be valid in one situation or population but not in another.

New cards

Validity is a unitary concept

Supported by multiple lines of evidence

New cards

Face Validity

Superficial appearance that a test measures what it claims to. Lacks evidence & is not a true form of validity.

New cards

Face Validity is still useful for _________

test taker motivation & acceptance when items appear relevant.

New cards

Content-Related Validity

Assesses how well test items represent the full scope of the construct or subject matter.

New cards

Tests have high content validity when ________

test items provide representative samples of all possible items in the relevant domain.

New cards

Primary applications of content-related validity

educational achievement tests, employment tests & medical testing

New cards

Content-related validity is judged ________

Logically, not statistically, often with expert evaluation.

Involves checking item wording, relevance & reading level

New cards

Construct underrepresentation

The test is missing important content (e.g., a math test with no geometry items).

Threat to construct-related validity

New cards

Construct irrelevant variance

scores influenced by unrelated factors (test anxiety…)

Threat to construct-related validity

New cards

What is the primary evidence of validity in achievement testing?

Content validity

New cards

Scenario: How can a psychometric theory test demonstrate content validity?

Items are based on relevant textbook chapters

Material is adequately sampled

New cards

What is the first step in establishing content validity?

Clearly defining the content to be covered.

New cards

What tool is often used to establish content validity?

A table of specifications

New cards

What framework is often used in a table of specifications for achievement tests?

Bloom’s taxonomy for cognitive domains

New cards

What must be demonstrated when a test is used for hiring or promotion?

That test items are work-related

New cards

How is the content for employment tests defined?

Through job analysis by a panel of experts specifying required knowledge and skills.

New cards

What is commonly used to match test content with job specifications?

A percentage agreement figure.

New cards

What method is used to assess item relevance in employment tests?

A three-point rating scale: panelists rate each item as essential, useful but not essential, or not necessary.

New cards

How is the Content Validity Ratio (CVR) calculated?

Based on the number of panelists rating an item as “essential” (ne) and the total number of panelists (N).
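
The card names the two inputs but not the formula itself. A minimal sketch, assuming the standard Lawshe formula CVR = (ne - N/2) / (N/2); the function name and the panel counts in the usage lines are hypothetical:

```python
def content_validity_ratio(n_essential: int, n_panelists: int) -> float:
    """Lawshe's content validity ratio: (ne - N/2) / (N/2)."""
    if n_panelists <= 0:
        raise ValueError("n_panelists must be positive")
    if not 0 <= n_essential <= n_panelists:
        raise ValueError("n_essential must be between 0 and n_panelists")
    half = n_panelists / 2
    return (n_essential - half) / half


# Hypothetical panel of 10 experts rating one item.
all_essential = content_validity_ratio(10, 10)   # every panelist says "essential"
half_essential = content_validity_ratio(5, 10)   # exactly half say "essential"
most_essential = content_validity_ratio(8, 10)   # 8 of 10 say "essential"
```

Under this formula the ratio is 1.0 when every panelist rates the item essential, 0.0 when exactly half do, and negative when fewer than half do.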

New cards

What does a higher CVR indicate?

Greater consensus that an item is essential

New cards

What is the range of the Content Validity Ratio (CVR)?

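From -1.00 (no panelist says essential) through 0 (50% say essential) to +1.00 (100% say essential). Items with a CVR at or below 0 lack majority support as essential.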

New cards

Criterion-Related Validity

Assesses how well test scores correlate with a specific external criterion.

The test serves as a proxy for the actual behaviour or outcome we aim to predict.

New cards

Predictive Validity

Test scores predict future performance on a relevant criterion.

New cards

Predictive Validity may be more time consuming but ________

better reflects real-world applications.

New cards

Concurrent Validity

Test scores are related to some criterion measure obtained at the same point in time (i.e., concurrently).

New cards

Concurrent validity is a special case of

predictive validity with a minimal time gap.

New cards

Types of statistical evidence for criterion-related validity?

Validity coefficient and decision theory/expectancy data.

New cards

What is a validity coefficient?

A correlation showing how well a test predicts or relates to a criterion.

New cards

Which correlation coefficients are typically used to express validity coefficients?

Pearson’s r or Spearman’s rho (for ordinal data).
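
As a rough illustration of how a validity coefficient might be computed, the sketch below implements Pearson's r directly from its definition. The function name and the score lists are hypothetical and chosen only to show the calculation; they are not data from any real test.

```python
import math


def pearson_r(test_scores, criterion_scores):
    """Pearson correlation between test scores and criterion scores."""
    n = len(test_scores)
    if n != len(criterion_scores) or n < 2:
        raise ValueError("need two equal-length lists with at least 2 values")
    mean_x = sum(test_scores) / n
    mean_y = sum(criterion_scores) / n
    # Sum of cross-products and sums of squares of deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(test_scores, criterion_scores))
    sxx = sum((x - mean_x) ** 2 for x in test_scores)
    syy = sum((y - mean_y) ** 2 for y in criterion_scores)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable has no variance")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical selection-test scores and later job-performance ratings.
test = [10, 12, 15, 18, 20]
performance = [3, 4, 4, 6, 7]
validity_coefficient = pearson_r(test, performance)
```

For ordinal data, the same function could be applied to the ranks of the scores, which is how Spearman's rho is defined when there are no ties.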

New cards

What does a larger correlation between test scores and criterion scores indicate?

Greater criterion-related validity.

New cards

What is the common range for an adequate validity coefficient?

.30 to .40

New cards

How common are validity coefficients above .60?

They are rare.

New cards

Why is the Standard Error of Estimate (SEest) reported with the validity coefficient?

Because test scores are imperfect predictors of criterion scores

New cards

What does the Standard Error of Estimate (SEest) represent?

The margin of error in predicted criterion scores due to imperfect validity

New cards

What is the formula for SEest?

SEest = SDy × √(1 - rxy²)

New cards

In the SEest formula, what does SDy represent?

The standard deviation of criterion scores

New cards

In the SEest formula, what does rxy represent?

The validity coefficient

New cards

What does SEest reflect in regression analysis?

Error of prediction from the regression line
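
A worked sketch of the SEest formula from the cards above, SEest = SDy × √(1 - rxy²). The function name and the input values are hypothetical:

```python
import math


def standard_error_of_estimate(sd_criterion: float, validity: float) -> float:
    """SEest = SDy * sqrt(1 - rxy^2)."""
    if not -1.0 <= validity <= 1.0:
        raise ValueError("validity coefficient must lie between -1 and 1")
    if sd_criterion < 0:
        raise ValueError("standard deviation cannot be negative")
    return sd_criterion * math.sqrt(1.0 - validity ** 2)


# Hypothetical case: criterion SD of 10 and a validity coefficient of .60.
# 1 - .36 = .64, and sqrt(.64) = .80, so SEest is about 8 criterion points.
se_est = standard_error_of_estimate(10.0, 0.60)
```

Even with a validity coefficient of .60, which the cards describe as rare, the prediction error is still about 80% of the criterion's standard deviation.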

New cards

What does the Standard Error of Measurement (SEM) reflect?

The margin of error in an individual’s test score due to test unreliability

New cards

What is the formula for SEM?

SEM = SD × √(1 - rxx)

New cards

In the SEM formula, what does SD represent?

The standard deviation of test scores

New cards

In the SEM formula, what does rxx represent?

The reliability coefficient
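
A worked sketch of the SEM formula from the cards above, SEM = SD × √(1 - rxx). The function name and the input values are hypothetical:

```python
import math


def standard_error_of_measurement(sd_test: float, reliability: float) -> float:
    """SEM = SD * sqrt(1 - rxx)."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    if sd_test < 0:
        raise ValueError("standard deviation cannot be negative")
    return sd_test * math.sqrt(1.0 - reliability)


# Hypothetical case: test SD of 15 and a reliability of .91.
# 1 - .91 = .09, and sqrt(.09) = .30, so SEM is about 4.5 score points.
sem = standard_error_of_measurement(15.0, 0.91)
```

Note the difference from SEest: the reliability coefficient enters unsquared here, whereas the validity coefficient is squared in the SEest formula.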

New cards

Cronbach & Meehl (1955)

Expanded the concept of validity to include both practical and theoretical dimensions.

New cards

What is a key requirement for construct validity?

The test should behave as predicted by the theory behind the construct.

New cards

What does it indicate if a test performs as theory predicts?

It strengthens confidence in both the test and the underlying theory.

New cards

What does it suggest if a test does not behave as the theory predicts?

The problem may lie with the theory or the test’s validity.

New cards

Why is construct validity important for many psychological traits?

Because they lack objective criteria, making criterion validity impractical.

New cards

How is construct validity established when no adequate criterion exists?

By defining the construct and developing/testing appropriate measures.

New cards

How is construct validity typically built?

Through multiple studies showing consistent relationships with other measures.

New cards

What does construct validity refer to?

Any evidence showing that a test measures its intended construct.

It encompasses all types of validity evidence, including content and criterion validity.

New cards

What is convergent evidence in construct validity?

When a test correlates well with other measures of the same construct.

New cards

What should valid tests show in terms of theory?

Expected theoretical relationships.

New cards

How is convergent evidence established?

Through multiple studies building a network of meaning around test scores.

New cards

What additional evidence is required to fully support construct validity?

Discriminant evidence.

New cards

What does discriminant evidence demonstrate?

That the test does not measure unrelated constructs, confirming its uniqueness.

New cards

What kind of correlations should a test have with unrelated constructs to show discriminant evidence?

Low correlations.

New cards

Why is discriminant evidence important?

It ensures the test measures something distinct and not redundant with other tests.

New cards

What does evaluating item or subtest homogeneity check for?

Whether the test measures a single construct.

New cards

How can developmental changes support construct validity?

If score changes across development align with theoretical expectations.

New cards

What does correlating test scores with related and unrelated measures assess?

Convergent and discriminant validity.

New cards

How can group differences support construct validity?

If score differences across groups match theoretical predictions.

New cards

What is the purpose of factor analysis in construct validity?

To examine the internal structure of the test.

New cards

Why analyze classification accuracy of test scores?

To see if scores allow proper identification or categorization of examinees.

New cards

How do intervention effects relate to construct validity?

If interventions produce expected score changes, this supports the test’s validity.

New cards

What limits the maximum possible validity of a test?

The square root of the product of the reliabilities of the two measures.
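
The ceiling described above can be written as r_xy(max) = √(rxx × ryy), where rxx is the reliability of the test and ryy is the reliability of the criterion. A minimal sketch, with a hypothetical function name and hypothetical reliabilities:

```python
import math


def max_validity(test_reliability: float, criterion_reliability: float) -> float:
    """Upper bound on the validity coefficient: sqrt(rxx * ryy)."""
    for r in (test_reliability, criterion_reliability):
        if not 0.0 <= r <= 1.0:
            raise ValueError("reliabilities must lie between 0 and 1")
    return math.sqrt(test_reliability * criterion_reliability)


# Hypothetical case: test reliability .81, criterion reliability .64.
# .81 * .64 = .5184, and sqrt(.5184) = .72, so the observed validity
# coefficient cannot be expected to exceed about .72.
ceiling = max_validity(0.81, 0.64)
```

An unreliable criterion lowers the ceiling just as much as an unreliable test does, which is one reason observed validity coefficients tend to be modest.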

New cards

How strong can a test's correlation with another variable be?

No stronger than its correlation with itself (its reliability).

New cards

Who shares responsibility for test validation?

Both the test developer and the test user.

New cards

What is the test developer's responsibility in validation?

To provide evidence and rationale for the test's intended use.

New cards

What is the test user's responsibility in validation?

Evaluating the evidence in the particular setting in which the test is to be used