1.4 Validity

0.0(0)
Studied by 0 people
call kaiCall Kai
Locked
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/86

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 2:28 PM on 7/1/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai
Chat

No analytics yet

Send a link to your students to track their progress

87 Terms

1
New cards

Validity

A test measures what it's supposed to measure in a particular context

2
New cards

Validity Coefficient

Correlation coefficient that provides a measure of the relationship between test scores and scores on the criterion measure

3
New cards

Inference

A logical result or deduction

4
New cards

Validation

Process of gathering and evaluating evidence about validity

5
New cards

Local Validation Studies

Studies that are absolutely necessary when the test user plans to alter in some ways the format, instructions, language, or content of the test

6
New cards

Ecological Validity

Refers to a judgment regarding how well a test measures what it purports to measure at the time and place that the variable being measured is actually emitted

7
New cards

Internal, External, Conceptual, & Face

Types of Validity

8
New cards

Internal Validity

Degree of control among variables in the study (increased through random assignment)

9
New cards

External Validity

Generalizability of the research results (increased through random selection)

10
New cards

Conceptual Validity

How well a test measures what it's designed to evaluate

11
New cards

Face Validity

A test appears to measure what it actually measures at face-value

12
New cards

Content, Criterion, & Construct

Trinitarian View of Validity

13
New cards

Content Validity

An evaluation of the subjects, topics, or content covered by the item in the test

14
New cards

Criterion-Related Validity

Evaluating the relationship of scores obtained on the test to scores on other tests or measures

15
New cards

Construct Validity

How scores on the test can be understood within some theoretical framework for understanding the construct that the test was designed to measure

16
New cards

Content Validity

A fundamental psychometric property that addresses the extent to which a measurement instrument (e.g., a test, questionnaire, or observational tool) adequately and comprehensively represents all relevant facets or domains of the construct it is designed to measure

“Does the test content truly reflect the content domain it’s supposed to cover?”

17
New cards

Construct

Refers to a theoretical concept, trait, or attribute that cannot be directly observed or measured (e.g., intelligence, depression, anxiety, job satisfaction, mathematical ability)

18
New cards

Construct Underrepresentation

Failure to capture important components of a construct

19
New cards

Construct-Irrelevant Variance

Happens when scores are influenced by factors irrelevant to the construct

20
New cards

Relevance & Representativeness

Core Principles of Content Validity

21
New cards

Relevance

Core principle of content validity that is directly pertinent to the construct being measured

22
New cards

Representativeness

Core principle of content validity that adequately samples all important sub-domains or facets of the construct

23
New cards

Content Validity

What type of validity is a necessary precursor for other forms of validity?

24
New cards

Test Blueprint

A detailed plan of the content, organization, and quantity of the items that a test will contain

25
New cards

Subject Matter Expert (SME) Review

It is when a panel of qualified subject matter experts (SMEs) independently reviews each item of the instrument against the defined content domain

26
New cards

Lawshe's Content Validity Ratio (CVR) & Content Validity Index (CVI)

Quantitative Indices for Content Validity

27
New cards

Lawshe’s Content Validity Ratio (CVR)

For each item, SMEs typically rate its essentiality (e.g., “essential,” “useful but not essential,” “not necessary”)

28
New cards

Positive CVR

In CVR , this result means more than half of the SMEs consider the item essential

29
New cards

Zero CVR

In CVR, this result means exactly half of the experts rate the item as essential

30
New cards

Content Validity Index (CVI)

Can be calculated at the item level (I-CVI) or scale level (S-CVI)

31
New cards

I-CVI

The proportion of SMEs who rate an item as highly relevant (e.g., 3 or 4 on a 4-point relevance scale)

32
New cards

S-CVI

The average of the I-CVIs for all items on the scale (S-CVI/Ave) or the proportion of items that achieve universal agreement (e.g., all experts rate it as 3 or 4) (S-CVI/UA)

33
New cards

Pilot Testing and Cognitive Interviewing

The instrument can be pilot tested with a sample from the target population

34
New cards

Criterion-Related Validity

Demonstrates how well a test or measurement instrument predicts or correlates with a relevant external criterion

35
New cards

Criterion

A separate, independent, and often objective measure that serves as a standard against which the test scores are compared

36
New cards

Concurrent & Predictive

Types of Criterion-Related Validity

37
New cards

Concurrent Validity

Established when the test scores and the criterion measures are collected at approximately the same time

38
New cards

Concurrent Validity

It assesses how well the test can estimate an individual’s current standing on the criterion and is often used when a new test is developed to replace an existing, well-validated but perhaps more cumbersome, expensive, or time-consuming measure

39
New cards

Predictive Validity

Established when the test scores are collected at one point in time, and the criterion measures are collected at a later point in time

40
New cards

Predictive Validity

It assesses how well the test can predict future performance, behavior, or outcomes

41
New cards

Incremental Validity

The degree to which an additional predictor explains something about the criterion measure that is not explained by the predictors already in use

42
New cards

Construct Validity

A central concept in psychometrics and refers to the degree to which a test or measurement instrument adequately measures the theoretical construct it purports to measure

43
New cards
Construct
A theoretical, abstract concept or attribute that is not directly observable but is inferred from observable behaviors or indicators
44
New cards
Clear conceptualization, Operationalization, & Empirical Evidence
Core Principles of Construct Validity
45
New cards
Clear Conceptualization
A precise and well-defined theoretical understanding of the construct, including its dimensions, relationships to other constructs, and expected manifestations
46
New cards
Operationalization
Translating the abstract construct into concrete, measurable indicators or test items
47
New cards
Empirical Evidence
Gathering observable data that supports or refutes the hypothesized relationships between the test scores and other variables, consistent with the theoretical framework of the construct
48
New cards

Evidence of Homogeneity, Evidence of Changes with Age, Evidence of Pretest-Posttest Changes, Known-Groups Validity (Group Differences), Convergent Validity, & Discriminant Validity

Evidences of Construct Validity

49
New cards
Evidence of Homogeneity
Broader concept in construct validity that refers to the entire network of theoretical relationships between a construct and other constructs, as well as observable behaviors
50
New cards
Evidence of changes with age
Evidence observed if the construct is expected to change over time (e.g., cognitive ability in children, emotional maturity), the test scores should reflect these expected developmental patterns
51
New cards
Evidence of pretest-posttest changes
Evidence if the construct is expected to change in response to a specific intervention (e.g., a therapy for depression, a training)
52
New cards
depression, a training)
53
New cards
Known-Groups Validity (Group Differences)
Can differentiate between groups of individuals who are known, based on external criteria, to differ on the construct being measured
54
New cards
Convergent Validity
Scores on the test undergoing constructed validation tend to correlate highly in the predicted direction with scores on older, more established, and already validated test design to measure the same construct
55
New cards
Discriminant Validity
Validity showing a little relationship between test scores and/or other variables with which scores on the test being construct-validated should theoretically be correlated
56
New cards
Multi-trait Multi-method Matrix (MTMM)
It is a method for assessing the construct validity of a set of measures in a study
57
New cards
Monomethod-monotrait Correlations (Reliability Diagonal), Heteromethod-monotrait Correlations (Validity Diagonal), Monomethod-heterotrait Correlations (Heterotrait-Monomethod Triangles, & Heteromethod-heterotrait Correlations (Heterotrait-Heteromethod Triangles)
Analysis of 4 Types of Correlation Coefficients
58
New cards
Monomethod-monotrait Correlations (Reliability Diagonal)
Correlation coefficient in which the correlations of a trait with itself, measured by the same method, represent the reliability of each measure and should be the highest in the entire matrix
59
New cards
Heteromethod-monotrait Correlations (Validity Diagonal)
Correlation Coefficient that represents the same trait measured by different methods, significant correlations here provide evidence of convergent validity, meaning that different measures of the same trait converge or agree
60
New cards
Monomethod-heterotrait Correlations (Heterotrait-Monomethod Triangles)
Correlation coefficient that is between different traits measured by same methods—should be low, providing evidence of discriminant validity, meaning the measures can discriminate between different constructs
61
New cards
Heteromethod-heterotrait Correlations (Heterotrait-Heteromethod Triangles)
Correlation coefficient that is between different traits measured by different methods—should be the lowest in the matrix, further supporting discriminant validity
62
New cards
Factor Analysis
A class of mathematics procedures designed to identify factors or specific variables that are typically attributes, characteristics, or dimensions on which people may differ
63
New cards
Data Reduction
The most common purpose of factor analysis is to simplify complex data by reducing the number of variables, making dataset easier to work and interpret
64
New cards
Structure Discovery
Factor Analysis helps uncover the underlying structure or dimensions within a set of observed variables
65
New cards
Construct Validation
Factor Analysis is used to develop and refine measurement scales (like personality tests) to ensure that the items truly measure the intended underlying construct (e.g., intelligence, anxiety)
66
New cards
Latent Variables
Provides an estimate of how strongly each observed variable relates to each factor, known as factor loading
67
New cards
Factor Loading
Conveys information about the extent to which the factor determines the test score
68
New cards
Exploratory & Confirmatory
Types of Factor Analysis
69
New cards
Exploratory Factor Analysis
Type of Factor Analysis used to explore and discover the underlying factor structure when the researcher has no preconceived idea about how many factors exist or which variables belong to which factor. “theory-generating”
70
New cards
Confirmatory Factor Analysis
Type of Factor Analysis used to confirm a hypothesized factor structure. “theory-testing”
71
New cards
Principal Component Analysis (PCA)
Method used to reduce data dimensionality and summarize variance with fewer variables
72
New cards
Dimensionality Reduction
The primary goal of PCA that is reducing the number of variables (features) in the data set
73
New cards
Variance Maximization
The principal components of PCA are chosen to capture the maximum possible variance in the data
74
New cards
Orthogonality. The principal components of PCA are orthogonal (perpendicular) to each other, meaning they are uncorrelated
75
New cards
Scree Plot
In PCA, it is the plot of the eigenvalues vs. the number of the principal component. It helps in deciding how many components to retain by looking for an “elbow” or point where the explained variance drops sharply (point of inflection)
76
New cards
Explained Variance Ratio
In PCA, it is the proportion of the dataset’s variance that lies along each principal component
77
New cards
Kaiser Criterion (K1 Rule) & Elbow Method (Scree Test)
Two Common Traditional Heuristic Methods in EFA and PCA
78
New cards
Kaiser Criterion (K1 Rule)
Criterion used to retain all factors or components that have an eigenvalue greater than 1.0
79
New cards
Limitation of the Kaiser Criterion
Tends to overestimate the number of factors, especially when the number of variables is large. It is generally not considered the most accurate method
80
New cards
Elbow Method (Scree Test)
Criterion used to plot the eigenvalues on the Y-axis against the factor number on the X-axis. Retain all factors that appear before the “elbow” (the point where the slope of the curve sharply levels off)
81
New cards
Limitation of the Elbow Method
Highly subjective, as the “elbow” point is often ambiguous or hard to define, leading different researchers to select different numbers of factors from the same plot
82
New cards
Cross-Validation
Revalidation of the test to a criterion based on another group different from the original group form which the test was validated
83
New cards
Validity Shrinkage, Co-Validation, & Co-Norming
Concepts related to Cross-Validation
84
New cards
Cross-Validation
Revalidation of the test to a criterion based on another group different from the original group form which the test was validated
85
New cards
Validity Shrinkage
Decrease in validity after cross-validation
86
New cards
Co-Validation
Validation of more than one test from the same group in cross-validation
87
New cards
Co-Norming
In cross-validation, it is norming more than one test from the same group