Psychological Testing & Assessment – Core Vocabulary

0.0(0)

Studied by 0 people

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/127

Earn XP

Description and Tags

Vocabulary flashcards summarizing essential psychometric, statistical, and ethical terms drawn from the lecture notes to aid RPm exam preparation.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

128 Terms

New cards

Psychological Testing

Process of measuring psychology-related variables with standardized devices or procedures to obtain numerical samples of behavior.

New cards

Psychological Assessment

Gathering and integrating psychology-related data to answer a referral question through multiple evaluation tools.

New cards

Ability (Maximal Performance) Test

Assessment designed to determine what a person can do under optimal conditions.

New cards

Achievement Test

Exam that measures previous learning or mastery of specific subject matter.

New cards

Aptitude Test

Instrument that estimates the potential for learning or acquiring specific skills; emphasizes predictive validity.

New cards

Intelligence Test

Measure of general potential for problem solving, adaptation, abstract thought, and learning from experience.

New cards

Typical Performance Test

Assessment of usual or habitual thoughts, feelings, and behaviors with no right or wrong answers.

New cards

Personality Test

Tool that measures enduring dispositions and preferences, either ideographically or nomothetically.

New cards

Structured Personality Test

Self-report inventory providing fixed responses (e.g., true/false) to statements about typical behavior.

New cards

Projective Personality Test

Unstructured measure using ambiguous stimuli to elicit projection of inner dynamics.

New cards

Speed Test

Assessment in which performance is gauged by the number of correct responses within a strict time limit.

New cards

Power Test

Exam emphasizing item difficulty; ample time is provided so speed plays little role.

New cards

Cut Score

Reference point, set by judgment, that divides test scores into categories (e.g., pass/fail).

New cards

Psychometric Soundness

Overall technical quality of a measurement instrument, encompassing reliability and validity.

New cards

Reliability

Consistency or dependability of test scores across administrations, items, or raters.

New cards

Validity

Degree to which evidence and theory support the interpretations of test scores for intended uses.

New cards

Trait

Relatively enduring characteristic that differentiates one individual from another.

New cards

State

Temporary condition reflecting how someone feels or behaves at a specific moment.

New cards

Construct

Scientific concept inferred from behavior and used to explain psychological phenomena.

New cards

Classical Test Theory (CTT)

Model asserting that an observed score equals a true score plus random measurement error.

New cards

Error Variance

Portion of score variance attributable to factors other than the trait being measured.

New cards

Standard Error of Measurement (SEM)

Index reflecting the expected error in an individual observed score; inversely related to reliability.

New cards

Test–Retest Reliability

Correlation of scores from the same test administered to the same group on two occasions.

New cards

Parallel-Forms Reliability

Correlation between scores on two equivalent versions of a test given to the same group.

New cards

Split-Half Reliability

Internal consistency estimate obtained by correlating two halves of a single test administration.

New cards

Cronbach’s Alpha

Coefficient estimating internal consistency for scales with items scored in multiple categories.

New cards

KR-20

Reliability formula for dichotomously scored items with unequal item variances.

New cards

Inter-Scorer Reliability

Degree of agreement among different raters scoring the same performance.

New cards

Face Validity

Extent to which a test appears, on the surface, to measure the intended construct.

New cards

Content Validity

Judgment of how well test items represent the domain of interest.

New cards

Construct Validity

Evidence that test scores relate to other measures as predicted by theory of the underlying construct.

New cards

Criterion Validity

How well test scores predict or correlate with an external criterion (concurrent or predictive).

New cards

Convergent Evidence

High correlations between the test and other measures of the same construct.

New cards

Discriminant Evidence

Low correlations between the test and measures of unrelated constructs.

New cards

Incremental Validity

Amount of additional predictive value a new test provides beyond existing measures.

New cards

Multitrait-Multimethod Matrix

Table of correlations used to examine convergent and discriminant validity across traits and methods.

New cards

Factor Analysis

Statistical technique identifying latent variables that account for interrelationships among test items.

New cards

Standard Score

Converted score expressed relative to a distribution’s mean and standard deviation (e.g., z, T, stanine).

New cards

Z-Score

Standard score with mean 0 and SD 1 indicating how many SDs a raw score lies from the mean.

New cards

T-Score

Standard score with mean 50 and SD 10 eliminating negative values.

New cards

Stanine

Nine-point standard score scale with mean 5 and SD 2.

New cards

Norm-Referenced Test

Assessment interpreted by comparing an individual’s score to scores of a norm group.

New cards

Criterion-Referenced Test

Test interpreted by comparing performance to a defined level of proficiency, not to other people.

New cards

Base Rate

Proportion of the population that actually possesses the characteristic being predicted.

New cards

Sensitivity (True Positive Rate)

Ability of a test to correctly identify individuals who have the condition of interest.

New cards

Specificity (True Negative Rate)

Ability of a test to correctly identify individuals who do not have the condition of interest.

New cards

False Positive

Case where test predicts success or presence but outcome is failure or absence.

New cards

False Negative

Case where test predicts failure or absence but outcome is success or presence.

New cards

Utility

Practical value of a test in improving decision-making efficiency, often evaluated via cost–benefit analysis.

New cards

Taylor-Russell Tables

Tables estimating improvement in selection accuracy when a test is added to a decision process.

New cards

Item Difficulty Index (p)

Proportion of examinees who answer an item correctly; higher values indicate easier items.

New cards

Item Discrimination Index (D)

Statistic reflecting how well an item differentiates high scorers from low scorers on the total test.

New cards

Item Characteristic Curve (ICC)

Graph in IRT showing probability of a correct response across levels of ability.

New cards

Guessing Parameter

IRT value estimating likelihood of low-ability examinees answering an item correctly by chance.

New cards

Computerized Adaptive Testing (CAT)

Interactive testing method in which subsequent items are selected based on a test taker’s previous responses.

New cards

Pilot Study

Preliminary research conducted to refine items and administration procedures before full test development.

New cards

Item Pool

Large reservoir of potential items created during test construction.

New cards

Test Blueprint (Table of Specifications)

Plan outlining content areas and cognitive levels to ensure representative item sampling.

New cards

Ipsative Scoring

Comparing an individual’s scores on different scales within the same test to each other.

New cards

Cumulative Scoring Model

Approach where higher total of keyed responses indicates higher standing on the trait.

New cards

Empirical Criterion Keying

Selecting items based on their ability to discriminate between groups defined by an external criterion.

New cards

Differential Item Functioning (DIF)

Condition where individuals from different groups but equal ability have differing probabilities of endorsing an item.

New cards

Flynn Effect

Observed generational rise in average intelligence test scores over time.

New cards

Culture Fair Test

Instrument designed to minimize cultural content and reduce bias across groups.

New cards

Culture Loading

Extent to which a test is saturated with language, knowledge, or values specific to a culture.

New cards

Back-Translation

Process of translating a test into another language and then back to the original to check equivalence.

New cards

Standard Error of Estimate

Standard deviation of residuals in regression, indicating accuracy of predicted criterion scores.

New cards

Hit Rate

Proportion of correct decisions (true positives + true negatives) made using the test.

New cards

Groupthink

Decision-making bias where desire for consensus overrides appraisal of alternative courses.

New cards

Halo Effect

Rating error in which a rater’s overall impression influences specific ratings.

New cards

Central Tendency Error

Rater bias of clustering evaluations around the middle category of a scale.

New cards

Leniency Error

Systematic rater tendency to give overly favorable evaluations.

New cards

Severity Error

Systematic rater tendency to give overly harsh evaluations.

New cards

Barnum Effect

People’s tendency to accept vague, general personality descriptions as uniquely applicable to themselves.

New cards

Reactance (Hawthorne Effect)

Change in behavior that occurs because individuals know they are being observed or evaluated.

New cards

True Score

Hypothetical average score a person would obtain over infinite administrations free from error.

New cards

Standard Error of the Difference

Index used to judge whether the difference between two test scores is statistically significant.

New cards

Confidence Interval

Range of scores around an observed score likely to contain the true score at a given probability level.

New cards

Restriction of Range

Condition where limited variability in scores lowers observed correlations.

New cards

Skewness

Degree of asymmetry in a distribution; positive skew has a long right tail, negative skew a long left tail.

New cards

Kurtosis

Measure of peakedness or flatness of a distribution relative to normal.

New cards

Mean

Arithmetic average; point of least squares in a distribution.

New cards

Median

Middle score that divides a distribution into two equal halves; resistant to outliers.

New cards

Mode

Most frequently occurring score in a distribution.

New cards

Variance

Average squared deviation of scores from the mean; basis for standard deviation.

New cards

Interquartile Range

Difference between the first and third quartiles; middle 50% spread.

New cards

Pearson r

Correlation coefficient for linear relationship between two continuous variables.

New cards

Spearman Rho

Rank-order correlation coefficient for ordinal data or non-linear monotonic relationships.

New cards

Point-Biserial Correlation

Correlation between a continuous variable and a true dichotomy.

New cards

Phi Coefficient

Correlation between two true dichotomous variables.

New cards

Independent t-Test

Statistical test comparing means of two independent groups.

New cards

Paired t-Test

Test comparing means of two related scores from the same group.

New cards

One-Way ANOVA

Analysis testing mean differences among three or more groups on one independent variable.

New cards

ANCOVA

ANOVA that statistically controls for effects of a covariate continuous variable.

New cards

Chi-Square Test of Independence

Non-parametric test assessing association between two categorical variables.

New cards

Kolmogorov-Smirnov Test

Test for normality used with sample sizes greater than 50.

New cards

Levene’s Test

Procedure for testing equality of variances across groups.

New cards

Item Response Theory (IRT)

Measurement model relating probability of a response to latent trait level and item parameters.

New cards

Latent Trait

Unobserved characteristic quantified in IRT models.

100

New cards

Discrimination Parameter

IRT indicator of how well an item differentiates among examinees with different ability levels.