PSYCHOMET FINALS: Understanding Test Utility + Test Scoring

50 Terms


Cost efficiency; time factor; how a valid and reliable test compares with another valid and reliable test; how useful a test is for diagnosing, treating, or classifying patients; how an admissions test can “trim down” the number of applicants; whether adding another test to the battery will help; having a test versus not having a test

7 factors that must be considered in making decisions about using certain tests


Culture-Fair Test

A test designed so that scores are unaffected by cultural differences.


Culture-Bound Test

A test whose content is specific to a particular language or culture.


Test Utility

The usefulness or practical value of testing to improve efficiency.
A test being both reliable and valid does not necessarily mean or ensure that it is useful.


MMPI

Primarily designed to assess clinical conditions (e.g., depression, schizophrenia, paranoia, psychopathic deviate, hypomania).
- Not effective for evaluating non-clinical or normal populations.
- Specifically useful for clinical diagnosis rather than general personality assessment.


567 items

How many items does the MMPI-2 have?


NEO-PI

Measures five major personality domains, often referred to as OCEAN (Openness, Conscientiousness, Extraversion, Agreeableness, Neuroticism).
- In certain contexts, this test can be more useful than the MMPI, especially when assessing normal populations or when the focus is on general personality traits rather than clinical conditions.


240 items

How many items does the NEO-PI test have?


Psychometric Soundness, Costs, Benefits

3 factors affecting Test Utility


Psychometric Soundness

3 Factors affecting Test Utility

The reliability and validity of a test (i.e., reliability and validity coefficients are acceptably high).
- A reliability coefficient higher than .95 is questionable.


.95

If the reliability of a test is higher than _____, it is often questionable because it may indicate redundancy in test items (e.g., items like “I am happy,” “I am joyful,” and “I am jovial” all reflect the same concept, which adds no value).
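A minimal sketch of why near-duplicate items push reliability toward 1.0. It assumes reliability is estimated with Cronbach's alpha (the card does not name a coefficient), and the ratings are invented for illustration.

```python
from statistics import pvariance

def cronbach_alpha(items):
    """items: one list of scores per item, same respondents in the same order."""
    k = len(items)
    totals = [sum(scores) for scores in zip(*items)]   # total score per respondent
    item_variance = sum(pvariance(col) for col in items)
    return (k / (k - 1)) * (1 - item_variance / pvariance(totals))

# Hypothetical 1-5 ratings from six respondents on three near-synonymous items
happy  = [5, 4, 2, 1, 3, 4]
joyful = [5, 4, 2, 1, 3, 5]
jovial = [5, 4, 1, 1, 3, 4]

alpha = cronbach_alpha([happy, joyful, jovial])
print(round(alpha, 3))  # well above .95 for these redundant items
```

Because the three items move almost in lockstep, the total-score variance dwarfs the summed item variances and alpha lands very close to 1.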


Index of Utility

The practical value of the information derived from scores on a test (considering both reliability and validity).


Convergent Validity

This is when scores from one test positively correlate with scores from another similar test. Excessively high ___________ can be concerning, as it suggests that the new test offers no additional insights or unique information, which reduces its utility.
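As a rough illustration, convergent validity is usually checked with a correlation coefficient between the two sets of scores. The sketch below assumes Pearson's r; the score lists are hypothetical.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation between two equal-length score lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)

# Hypothetical scores of the same six people on an established test and a new test
established = [10, 12, 15, 18, 20, 25]
new_test    = [11, 12, 16, 18, 21, 25]

r = pearson_r(established, new_test)
print(round(r, 3))
```

An r this close to 1 would suggest the new test ranks people almost exactly as the established one does, which is the "no additional insight" situation the card warns about.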


valid

Normally, a ______ test is most likely going to be useful. But there are other factors that must be considered in determining a test’s utility.


Costs

3 Factors affecting Test Utility

Disadvantages, losses, or expenses in both economic and non-economic terms.
The usual meaning, of course, is economic.


Benefits

3 Factors affecting Test Utility

Profits, gains, or advantages derived from the use of a particular test.
While testing can have some cost to the company, the economic benefits can be tremendous, in terms of:
- Increase in quantity and quality of worker performance;
- Decrease in competency gaps (requiring training), accidents, and employee turnover.


Use of Expectancy Data, Use of Brogden-Cronbach-Gleser Formula, Decision Theory and Test Utility, Some Practical Considerations

4 ways to conduct Utility Analysis


Expectancy Table

Provides an indication of the likelihood that a test-taker will score within some interval of scores on a criterion measure — “passing,” “acceptable,” or “failing.”
- Can provide information helpful to decision makers.
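One way to picture an expectancy table is as a cross-tabulation of test-score bands against criterion outcomes. The sketch below is an assumption-laden example: the score bands, outcome labels, and records are all hypothetical.

```python
from collections import defaultdict

def expectancy_table(records, bands):
    """records: (test_score, criterion_outcome) pairs.
    bands: (low, high) inclusive score intervals.
    Returns, per band, the proportion of people in each criterion category."""
    counts = {band: defaultdict(int) for band in bands}
    for score, outcome in records:
        for low, high in bands:
            if low <= score <= high:
                counts[(low, high)][outcome] += 1
                break
    table = {}
    for band, outcomes in counts.items():
        total = sum(outcomes.values())
        table[band] = {o: n / total for o, n in outcomes.items()} if total else {}
    return table

# Hypothetical past employees: test score and later job rating
records = [(35, "failing"), (42, "failing"), (48, "acceptable"), (55, "acceptable"),
           (58, "failing"), (63, "passing"), (67, "acceptable"), (72, "passing"),
           (78, "passing"), (85, "passing")]
bands = [(0, 49), (50, 69), (70, 100)]

table = expectancy_table(records, bands)
for band, proportions in table.items():
    print(band, dict(proportions))
```

A decision maker would read each row as "of the people who scored in this band, this proportion ended up in each criterion category."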


Taylor-Russell Table

Tables typically used to help decide whether a test is worth using for hiring employees, by estimating how much the test improves selection given its validity, the selection ratio, and the base rate.


Brogden-Cronbach-Gleser Formula

Used to calculate the dollar amount of a utility gain resulting from the use of a particular selection instrument under specified conditions.
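The card does not state the formula itself. The sketch below assumes the form commonly given in textbooks, utility gain = (N)(T)(r_xy)(SD_y)(Z_m) - (N)(C), and should be checked against your own reference. The parameter names and all numbers are hypothetical; the number tested is kept separate from the number selected because sources differ on what the second N refers to.

```python
def bcg_utility_gain(n_selected, tenure_years, validity, sd_dollars,
                     mean_z_selected, n_tested, cost_per_applicant):
    """Estimated dollar gain from using a selection test (assumed textbook form).

    n_selected         -- N: applicants selected per year
    tenure_years       -- T: average length of time in the position
    validity           -- r_xy: criterion-related validity coefficient
    sd_dollars         -- SD_y: standard deviation of job performance in dollars
    mean_z_selected    -- Z_m: mean standardized test score of those selected
    n_tested           -- applicants tested
    cost_per_applicant -- C: cost of testing one applicant
    """
    benefit = n_selected * tenure_years * validity * sd_dollars * mean_z_selected
    cost = n_tested * cost_per_applicant
    return benefit - cost

# Hypothetical scenario: hire 10 of 100 tested applicants
gain = bcg_utility_gain(n_selected=10, tenure_years=2, validity=0.40,
                        sd_dollars=10_000, mean_z_selected=1.0,
                        n_tested=100, cost_per_applicant=50)
print(gain)
```

With these made-up inputs the benefit term is 80,000 and the testing cost is 5,000, so the estimated gain is 75,000.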


Utility Gain

Refers to an estimate of the benefit (monetary or otherwise) of using a particular test or selection method.


Decision Theory

Recommended to determine Test Utility by Cronbach and Gleser.
To illustrate this, we need to recall five terms: base rate, hit rate, miss rate, false positive, and false negative.


Base Rate

The extent to which a particular trait, behavior, characteristic, or attribute exists in the population (expressed as a proportion).


Hit Rate

The proportion of people a test accurately identifies as possessing or exhibiting a particular trait, behavior, characteristic, or attribute.


Miss Rate

The proportion of people the test fails to identify as having, or not having, a particular characteristic or attribute.


False Positive

A miss wherein the test predicted that the test-taker did possess the characteristic or attribute being measured when in fact the test-taker did not.


False Negative

A miss wherein the test predicted that the test-taker did not possess the characteristic or attribute being measured when the test-taker actually did.
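The five decision-theory terms can be computed from paired predictions and actual outcomes. This is a sketch under one stated assumption: a "hit" is counted as any correct classification, so hit rate and miss rate sum to 1. The data are invented.

```python
def classification_rates(predicted, actual):
    """predicted / actual: parallel lists of booleans (True = has the attribute)."""
    n = len(actual)
    pairs = list(zip(predicted, actual))
    hits = sum(p == a for p, a in pairs)             # test agrees with reality
    false_pos = sum(p and not a for p, a in pairs)   # predicted yes, actually no
    false_neg = sum(a and not p for p, a in pairs)   # predicted no, actually yes
    return {
        "base_rate": sum(actual) / n,   # proportion who truly have the attribute
        "hit_rate": hits / n,
        "miss_rate": (false_pos + false_neg) / n,
        "false_positives": false_pos,
        "false_negatives": false_neg,
    }

# Hypothetical results for ten test-takers
predicted = [True, True, True, False, False, False, True, False, False, False]
actual    = [True, True, False, False, False, True, True, False, False, False]

rates = classification_rates(predicted, actual)
print(rates)
```

Here four of ten people truly have the attribute (base rate .40), the test is right for eight of them, and the two misses split into one false positive and one false negative.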


perfect predictors

Tests are often “assumed” to be __________ of future performance.
- That is, those who score above the cut-off score are expected to be successful on the job, and those who do not meet the cut-off score are predicted to be unsuccessful.


Decision Theory

____________ provides guidelines for setting optimal cut-off scores.
- In certain professions, such as airline pilots and surgeons, false negatives are preferable to false positives, because selecting an unqualified person is far more costly than rejecting a qualified one.
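The trade-off between the two kinds of miss can be seen by moving the cut-off score. A small sketch with hypothetical applicants, where everyone at or above the cut-off is selected:

```python
def errors_at_cutoff(scores, successful, cutoff):
    """Return (false_positives, false_negatives) for a given cut-off score."""
    pairs = list(zip(scores, successful))
    false_pos = sum(s >= cutoff and not ok for s, ok in pairs)  # selected but unsuccessful
    false_neg = sum(s < cutoff and ok for s, ok in pairs)       # rejected but would succeed
    return false_pos, false_neg

# Hypothetical test scores and whether each person later succeeded on the job
scores     = [52, 58, 61, 64, 67, 70, 73, 77, 82, 90]
successful = [False, False, True, False, True, False, True, True, True, True]

lenient = errors_at_cutoff(scores, successful, 60)
strict  = errors_at_cutoff(scores, successful, 75)
print(lenient, strict)
```

In this made-up data the lenient cut-off lets in two people who fail, while the strict cut-off lets in none but turns away three who would have succeeded. For high-stakes jobs the strict setting is the safer error to make.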


The Pool of Job Applicants, The Complexity of the Job, The Cut-off Score Used

3 Practical Considerations in Test Utility


The Pool of Job Applicants

Utility estimates assume that there is a steady supply of viable applicants to occupy the positions at stake.
Some professions have few qualified applicants, and even qualified applicants may not accept the position when it is offered.


The Complexity of the Job

Experts disagree on whether it is appropriate to apply the same utility models to jobs of varying complexity (e.g., a highly complex job may have more stringent standards of successful performance).


The Cut-off Score Used

A (usually numerical) reference point derived as a result of a judgment and used to divide a set of data into two or more classifications, with some action to be taken or some inference to be made on the basis of these classifications.
Can be a Relative Cut Score or a Fixed Cut Score.


Relative Cut Score

A reference point—in a distribution of test scores used to divide a set of data into two or more classifications—that is set based on norm-related considerations rather than on the relationship of test scores to a criterion.
Because this type of _______ is set with reference to the performance of a group (or some target segment of a group), it is also referred to as a norm-referenced cut score.


Norm-referenced cut score

Other term for Relative Cut Score.
This is set with reference to the performance of a group (or some target segment of a group).


Fixed Cut Score

A reference point—in a distribution of test scores used to divide a set of data into two or more classifications—that is typically set with reference to a judgment concerning a minimum level of proficiency required to be included in a particular classification.
May also be referred to as an absolute cut score.
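To contrast the two kinds of cut score, the sketch below applies a relative rule ("top 20% of this group") and a fixed rule ("at least 75") to two hypothetical groups. The 20% figure, the value 75, and the scores are assumptions made for illustration.

```python
def relative_cut_score(scores, top_fraction):
    """Norm-referenced: lowest score that still falls in the top fraction of this group."""
    ranked = sorted(scores, reverse=True)
    n_selected = max(1, round(len(ranked) * top_fraction))
    return ranked[n_selected - 1]

FIXED_CUT = 75  # hypothetical minimum-proficiency standard set by judgment

group_a = [55, 60, 62, 68, 70, 71, 74, 78, 80, 91]
group_b = [70, 75, 79, 81, 84, 86, 88, 90, 93, 97]

cut_a = relative_cut_score(group_a, 0.20)   # depends on how group A performed
cut_b = relative_cut_score(group_b, 0.20)   # depends on how group B performed
passed_fixed_a = sum(s >= FIXED_CUT for s in group_a)
passed_fixed_b = sum(s >= FIXED_CUT for s in group_b)
print(cut_a, cut_b, passed_fixed_a, passed_fixed_b)
```

The relative cut score moves with the group (80 for group A, 93 for group B) while always passing two people; the fixed cut score stays at 75 and the number passing changes instead (3 versus 9).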


Absolute cut score

Other term for Fixed Cut Score.


Multiple Cut Scores

Refers to the use of two or more cut scores with reference to one predictor for the purpose of categorizing test-takers.
For example, different cut scores are set to correspond to ratings of A, B, C, D, etc.
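A letter-grade scale is the familiar case of multiple cut scores on one predictor. A minimal sketch, with cut values chosen arbitrarily for illustration:

```python
def letter_grade(score, cuts=((90, "A"), (80, "B"), (70, "C"), (60, "D"))):
    """Map one score to a category using several cut scores, highest first.
    The cut values are hypothetical defaults."""
    for minimum, grade in cuts:
        if score >= minimum:
            return grade
    return "F"  # below every cut score

print([letter_grade(s) for s in (95, 80, 72, 59)])
```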


Multiple Hurdles

The achievement of one cut-off score is necessary to proceed to the next stage in the evaluation process.
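Unlike multiple cut scores, multiple hurdles involve several predictors taken in sequence, and failing any one stage ends the process. A sketch with hypothetical stage names and cut-off scores:

```python
def run_hurdles(applicant_scores, hurdles):
    """hurdles: ordered (stage_name, cut_score) pairs.
    Returns (passed_all, stage_where_stopped). A missing score counts as a failure."""
    for stage, cut in hurdles:
        if applicant_scores.get(stage, 0) < cut:
            return False, stage   # stopped here; later stages are never reached
    return True, None

# Hypothetical three-stage selection process
hurdles = [("written_exam", 70), ("interview", 3), ("work_sample", 80)]

result_1 = run_hurdles({"written_exam": 85, "interview": 2, "work_sample": 95}, hurdles)
result_2 = run_hurdles({"written_exam": 85, "interview": 4, "work_sample": 82}, hurdles)
print(result_1, result_2)
```

The first applicant's strong work-sample score never matters, because the interview hurdle was not cleared.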


Angoff Method, Known Groups Method

2 methods of Setting Cut Scores


Angoff Method

A way to set fixed cut scores that entails averaging the judgments of experts.
It determines how often a minimally qualified performer would answer a test item correctly.
- A panel of experts is chosen to review test items and estimate the probability that a minimally qualified performer would answer the item correctly.
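In a common formulation of the Angoff method, each expert's probability estimates are averaged per item and the item averages are summed to give the expected raw score of a minimally qualified performer. The sketch below assumes that formulation; the ratings are invented.

```python
from statistics import mean

def angoff_cut_score(ratings):
    """ratings[e][i]: expert e's estimated probability that a minimally
    qualified performer answers item i correctly."""
    item_means = [mean(item) for item in zip(*ratings)]  # average across experts
    return sum(item_means)                               # expected raw score

# Hypothetical panel: three experts rating a four-item test
ratings = [
    [0.9, 0.6, 0.5, 0.8],
    [0.8, 0.7, 0.4, 0.9],
    [1.0, 0.5, 0.6, 0.7],
]

cut = angoff_cut_score(ratings)
print(round(cut, 2))
```

For this panel the item averages are .9, .6, .5, and .8, so the cut score would be about 2.8 out of 4 items.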


Angoff Method

This simple technique has wide appeal and works well as long as the experts agree.
Its weakness appears when there is low inter-rater reliability and major disagreement about how certain populations of test-takers should respond to items.


Known Groups Method

Also known as Method of Contrasting Groups.
A method of collecting data on a predictor of interest from groups known to possess (and not to possess) a trait, attribute, or ability of interest.


Known Groups Method

Based on data analysis, a cut score is set on the test that best discriminates between the two groups’ test performance.
The main problem with this method is that the resulting cut score is inherently affected by the composition of the contrasting groups.
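One simple way to find the score that "best discriminates" is to try every observed score as a candidate and keep the one that classifies the most people correctly. This is only one possible criterion, chosen here as an assumption, and the group scores are hypothetical.

```python
def best_cut_score(has_trait, lacks_trait):
    """Return the candidate cut score with the most correct classifications,
    treating scores at or above the cut as 'has the trait'."""
    candidates = sorted(set(has_trait) | set(lacks_trait))

    def correct(cut):
        return (sum(s >= cut for s in has_trait) +
                sum(s < cut for s in lacks_trait))

    return max(candidates, key=correct)

# Hypothetical scores from two contrasting groups
has_trait   = [62, 68, 71, 75, 80, 84]
lacks_trait = [40, 48, 55, 59, 64, 66]

cut = best_cut_score(has_trait, lacks_trait)
print(cut)
```

With these groups a cut of 68 classifies 11 of 12 people correctly. Swapping in differently composed groups would move that number, which is the weakness the card describes.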


48 items

How many items are there in the Purdue Non-Language Test?


50 items; 4 subtests

How many items and subtests are there in the Culture Fair Intelligence Test?


40 items per subtest

How many items are there in each subtest of the Differential Aptitude Tests?


240 items

How many items are there in the NEO-PI Revised Test?