PSYCHOMET FINALS: Understanding Test Utility + Test Scoring

studied byStudied by 0 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 49

flashcard set

Earn XP

Description and Tags

hi

50 Terms

1

Cost efficiency, Time factor, How a valid and reliable test compares to another valid and reliable test, How useful is a test for diagnoses/treatment/classifying patients, How an admissions test can “trim down” # of applicants, Will it help that we add another test to our battery, Having a test VS not having a test

7 factors that must be considered in making decisions about using certain tests

New cards
2

Culture-Fair Test

Scores are unaffected by cultural differences.

New cards
3

Culture-Bound Test

Test that is specific to another language or culture.

New cards
4

Test Utility

  • The usefulness or practical value of testing to improve efficiency.

  • A test being both reliable and valid does not necessarily mean or ensure that it is useful.

New cards
5

MMPI

  • Primarily designed to assess clinical conditions (e.g., depression, schizophrenia, paranoia, psychopathic deviate, hypomania, etc.).

    • Not effective for evaluating non-clinical or normal populations.

    • Specifically useful for clinical diagnosis rather than general personality assessment.

New cards
6

567 items

How many items does the MMPI test have?

New cards
7

NEO-PI

  • Measures five major personality domains, often referred to as OCEAN.

    • In certain contexts, this test can be more useful than the MMPI, especially when assessing normal populations or when the focus is on general personality traits rather than clinical conditions.

New cards
8

240 items

How many items does the NEO-PI test have?

New cards
9

Psychometric Soundness, Costs, Benefits

3 factors affecting Test Utility

New cards
10

Psychometric Soundness

3 Factors affecting Test Utility

  • The reliability and validity of a test (i.e., reliability and validity coefficients are acceptably high).

    • Higher than .95 reliability = questionable

New cards
11

.95

If the reliability of a test is higher than _____, it is often questionable as it may indicate redundancy in test items (e.g., items like “I am happy,” “I am joyful,” “I am jovial” all reflects the same concept, which adds no value).

New cards
12

Index of Utility

The practical value of the information derived from scores on a test (considering both reliability and validity).

New cards
13

Convergent Validity

  • This is when scores from one test positively correlate with scores from another similar test. Excessively high ___________ can be concerning, as it suggests that the new test offers no additional insights or unique information, which reduces its utility.

New cards
14

valid

  • Normally, a ______ test is most likely going to be useful. But there are other factors that must be considered in determining a test’s utility.

New cards
15

Costs

3 Factors affecting Test Utility

  • Disadvantages, losses, or expenses in both economic and non-economic terms.

  • The usual meaning, of course, is economic.

New cards
16

Benefits

3 Factors affecting Test Utility

  • Profits, gains, or advantages derived from the use of a particular test.

  • While testing can have some cost to the company, the economic benefits can be tremendous, in terms of:

    • Increase in quantity and quality of worker performance;

    • Decrease in competency gaps (requiring training), accidents, and employee turnover.

New cards
17

Use of Expectancy Data, Use of Brogden-Cronbach-Gleser Formula, Decision Theory and Test Utility, Some Practical Considerations

4 ways to conduct Utility Analysis

New cards
18

Expectancy Table

  • Provides an indication of the likelihood that a test-taker will score within some interval of scores on a criterion measure — “passing,” “acceptable,” or “failing.”

    • Can provide information helpful to decision makers.

New cards
19

Taylor-Russell Table

Typically used to help decide if a test is worth using for hiring employees based on how well it predicts success.

New cards
20

Brogden-Cronbach-Gleser Formula

  • Used to calculate the dollar amount of a utility gain resulting from the use of a particular selection instrument under specified conditions.

New cards
21

Utility Gain

Refers to an estimate of the benefit (monetary or otherwise) of using a particular test or selection method. 

New cards
22

Decision Theory

  • Recommended to determine Test Utility by Cronbach and Gleser.

  • To illustrate this, we need to recall five terms: base rate, hit rate, miss rate, false positive, and false negative.

New cards
23

Base Rate

The extent to which a particular trait, behavior, characteristic, or attribute exists in the population (expressed as a proportion).

New cards
24

Hit Rate

The proportion of people a test accurately identifies as possessing or exhibiting a particular trait, behavior, characteristic, or attribute.

New cards
25

Miss Rate

The proportion of people the test fails to identify as having, or not having, a particular characteristic or attribute.

New cards
26

False Positive

A miss wherein the test predicted that the test-taker did possess the characteristic or attribute being measured when in fact the test-taker did not.

New cards
27

False Negative

A miss wherein the test predicted that the test-taker did not possess the characteristic or attribute being measured when the test-taker actually did.

New cards
28

perfect predictors

  • Tests are often “assumed” to be __________ of future performance.

    • That is, those who score above the cut-off score are expected to be successful on the job, and those who do not meet the cut-off score are predicted to be unsuccessful.

New cards
29

Decision Theory

  • ____________ provides guidelines for setting optimal cut-off scores.

    • In certain professions, like airline pilots and surgeons, having false negatives would be preferable than false positives, for obvious reasons.

New cards
30

The Pool of Job Applicants, The Complexity of the Job, The Cut-off Score Used

3 Practical Considerations in Test Utility

New cards
31

The Pool of Job Applicants

  • Utility estimates assume that there is a steady supply of viable applicants to occupy the positions at stake.

  • There are some professions with few qualified applicants (or would they accept, even if they are qualified).

New cards
32

The Complexity of the Job

There are disagreements among experts as to whether it is appropriate to use the same utility models to jobs of varying complexities (i.e., a highly complex job may have more stringent standards of successful performance).

New cards
33

The Cut-off Score Used

  • A (usually numerical) reference point derived as a result of a judgment and used to divide a set of data into two or more classifications, with some action to be taken or some inference to be made on the basis of these classifications.

  • Can be a Relative Cut Score or a Fixed Cut Score.

New cards
34

Relative Cut Score

  • A reference point—in a distribution of test scores used to divide a set of data into two or more classifications—that is set based on norm-related considerations rather than on the relationship of test scores to a criterion.

  • Because this type of _______ is set with reference to the performance of a group (or some target segment of a group), it is also referred to as a norm-referenced cut score.

New cards
35

Norm-referenced cut score

  • Other term for Relative Cut Score.

  • This is set with reference to the performance of a group (or some target segment of a group).

New cards
36

Fixed Cut Score

  • A reference point—in a distribution of test scores used to divide a set of data into two or more classifications—that is typically set with reference to a judgment concerning a minimum level of proficiency required to be included in a particular classification.

  • May also be referred to as absolute cut scores.

New cards
37

Absolute cut score

Other term for Fixed Cut Score.

New cards
38

Multiple Cut Scores

  • Refer to the use of two or more cut scores with reference to one predictor for the purpose of categorizing testtakers.

  • For example, different cut scores are set to be equivalent to ratings of A, B, C. D, etc.

New cards
39

Multiple Hurdles

The achievement of one cut-off score is necessary to proceed to the next stage in the evaluation process.

New cards
40

Angoff Method, Known Groups Method

2 methods of Setting Cut Scores

New cards
41

Angoff Method

  • A way to set fixed scores that entails averaging the judgments of experts.

  • It determines how often a minimally qualified performer would answer a test item correctly.

    • A panel of experts is chosen to review test items and estimate the probability that a minimally qualified performer would answer the item correctly.

New cards
42

Angoff Method

  • This simple technique has wide appeal, and works well—that is, as long as the experts agree.

  • There is low inter-rater reliability and major disagreements regarding how certain populations of testtakers should respond to items.

New cards
43

Known Groups Method

  • Also known as Method of Contrasting Groups.

  • A method of collecting data on a predictor of interest from groups known to possess (and not to possess) a trait, attribute, or ability of interest.

New cards
44

Known Groups Method

  • The main problem with using this method is that determining the cutoff score is inherently affected by the composition of the contrasting groups.

  • Based on data analysis, a cut score is set on the test that best discriminates the two groups’ test performance.

New cards
45

48 items

How many items are there in Purdue Non-Language Test?

New cards
46

50 items; 4 sub-sets

How many items and sub-sets are there in Culture Fair Intelligence Test?

New cards
47

40 items each subset

How many items are there in the Differential Aptitude Tests?

New cards
48

240 items

How many items are there in the NEO-PI Revised Test?

New cards
49

240 items

How many items are there in the Basic Personality Inventory tests?

New cards
50

240 items

How many items are there in the OASIS-3 Interest Inventory?

New cards
robot