PSY333: Measurement & Testing (some level of finals)

0.0(0)

Studied by 1 person

View linked note

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/130

Earn XP

Description and Tags

Psychology

Last updated 10:38 PM on 5/8/23

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

131 Terms

New cards

Cattell

"First used the term ""mental tests"""

New cards

Binet

Associated with the first modern-day intelligence test (measure higher mental processes)

New cards

Wundt

First psychological laboratory that used experimental research

New cards

Terman

First use of the term intelligence quotient (IQ); revised Binet

New cards

Thorndike

Associated with the Stanford Achievement Test

New cards

WWI

What was the era that first widely used group testing?

New cards

Army Alpha

Group administation of intelligence test for the military; reading literacy

New cards

Army Beta

Used as an intelligence test, but is the language-free version

New cards

Thorndike

Research on vocational assessments

New cards

Miner

Person involved in occupation selection for large groups of high school students

New cards

Strong

First \-- much more general career counseling for the future aptitude tests

New cards

Woodworth's Personal Data Sheet

First modern personality inventory (WWI); measured suspectibility to mental health problems

New cards

aptitude

measure whether or not you're ready for something

New cards

Step 1: Determine the goals of your client

"Defines purpose of test; demographics are considered; what context the test is on

New cards

Step 2: Choose instrument types to reach client goals

Asks the questions: What behaviors, content, skills is it intended to measure? What is the that the trait is based on? What about subsets/domains it is based on? Operationalization of test forms.

New cards

Step 3: Access information about possible instruments

Item formats are determined; test is written and item reviewers make sure it measures what is intended to measure

New cards

Step 4: Examine validity, reliability, cross-cultural, fairness, and practicality of the possible instruments

Before this is done, a pilot test is done to make sure the items are valid, reliable, and fair, among other items. , this happens.

New cards

pilot test

Validation process

New cards

Step 5: Choose an Instrument Wisely

Determines test length, testing time, scoring approaches, and test procedures, administers test materials.

New cards

Level A

Tests which can be administered, scored, and interpreted by laypeople

New cards

Level B

Tests that require a psychology degree or coursework in testing

New cards

Level C

Tests that require an advanced psychology degree, a license and/or advanced training for that particular test

New cards

cognitive sources of construct-irrelevant variance

Knowledge or skill not related to the purpose of the test is required to answer an item correctly.

New cards

affective sources of construct-irrelevant variance

Language or images causes strong emotions that may interfere with the ability to respond to an item correctly (i.e. political opinions, beliefs)

New cards

physical sources of construct-irrelevant variance

aspects of tests interfere with the test takers' ability to attend to, see, hear, or sense the items or stimuli (consider disabled people!)

New cards

correlation coefficient

statistical relationship between two variables

New cards

scatter plot

"Used to visually examine data, especially to discover patterns (such as curvilinear relationships)

$"Used to visually examine data, especially to discover patterns (such as curvilinear relationships)<br\><img src\=""scatter plot.png""\>"$

New cards

positive relationship

"an increase in one variable is related to an increase in the other variable

$"an increase in one variable is related to an increase in the other variable<br\><img src\=""pos relationship.png""\>"$

New cards

negative relationship

"an increase in one variable is related to a decrease in the other variable

$"an increase in one variable is related to a decrease in the other variable<br\><img src\=""neg relationship.png""\>"$

New cards

no relationship

"two variables that are not related to each other

$"two variables that are not related to each other<br\><img src\=""no relationship.png""\>"$

New cards

strong correlation

New cards

moderate strength

±0.30 ~ 0.69"

New cards

no strength

±.00 ~ 0.29"

New cards

scores from a test is a consistent measure of individuals’ true scores

reliability

New cards

correlation coefficient

To measure reliability, we use

New cards

method error

caused by test administrators or the testing environment

New cards

trait error

Error associated with test takers, subjects themselves

New cards

test-retest reliability

Relationship between scores on the same test administered twice with a time interval between the administration

New cards

practice effects

e.g., subjects may get better at second testing, subjects knowing how they answered in a similar test form

New cards

alternate-forms reliability

Coefficients of two equivalent tests are compared (time interval)

New cards

internal consistency

obtaining a reliability coefficient by assessing how items are correlated as a group

New cards

split-half reliability

internal consistency; correlation between scores from even-numbered items and scores from odd-numbered items

New cards

validity

whether a test measures what it is supposed to measure (

New cards

content validity

Does the \______ \______ cover a representative sample of behaviors to be measured in its entirety? Content experts

New cards

criterion validity

Does a test predict the target trait it is intended to measure?

New cards

concurrent validity

Focuses on the prediction of current performance or psychological behavior

New cards

predictive validity

Focuses on the prediction of future performance or psychological behavior

New cards

construct validity

Does an assessment measure a theoretical construct that it is designed to measure (e.g., intelligence)?

New cards

convergent validity

Are two assessments measuring the construct related?

New cards

discriminant validity

Are two asssessments measuring different constructs ?

New cards

factor analysis

Found construct you want to measure from the test scores

New cards

fairness

whether an individual's score is not affected by potential bias inherent in a test, test procedure and interpretation

New cards

the 1960s (civil rights movement)

Fairness did not get much attention until

New cards

fairness in testing process

Equal testing condition + proctors

New cards

fairness as lack of measurement bias

Idea that all items should behave equally across all examinees

New cards

fairness in access to the construct as measured

accessibility in testing; showing their status on target without being advantaged or disadvantaged by their individual characteristics or opportunity to learn

New cards

differential item functioning

Statistical approach to examine test fairness by identifying items that perform differentially across subgroups of test takers while controlling for test takers' ability

New cards

cognitive interview

examining response processes through probing questions

New cards

achievement testing

tests that measure what one has learned

New cards

aptitude testing

measure what one is capable of learning

New cards

personality assessment

used to assess habits, temperament, likes and dislikes, character, and similar behaviors

New cards

diagnostic tests

tests that assess problem areas of learning; often used to assess learning disabilities

New cards

cognitive ability tests

tests that measure a broad range of cognitive ability

New cards

intellectual and cognitive functioning

tests that measure a broad range of cognitive functioning in general intelligence, intellectual disabilities, giftedness, changes in overall cognitive functioning

New cards

special aptitude tests

tests that measure one aspect of ability; likelihood of success in a vocation

New cards

multiple aptitude tests

tests that measure many aspects of ability; likelihood of success in multiple vocations

New cards

interest inventories

tests that measure likes and dislikes as well as one's personality orientation toward the world of work; career counseling

New cards

classification methods

a tool whereby an individual identifies whether he or she has, or does not have, specific attributes or characteristics

New cards

readiness tests

tests that measure one's readiness for moving ahead in school. used to assess readiness to enter first grade

New cards

mental age/chronological age x 100

How do you calculate IQ (use / as a division sign)?

New cards

Spearman-Brown formula

What formula is used for split-half reliability due to the test being cut in half?

New cards

$"bar graph<br\><img src\=""Screenshot 2023-02-08 114332.png""\>"$

"bar graph

"visual for a categorical, discrete variable"

New cards

$"histogram<br\><img src\=""histogram.png""\>"$

"histogram

visual for continuous variables

New cards

$"frequency polygon<br\><img src\=""freq_poly.jpg""\>"$

"frequency polygon

used to see the distributional shape of data

New cards

positively skewed

"(Type of curve)

$"(Type of curve)<br\><img src\=""paste-2fe6b0bb553a7741fc5cf57bc207a669dd093661.jpg""\>"$

New cards

negatively skewed

"(Type of curve)

$"(Type of curve)<br\><img src\=""negative skewed.png""\>"$

New cards

Mode < Median < Mean

"Left to right, how are measures of central tendency distributed in positively skewed distributions?

$"Left to right, how are measures of central tendency distributed in positively skewed distributions?<br\><img src\=""positively skewed.png""\>"$

New cards

Mode > Median > Mean

"Left to right, how are measures of central tendency distributed in negatively skewed distributions?

$"Left to right, how are measures of central tendency distributed in negatively skewed distributions?<br\><img src\=""negative skewed.png""\>"$

New cards

variance

avg of squared distance from the mean

New cards

deviation score

the difference between an individual score and the mean

New cards

norm referenced

scores that are compared to a set of test scores called the norm group

New cards

criterion-referenced scores

scores are compared to a predetermined standard; i.e. mastering a certain level of knowledge, used for diagnoses

New cards

percentile

proportion of people falling at and below a score in a standard normal distribution

New cards

T-scores

µ \= 50, σ \= 10; used for personality tests

New cards

deviation IQ

µ \= 100, σ \= 15; used for tests of intelligence

New cards

Stanines

µ \= 5, σ \= 2, round to nearest whole number; used for achievement testing

New cards

Sten scores

µ \= 5.5, σ \= 2, round to nearest whole number; used for personality inventories and questionnaires

New cards

NCE scores

µ \= 50, σ \= 21.06; used for educational tests

New cards

SAT scores

µ \= 500, σ \= 100

New cards

ACT scores

µ \= 21, σ \= 5

New cards

Publisher type scores

µ and σ are artbitrarily set by publisher

New cards

SEM

σ of test scores x √1 - reliability of a test

New cards

standard error of measurement

Tells us how much error there is in the test and ultimately how much any individual's score might fluctuate due to this error

New cards

comprehension

problems with the \_____ of questions

New cards

information retrieval

failure in the information retrieving to answer (related to background characteristics)

New cards

decision process

low motivation/intention of faking or impression enhancement

New cards

response process

mismatch in the choice of response option; difference in interpretation of option meanings

New cards

interquartile range formula

$\$

New cards

Deviation score

X (raw score) - M (mean score)

100

New cards

Variance

Deviation score squared

Explore top notes

Chapter 13: Rise of Manufacturing and the Age of Jackson (1820–1845)

Updated 1071d ago

0.0(0)

AP Calculus AB/BC Formula Sheet

Updated 378d ago

0.0(0)

AP World History - Unit 7: Global Conflict

Updated 63d ago

0.0(0)

Verpleegkundige visies

Updated 418d ago

0.0(0)

Timeline National Jan-June 2022

Updated 1076d ago

0.0(0)

Chapter 6: An Overview of Literary Movements

Updated 1065d ago

0.0(0)

Key Stuff - All Ideologies

Updated 1001d ago

0.0(0)

🌱 AP Environmental Science Unit 5 Notes

Updated 314d ago

0.0(0)

Chapter 13: Rise of Manufacturing and the Age of Jackson (1820–1845)

Updated 1071d ago

0.0(0)

AP Calculus AB/BC Formula Sheet

Updated 378d ago

0.0(0)

AP World History - Unit 7: Global Conflict

Updated 63d ago

0.0(0)

Verpleegkundige visies

Updated 418d ago

0.0(0)

Timeline National Jan-June 2022

Updated 1076d ago

0.0(0)

Chapter 6: An Overview of Literary Movements

Updated 1065d ago

0.0(0)

Key Stuff - All Ideologies

Updated 1001d ago

0.0(0)

🌱 AP Environmental Science Unit 5 Notes

Updated 314d ago

0.0(0)

Explore top flashcards

Cells, diffusion, osmosis

31Updated 729d ago

0.0(0)

Human Rights Final

74Updated 1040d ago

0.0(0)

ap world history unit 3

76Updated 1216d ago

0.0(0)

Woordenschat M11

36Updated 1021d ago

0.0(0)

bioe 3

166Updated 1210d ago

0.0(0)

practical: urinary system

57Updated 300d ago

0.0(0)

Exploring the Bible Exam 3

66Updated 845d ago

0.0(0)

HL Biological Approach to Understanding Behavior

73Updated 615d ago

0.0(0)

Cells, diffusion, osmosis

31Updated 729d ago

0.0(0)

Human Rights Final

74Updated 1040d ago

0.0(0)

ap world history unit 3

76Updated 1216d ago

0.0(0)

Woordenschat M11

36Updated 1021d ago

0.0(0)

bioe 3

166Updated 1210d ago

0.0(0)

practical: urinary system

57Updated 300d ago

0.0(0)

Exploring the Bible Exam 3

66Updated 845d ago

0.0(0)

HL Biological Approach to Understanding Behavior

73Updated 615d ago

0.0(0)