explain some of the major laws that govern the use of tests and measurement in the United States
compare the motivations and consequences of various lawsuits related to testing
contrast the motivations and consequences of various lawsuits related to testing
list several important ethical rules of conduct for measurement professionals
Individuals with Disabilities Education Act (IDEA, 1997)
all children are entitled to a free and appropriate public education
testing used primarily to help place children in correct programs and measure progress
each child given an Individual Education Program (IEP)
children with disabilities educated in the least restrictive environment
both students and parents are involved in decision-making
mechanisms needed to ensure the above five principles
Truth in Testing Law (1979)
passed after an investigation of the Educational Testing Service by the New York Public Interest Research Group
requires testing companies to:
disclose all validity studies of a test
fully disclose the meaning of scores and how they are calculated
provide copies of test questions, correct answers, and the student’s answers if the student requests them
No Child Left Behind (NCLB, 2002)
assure that all children meet or exceed their state’s level of academic achievement
issues:
standardized achievement tests used to measure “school performance”
unrealistic standards
everyone must be tested at grade-level
expensive with limited funding
only applied to public schools
Every Student Succeeds Act (ESSA, 2015)
revised NCLB
relaxes requirements about testing every student in a school
extra funding and interventions for high schools with more needs
states given more control over how to help students and schools in need
issues:
still using standardized achievement testing to evaluate school performance
still only applied to public schools
still limited in funding
Family Educational Rights and Privacy Act (FERPA)
parents/students can inspect student’s education records, but schools are not required to provide copies
parents/students can request records be corrected
schools need written permission from parents/students to release any information
exceptions:
school officials
destination schools after transfer
specified officials for audit/evaluation
etc.
What was Hobson v. Hansen (1967) about?
standardized tests used to place students in different learning tracks
African American children were disproportionately placed into lower (“basic”) tracks while white children were placed into higher tracks
What was the consequence in Hobson v. Hansen (1967)?
grouping would be permissible if based on innate ability
problem: tests used were influenced by cultural experiences
What was Diana v. State Board of Education about?
intelligence tests used to place students in EMR tracks
problematic for bilingual children
tests were standardized for only white children
What was the consequence in Diana v. State Board of Education?
further research revealed that bilingual children receive higher IQ scores if tested in their primary language
What was Larry P. v. Wilson Riles (1979) about?
1/6th of African American elementary-school children tracked to EMR classes based on IQ scores
side 1 argued:
retesting done by African American psychologists yielded higher IQ scores, EMR placement was detrimental long-term
side 2 argued:
IQ scores were valid and unbiased
retesting was not standardized
What was the consequence in Larry P. v. Wilson Riles (1979)?
practice of IQ tests for EMR tracking ended
mixed feelings about outcome
What was Parents in Action on Special Education v. Hannon (1980) about?
racial bias only found for a subset of items on the WISC, WISC-R, and Stanford-Binet IQ tests
What was the consequence in Parents in Action on Special Education v. Hannon (1980)?
racial bias findings didn’t justify removal of tests
conflicted with Larry P. v. Wilson Riles
What was Griggs v. Duke Power Company about?
raised concerns about segregation in the workplace
company claimed education was needed for advancement and created a test
nobody passed this test
concerns about validity of the test
What was the consequence in Griggs v. Duke Power Company?
employment test results must be valid and reliable
What was Watson v. Fort Worth Bank and Trust about?
underrepresentation of African American personnel
passed over for promotion multiple times
What was the consequence in Watson v. Fort Worth Bank and Trust?
lower courts held that statistical evidence of bias applied only to standardized tests
Supreme Court disagreed
key ethical principles
no physical, emotional, or psychological harm
consent is important
reasonable and appropriate incentives
responses are made anonymous
confidentiality must be ensured
careful reporting of information
use of appropriate assessment techniques
test scores must be sufficiently valid and reliable
tests should have a purpose
define test fairness as described by psychometricians
define test bias as described by psychometricians
compare test fairness and bias
contrast test fairness and bias
describe how threats to test fairness weaken validity arguments
describe the ways test developers ensure test fairness
principles for making assessments using universal design
describe various ways of detecting bias and their limitations
test fairness
validity issue combining morality, philosophy, and legality
What are views on test fairness?
equitable treatment during testing
accessibility to the measured constructs
validity of individual test score interpretations for intended uses
lack of measurement bias
What is equitable treatment composed of?
standardization and consistency of administration
qualified test administrators
flexibility
What is accessibility composed of?
respondents can accurately record their responses
congruence of construct intended to be measured
congruence of constructs needed to respond to the measure
What is validity of individual test scores composed of?
heterogeneity within groups
group level accommodations or modifications are not always appropriate
What are the kinds of threats to fairness?
content
context
response process
lack of opportunity to learn
What is content? How does it threaten fairness?
problems with words or vocabulary inside an item
terms may be more likely known by another group
offensive language
representativeness within a question
What is context? How does it threaten fairness?
problems surrounding a test or measurement
stereotype threat
unclear instruction
advanced or unfamiliar language
differential treatment
What is response process? How does it threaten fairness?
problems with the processes used to read an item, interpret it, and respond
occur between test-taker and item
faking good
misinterpreted communication
lack of accessibility
What is lack of opportunity to learn? How does it threaten fairness?
test takers are assessed on content they had no opportunity to learn
universal design
assessment development approach that maximizes accessibility of the test for all of the intended takers
begins by defining constructs precisely with clear differentiation from construct-irrelevant variance
What are best practices for content and wording?
test takers share the same experience
appropriate complexity of sentences and vocabulary
shorter sentences
What are best practices for formatting?
text formatting
typefaces
white space
contrast
illustrations
accommodation
changes made to a test to improve accessibility
doesn’t affect the measured construct
modifications
changes made to a test to improve accessibility
does affect the measured construct
When are accommodations necessary?
not appropriate if the affected ability is directly relevant to the construct being measured
not appropriate for an assessment if the purpose of the test is to assess the presence and degree of the disability
not necessary for all students with disabilities
test bias
systematic difference in scores between groups due to some factor unrelated to the measured construct
empirical observation
What are methods for detecting test bias in total scores?
difference-difference bias
Cleary model
What are methods for detecting item-specific test bias?
content examination
differential item functioning (DIF)
difference-difference bias
bias evidenced by differences in scores among groups
Cleary model
test scores are unbiased if equivalent scores from different groups equally predict some criterion
linear regression with interaction effect
problems with the Cleary model
assumes all relevant predictors or covariates are included
assumes an unbiased criterion
interaction/model tests are often underpowered
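The Cleary check above amounts to ordinary least squares with a group main effect and a score-by-group interaction: a nonzero group coefficient suggests intercept bias, a nonzero interaction suggests slope bias. A minimal sketch with simulated data (sample size, coefficients, and group labels are all illustrative, not from the source):

```python
import numpy as np

# Simulated data: test scores predicting a criterion in two groups.
rng = np.random.default_rng(0)
n = 200
group = np.repeat([0, 1], n // 2)            # 0 = reference, 1 = focal
score = rng.normal(50, 10, n)
# criterion generated with the SAME slope/intercept for both groups (no bias)
criterion = 1.0 + 0.05 * score + rng.normal(0, 0.3, n)

# Design matrix: intercept, score, group, score x group interaction
X = np.column_stack([np.ones(n), score, group, score * group])
beta, *_ = np.linalg.lstsq(X, criterion, rcond=None)

# beta[2] (group) and beta[3] (interaction) should be near zero here,
# since the data were generated with one common regression line.
print(np.round(beta, 3))
```

In a real analysis the coefficients would be tested for significance, which is exactly where the underpowered-test caveat above bites.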
content examination
review items for obvious cultural, racial, or gender related bias
differential item functioning (DIF)
occurs when an item behaves differently among groups
respondents from different groups are matched on total score
Mantel-Haenszel Test
simplest approach for detecting DIF
procedure:
group respondents into score groups
create a contingency table, for each score range group, of incorrect/correct responses and comparison group membership
calculate expected counts and variances of counts within each score range group
use all information to calculate chi-square statistic
limitation: the choice of score ranges is arbitrary
advantage: can be used with smaller samples
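The procedure above can be sketched in plain Python; the 2x2 tables are made-up counts for two score strata, with rows = reference/focal group and columns = correct/incorrect:

```python
def mantel_haenszel_chi2(strata):
    """Continuity-corrected Mantel-Haenszel chi-square (1 df) for one item.

    strata: list of 2x2 tables, one per matched score-range group, each as
    ((ref_correct, ref_incorrect), (focal_correct, focal_incorrect)).
    """
    total_a = 0.0      # observed correct responses in the reference group
    total_ea = 0.0     # expected correct responses under no DIF
    total_var = 0.0    # variance of that count within each stratum
    for (a, b), (c, d) in strata:
        t = a + b + c + d              # stratum size
        if t < 2:
            continue                   # a one-person stratum carries no information
        n_ref, m1, m0 = a + b, a + c, b + d
        total_a += a
        total_ea += n_ref * m1 / t
        total_var += n_ref * (c + d) * m1 * m0 / (t * t * (t - 1))
    return (abs(total_a - total_ea) - 0.5) ** 2 / total_var

# two score strata; the reference group answers correctly more often in each
tables = [((30, 10), (20, 20)), ((45, 5), (35, 15))]
print(round(mantel_haenszel_chi2(tables), 2))  # → 10.27
```

A large chi-square (compared against a 1-df chi-square distribution) flags the item as functioning differently across groups at matched ability levels.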
parent model
allow item parameters to differ between groups
nested models
constrain a single item’s parameters to be equal across groups
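The parent/nested comparison above is typically carried out as a likelihood-ratio test. A minimal sketch, using hypothetical log-likelihood values in place of an actual IRT fitting routine (the numbers and the 2-parameter assumption are illustrative only):

```python
from scipy.stats import chi2

# Hypothetical log-likelihoods standing in for the output of an IRT fit.
ll_parent = -1520.4   # parent model: studied item's parameters free per group
ll_nested = -1524.9   # nested model: studied item's parameters constrained equal

lr = 2 * (ll_parent - ll_nested)   # likelihood-ratio statistic
df = 2                             # number of constrained parameters (e.g., a and b)
p = chi2.sf(lr, df)
print(round(lr, 1), round(p, 4))   # → 9.0 0.0111
```

A small p-value means constraining the item's parameters to be equal across groups significantly worsens fit, i.e., the item shows DIF.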
bias and fairness
evidence of test bias does not mean a test is unfair
DIF may be detected, but might not cause impactful differences in scores
understand how to create table of specifications for item development
understand how to use a table of specifications for item development
describe different item formats
what kind of tests are different item formats suited for
describe and write good achievement test items
be familiar with Bloom’s taxonomy
alternative ways of defining the cognitive demands of items
describe and write good survey items
How is a test made?
choose and define constructs
determine the best method to use
develop possible questions or items
What are types of tests or measurements?
achievement
aptitude
ability or intelligence
personality
neuropsychology
career interests
achievement test
assess an individual’s level of knowledge in a particular domain
aptitude test
measure an individual’s potential to succeed in an activity requiring a particular skill or set of skills and can predict future outcomes
ability or intelligence test
assess one’s level of skill or competence in a wide variety of areas
personality test
assess an individual’s unique and stable set of characteristics, traits, or attitudes
neuropsychological test
assess the functioning of the brain as it relates to everyday behaviors, including emotions and thinking
vocational or career test
assess an individual’s interests and help classify those interests as they relate to particular jobs and careers
content/table of specifications
define the construct or content domain you are measuring in excruciating detail
What must be defined in clinical/psychological assessments?
define the construct and describe the associated observable behaviors
What must be defined in organizational tests?
define the knowledge and skills needed to do a job successfully
What must be defined in educational assessments?
describe the curriculum to be assessed
What are selected-response formats?
Likert format
category format
multiple-choice
What are constructed-response formats?
essay questions
interview questions
performance assessment
Likert format
people presented with a statement and asked to use a rating scale to respond according to the anchor
anchor
labels for different positions on the Likert scale
category format
rating scale between two endpoint values (e.g., 1 to 10)
How might category format lead to reliability and validity issues?
too many categories: respondents cannot reliably distinguish adjacent options
guidelines for survey items
every item is important and requires a response
item should apply to all respondents unless filter questions are used to exclude a participant
avoid double-barreled items
item should be technically accurate
item should be a complete question or sentence with a simple structure
use as few words as possible in each item stem and options
use simple, familiar words
use specific, concrete words to specify concepts clearly
avoid negatively worded items and options with connotatively inconsistent wording
avoid leading or loaded items that suggest an appropriate response
guidelines for ordinal scales
balance the item stem
choose an appropriate rating scale length
avoid the middle or neutral category
provide balanced scales where categories are relatively equal distance apart conceptually
verbally label all response categories
align response options in one column (single item) or horizontally on one row (multiple items)
response categories should be exhaustive, including all plausible responses
response categories should be mutually exclusive
response categories should approximate the actual distribution of the characteristic in the population
guidelines for nominal scales
avoid the “other” option
use forced-choice items instead of check-all-that-apply items
multiple choice
stem
options
correct: correct answer or key
incorrect: distractors, foils, or misleads
Bloom’s taxonomy
used to gauge the cognitive demand of test items
What are parts of the cognitive dimension of Bloom’s taxonomy?
remember
understand
apply
analyze
evaluate
create
cognitive demands
poor reliability in labeling questions using Bloom’s taxonomy
revised taxonomy:
recall
comprehend
use (or apply)
guidelines for achievement tests
content concerns
style concerns
writing the stem
writing the options
content concerns
base each item on one type of content and cognitive demand
use new material to elicit higher-level thinking
keep the content of items independent of one another
avoid opinions unless qualified
style concerns
edit and proof items
keep linguistic complexity appropriate to the group being tested
minimize the amount of reading in each item
writing the stem
state the central idea clearly and concisely in the stem and not in the options
word the stem positively, avoid negative phrasing
writing the options
use only options that are plausible and discriminating
make sure that only one of these options is the right answer
place options in logical or numerical order
keep options independent
avoid using options none-of-the-above, all-of-the-above, or I don’t know
word options positively, avoiding negative words such as NOT
avoid giving clues to the right answer
make all distractors plausible
avoid the use of humor