1/95
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced |
---|
No study sessions yet.
trait
A relatively enduring, distinguishable way in which one individual varies from another; not always manifested 100% of the time.
state
A temporary way in which a person differs from others; less enduring than traits but also observable through behavior.
Traits are inferred from a sample of behavior, obtained through direct observation, self-reports, or test responses.
How are traits inferred in psychological assessment?
construct
an unobservable psychological quality, like a trait, that we infer from patterns of overt behavior.
overt behavior
Observable actions or the results of actions, including responses during tests or assessments.
The stronger the trait, the more likely it will be expressed in behavior—but it’s still influenced by the situation.
How does trait strength affect behavior?
A violent parolee may act calm with a parole officer but aggressive at home. The situation moderates behavior expression.
Give an example of trait manifestation influenced by context.
Context helps us select appropriate descriptors for behavior. For example, praying aloud in a church = religious; in a movie theater = possibly deviant.
Why is context important in interpreting behavior?
Carefully defining the specific trait or state using an operational definition provided by the test developer.
It clarifies what exactly is being measured (e.g., defining aggression as acts like pushing, hitting, or kicking).
Why is an operational definition important in psychological testing?
A universe of behaviors presumed to reflect the trait helps developers choose items that best assess it.
What guides the selection of test items to measure a construct?
It could include items like “Who killed Lapu-Lapu?
How might "intelligence in Filipino adults" be operationalized in a test, if Philippine history is seen as a component of intelligence.?
Technical considerations, construct definition, and the value society/test developers place on those behaviors.
What factors influence how much weight is given to certain test items?
The more responses aligned with the trait (as defined by the test manual), the higher the presumed trait level.
What does cumulative scoring mean in psychological testing?
The strength or level of the targeted trait, state, or ability being measured.
What does a psychological test score typically represent?
Because it determines how test responses translate into meaningful trait or ability levels.
Why is scoring and interpretation crucial in testing traits and states?
Test-related behavior can predict non-test-related behavior.
What does Assumption 3 of psychological testing state?
They mimic the actual behaviors the test aims to understand or predict.
How do some test tasks relate to real-life behavior?
To make predictions about future behavior outside the test setting.
What is the purpose of analyzing test-related behavior?
Using test results to understand behavior that has already happened.
What does "postdict" mean in psychological testing?
In forensic contexts, such as evaluating a defendant’s mental state during a crime.
In what setting is postdiction commonly used?
Yes, under certain conditions, test behavior can shed light on past mental states.
Can behavior samples provide insight into past states of mind?
Tests and other measurement techniques have strength and weaknesses
What does Assumption 4 of psychological testing emphasize?
They should recognize the limitations of the tests they use.
What should competent test users understand about tests?
By supplementing test results with data from other sources.
How can test limitations be addressed?
It ensures responsible interpretation and avoids over-reliance on a single test.
Why is awareness of test limitations important?
Factors other than what a test intends to measure that influence test performance.
What is "error" in psychological testing?
The portion of a test score attributed to influences other than the trait or ability being measured.
What is "error variance"?
The assessee taking the test while sick, like having the flu, which may affect performance.
Give an example of an assessee-related source of error variance.
Some tests are more accurate or valid than others in measuring the intended construct.
How can measuring instruments contribute to error variance?
To improve test reliability and ensure valid interpretation of scores.
Why is it important to consider sources of error in assessment?
Testing and Assessment can be conducted in a fair and unbiased manner
What does Assumption 6 state about testing and assessment?
By designing tests that are fair when used according to the test manual’s guidelines.
How do major test publishers ensure fairness?
Misuse of a test on individuals with different backgrounds than those the test was designed for.
What is a common source of fairness-related problems in testing?
To ensure the test is appropriate for the population being assessed
What is a key responsibility of the test user regarding fairness?
They play a vital role in supporting fair, informed, and socially beneficial decision-making.
What does Assumption 7 emphasize about testing and assessment?
Without testing, anyone could claim to be a surgeon or pilot without proof of competence.
Give an example of why testing is important in professional settings.
It prevents hiring based solely on nepotism and promotes selection based on merit and qualifications.
How does testing help ensure fair hiring practices?
It helps prevent arbitrary placement of students in programs and ensures decisions are evidence-based.
How does testing promote fairness in education?
Reliability, Validity, and Practical Utility.
What are the 3 main characteristics of a good test?
The consistency or precision of a test; it gives the same result when measuring the same thing under the same conditions.
What is reliability in a psychological test?
The extent to which a test measures what it claims to measure.
What is validity in a test?
Whether the items adequately sample the range of the construct being measured.
What aspects of test items affect validity?
What test scores really tell us about the trait and how scores relate to behavior.
What interpretive concerns relate to test validity?
Do test items cover the whole construct? What does a high score imply about real-world behavior?
What are examples of questions to assess test validity?
It can be easily administered, scored, and interpreted by trained professionals.
What makes a test practical for use?
A good test provides actionable, beneficial results for individuals or society.
Why is usefulness important in a good test?
Test performance data from a specific group used as a reference for interpreting individual scores.
What are norms in psychological testing?
The representative group whose test performance is analyzed to create norms for score interpretation.
What is a normative sample?
Its members should be typical of the population the test is intended for, based on relevant characteristics.
What makes a normative sample appropriate?
A distribution of scores that serve as the test’s norms for evaluating future testtakers.
What does the normative sample provide after test administration?
The process of deriving norms from a representative sample’s test performance.
What is norming?
Evaluating a test score by comparing it to the scores of a defined group of testtakers.
What is norm-referenced testing?
To determine a testtaker’s standing or ranking relative to others in the comparison group.
What is the main goal of norm-referenced testing?
The process of administering a test to a representative sample to establish norms.
What is standardization in testing?
The full group of individuals who share at least one observable characteristic for whom the test is designed.
What is the target population in test development?
A portion of the population that is selected to represent the whole.
What is a sample?
Selecting a group from the population that is representative of the intended testtakers.
What is sampling in test development?
Sampling method that ensures subgroups within a population are proportionately represented to avoid bias.
What is stratified sampling?
Arbitrary selection of participants believed to represent the population.
What is purposive sampling?
A sample made up of people who are easiest to access or most available.
What is an incidental or convenience sample?
By administering the test under consistent instructions and conditions, then analyzing the data using descriptive statistics.
How are norms developed for a standardized test?
To ensure comparability between the scores of the normative sample and future testtakers.
Why is it important to follow standard test instructions during norming?
Yes. New norms can be created using new samples, especially to include underrepresented groups, but the test remains standardized based on the original sample.
Can new norms be developed after standardization?
percentile, developmental, national, national anchor, subgroup, local
types of norms
age, grade
types of developmental norms
It indicates the percentage of testtakers who scored below a specific raw score (e.g., 90th percentile = scored higher than or equal to 90% of others).
What is a percentile in testing?
The percentage of testtakers who scored lower than a given raw score.
Define percentile rank.
The testtaker scored higher than or equal to 90% of others; only 10% scored higher.
What does a 90th percentile mean?
Norms based on traits or abilities that change with age, grade, or life stage.
What are developmental norms?
Average scores based on age (e.g., height or reading ability by age group).
What are age norms?
Average performance of students by school grade, based on large, representative samples.
What are grade norms?
Not useful for out-of-school children or adults returning to school.
Limitation of grade norms?
Norms based on a sample representative of the national population at the time of norming.
What are national norms?
Age, gender, ethnicity, socioeconomic status, geography, and community type.
What do national norm samples consider?
Norms used to equate scores between different tests using the same sample and percentile ranks.
What are national anchor norms?
The equipercentile method, comparing corresponding percentile scores across tests.
What method is used to establish national anchor norms?
Equating a score of 69 on the BRT to a score of 14 on the XYZ test if both fall in the 96th percentile.
Example of national anchor norms usage?
Norms for specific segments of the population (e.g., based on age, gender, or location).
What are subgroup norms?
Norms based on performance data from a local population or specific institution.
What are local norms?
They provide more relevant comparisons for individuals within a specific local context (e.g., a school or district).
Why are local norms useful?
A system where future scores are interpreted based on a distribution from a previously tested group.
What is a Fixed Reference Group Scoring System?
To ensure consistent score interpretation across future test administrations.
Why use a fixed reference group?
Interpreting an individual's score in relation to the scores of others who took the same test.
What is norm-referenced testing?
How a testtaker performs compared to others.
Main focus of norm-referenced evaluation?
A score in the 80th percentile means better performance than 80% of the group.
Example of a norm-referenced outcome?
Scoring is interpreted by comparing to a fixed standard or cut-off, not other testtakers.
What is criterion-referenced testing?
Passing a licensure exam by achieving the state's required score.
Example of criterion-referenced use?
What the testtaker can or cannot do based on preset criteria.
Focus of criterion-referenced interpretation?
No, only the individual’s performance against the defined standard.
Does criterion-referenced testing consider other testtakers’ scores?
Yes, it’s ideal for ranking individuals in a group.
Is norm-referenced testing good for ranking?
Yes, it determines if a testtaker meets specific performance standards.
Is criterion-referenced testing good for qualification decisions?
Norm-referenced testing.
Which system uses percentiles?
Criterion-referenced testing.
Which system uses cut scores or benchmarks?
Criterion-referenced — measures skill or knowledge level.
What type of interpretation is best for certification exams?
Norm-referenced — shows standing relative to peers.
What type of interpretation is best for standardized academic comparisons?