Stanford - Binet (SB5)
what does this test measure?
intelligence
theoretical background?
first version started in 1900 by Goddard, revised and popularized by professor Terman
Termand introduced the IQ (1916)
target group?
children and adolescents with intellectual disabilities or “gifted” children
strengths and weaknesses?
+ uses routing procedure to assess general cognitive abilities before proceeding (saves time without losing precision)
+ high reliability
+ supported validity
+ new version is good for high and low scorers, the extreme
subscales?
each subcomponent is verbal and nonverbal
fluid reasoning, quantitative reasoning, knowledge, visual spatial processing, working memory
10 subtests, 10 subtest scores
verbal, non verbal, and full IQ scores attained
Army Alpha
what does this test measure?
intelligence
theoretical background?
devised by Otis and Yerkes as group intelligence tests for army soldiers in WWI
influential for subsequent intelligence testing
target group?
army recruits
English speaking
varying IQ levels
strengths and weaknesses?
+ first group IQ test
+ lots of practice material for future tests
+ influenced many tests
- low validity
- unclear instructions
- bad testing environment
subscales?
following oral directions
arithmetical reasoning
practical judgement
synonym antonym pairs
disarranged sentences
number series completion
analogies
information
Army Beta
what does this test measure?
intelligence
theoretical background?
devised by Otis and Yerkes as group intelligence tests for army soldiers in WWI
influential for subsequent intelligence testing
target group?
army recruits
non-English speaking
illiterate
strengths and weaknesses?
+ not much verbal explanation needed
+ influenced many tests afterwards
+ higher test taking accessibility
- unclear instructions
- low validity
- bad testing environment
subscales?
visual/perceptual
motor tests
WAIS Wechsler Intelligence Scale
what does this test measure?
intelligence of adults
measures mental age (attained IQ/IQ expected for age = mental age IQ) M=100, SD=15
theoretical background?
invented by Wechsler, inspired by Binet and the Army Alpha
used to be a combination of already available tests
comparison sample of this version is normal, healthy adults
target group?
adults
strengths and weaknesses?
+ exceptional reliability and good validity
+ reduced emphasis on speed, higher emphasis on accuracy
+ IQ score is age dependent
+ added performance items to balance verbal items
- low reliability for subscales
subscales?
verbal comprehension
information
vocabulary
similarities
perceptual reasoning
matrix reasoning
block design
visual puzzles
working memory
digit span
arithmetic
processing speed
symbol search
coding
picture completion
WISC Wechsler Intelligence Scales
what does this test measure?
intelligence of adults
measures mental age (attained IQ/IQ expected for age = mental age IQ) M=100, SD=15
theoretical background?
invented by Wechsler, inspired by Binet and the Army Alpha
used to be a combination of already available tests
comparison sample of this version is to 5.7% of children with problems like learning difficulties, head injuries, or autism
target group?
children
strengths and weaknesses?
+ strong reliability
+ strong validity
subscales:
Verbal comprehension
vocabulary
similarities
perceptual reasoning
matrix reasoning
block design
working memory
processing speed
symbol search
coding
extra = picture concepts, letter-number sequencing
13-15 subtests, M = 10, `SD = 3
Bayley - III Test
what does this test measure?
evaluation of developmental delay
intelligence test
theoretical background?
target group?
children from 1 → 42 mos. (3.5 years)
strengths and weaknesses?
- does not produce standard scores like and IQ
+ high technical quality
+ excellent standardization
+ amazing psychometric properties
+ high internal consistency and supported validity
subscales?
5:
cognitive scale
language scale (expressive + receptive)
motor scales (socio-emotional scales)
adaptive behaviors scales
WPPSI Test Wechsler Preschool and Primary Scale of Intelligence
what does this test measure?
intelligence level in children
theoretical background?
target group?
children aged 2 yrs. 6 mos. → 7 yrs. 7 mos.
strengths and weaknesses?
+ good quality, based on other Wechsler tests
+ child-friendly stimuli
subscales?
standard is 10 subtests, but 6 (below) are needed for a full IQ
5 primary index scales
verbal comprehension (information, similarities)
visual spatial (block design)
fluid reasoning (matrix reasoning)
working memory (picture memory?)
processing speed (bug search)
Stanford-Binet for early Childhood
what does this test measure?
intelligence scale for children
theoretical background?
target group?
children aged 2 → 7 years (age range with very high distractibility)
strengths and weaknesses?
+ uses routing procedure to assess general cognitive abilities before proceeding (saves time without losing precision)
+ high reliability
+ supported validity
+ test observation checklist informs about relevant behaviors during examination (distractibility)
subscales?
characteristics: traits found in various situations
specific behaviors: actually observed during testing
parent report
Leiter Test
what does this test measure?
nonverbal intelligence
theoretical background?
used as an instrument for hearing or speech impaired people
target group?
2-20 yrs. 11 mos.
non-english speaking, brain injury, autism, speech impairment, hearing problems, ADHD, giftedness, impoverished, lack of education, etc.
strengths and weaknesses?
+ not a single word is needed to conduct this test
+ high validity and reliability
+ no evidence of bias
+ obtains a composite IQ score (M = 100, SD = 15)
subscales?
match small cards according to illustrations on a display, is not timed
20 subtests that are sorted into 2 batteries
visualization and reasoning
matching, figure ground test, paper folding, figure rotation
memory and attention
memory span, spatial memory, associative memory, delayed recognition memory
The Thematic Apperception Test (TAT)
what does this test measure?
projective test
designed to access constructs of needs, press (influence of your environment) and personality through interpretations of stories told about ambiguous pictures
theoretical background?
developed by Henry Murray
based on idea that needs organize perception, thought and action
Alpha press = real forces
Beta press = subjective forces
target group?
everyone
strengths and weaknesses?
+ can be helpful for understanding certain conditions
- no standardized procedures (more method than test)
- low reliability
- 97% of users apply subjective criteria
- likely to over-diagnose
subscales?
30 picture with one or more people engaged in ambiguous activities
20 cards per exam
participant has to make up a dramatic story for each picture (what happened before, during, and after, what were the people feeling in that moment)
critical that the participant identifies the protagonist of the picture
The Draw-A-Person Test
what does this test measure?
projective test that measures personality
theoretical background?
designed by Machover, said that examinees were likely to project acceptable impulses onto the same-sex figure and unacceptable impulses onto the opposite sex figure
thought that size of parts of the figure could determine sexual orientation
generally bad psychometric properties
good for assessment of children with behavior disorders
target group?
everyone, but usually used on children
strengths and weaknesses?
- not valid
- not empirically supported
- not predictive of intelligence
- not good for inferring nuances of personality
+ good for assessing children with behavior disorders or emotional disturbances
subscales?
draw one figure with a pencil and eraser
draw a person of the opposite sex
make up a story line about a novel character
The House-Tree-Person Test
what does this test measure?
projective test used to measure personality
freehand drawings of a house, a tree, and a person
theoretical background?
was first used to assess intelligence levels, now only used as a projective measure of personality
house = home life
tree = perception of the environment
person = interpersonal relationships
target group?
everyone, but usually used on children
strengths and weaknesses?
- many people find the drawings not worth the effort, do not actually put in the work
- no support for validity or reliability
- validation research not possible
+ can help derive hypothesis for further investigation through other tests
subscales?
free drawings of a house, a tree, and a person first with crayons and then with pencils
then asked 60 questions about aspects of their drawings
Kinetic Family Drawing
what does this test measure?
projective test to assess family dynamics
theoretical background?
used to assess intelligence levels at first
used as a projective measure of personality
drawing of a whole family doing something
target group?
children, specifically ones who have survived child abuse
strengths and weaknesses?
- low validity and low reliability
subscales?
drawing of a whole family doing something, examinee is then asked about the drawing and the people in the drawing
State-Trait Anxiety Inventory (STAI)
what does this test measure?
self-report measure of anxiety
theoretical background?
differentiates between temporary state and longer lasting trait anxiety
state anxiety = transitory emotional state that has tension and activation of the ANS
trait anxiety = stable anxiety
target group?
anyone, usually adolescents → adults
used in research and clinical settings
strengths and weaknesses
+ many fields of application
+ high internal consistency
+ well established validity
0.5 reliability for state anxiety
0.8 reliability for trait anxiety
subscales?
state scale:
20 items about how respondent feels at that moment
trait scale:
20 items about how respondent feels generally
4 point scale, between 20 and 80, the higher the score the more anxious you are
Eysenck Personality Questionnaire
what does this test measure?
major dimensions of normal and abnormal personality
theoretical background?
derived through factor analysis, found out three major personality dimensions:
psychoticism (high P = aggressive, hostile, antisocial)
extraversion (high E = loud, gregarious, outgoing)
neuroticism (high N = nervous, maladjusted, emotional)
target group?
test for everyone 16 and older
test for everyone 7 → 15 years
strengths and weaknesses?
+ good reliability and construct validity
+ strong psychometric properties
subscales?
three scales:
psychoticism (P)
extraversion (E)
neuroticism (N)
to check validity of answers there is a lie (L) scale
90 yes/no questions for adults
ages 7-15 has 80 questions
Behavioral Avoidance Test (BAT)
what does this test measure?
measures how long a person can tolerate anxiety inducing stimuli
theoretical background?
based on the reasoning that it is mainly the fear of a person which determines their subsequent behaviors
exposure based test
target group?
people undergoing exposure-based treatment
used a lot for phobias
strengths and weaknesses?
- clients responses are often influenced by demand characteristics (situation specific) and therefore are not generalizable
+ strongly correlated with self-reports of catastrophic thoughts
subscales?
test how far a client is willing to go in a fear-related situation
if they are afraid of spiders, points are given for each step they take towards touching a spider (for example, 3 points for walking towards it, 6 points for opening the container it is in, etc.)
Beck Depression Inventory
what does this test measure?
measure of depression
theoretical background?
focuses on the cognitive distortions that underlie depression
target group?
used in cognitive behavioral therapy
used for variety of conditions: MD, dysthymia, alcoholism, addiction, mixed problems
used in primary care
strengths and weaknesses?
+ great internal consistency
+ no racial bias
+ high correlations with other measures of depression
- differential item functioning (old people have less cognitive symptoms, etc.)
- transparency, can easily hide symptoms
- low test-retest reliability, but this is due to fluctuations in depression levels not measurement error
subscales?
21 items, all scored from 0→3 (highest score is 63)
13 items for cognitive and affective components
8 items for somatic and performance variables
total scores norms:
0-9 = normal
10-19 = mild to moderate depression
20-29 = moderate to severe depression
30-63 = severe depression
Rapid Couples Interaction Scoring System (RCISS)
what does this test measure?
analogue behavioral assessment for couples’ interaction
can also be used for parent-child interactions
theoretical background?
target group?
couples
parents and children
strengths and weaknesses?
- does not detect subtle behavior, only obviously observable behavior
- often low interrater agreement on many dimensions
- very little test-retest reliability data
+ often good interrater agreement on general positive or negative affect
subscales?
22 codes that address speaker-listener behaviors, verbal and nonverbal communication, humor, smiling, disagreement, and compromise
NEO PI-R
neo personality inventory revised
what does this test measure?
personality test
theoretical background?
5-factor model of personality (BIG-5)
target group?
can be used for adolescents, but mostly just used for adults
used to make portraits of great leaders, letting biographers take the test for dead leaders (?)
used to detect psychopathology (as a supplementary instrument)
strengths and weaknesses?
+ high internal consistency
+ lots of evidence for validity
+ useful for research purposes
+ added improvements for adolescents and young adults
- only 3 items assess the validity of respondents answers (assumes honesty)
- lack of validty scales
subscales?
self report form (S-form) and observer form (R-form)
2 parallel forms with 240 items,
5 point dimensions (strongly disagree → strongly agree)
(O.C.E.A.N)
openness, conscientiousness, extraversion, agreeableness, neuroticism
Child Behavior Checklist (CBCL) and Teacher Report Form (TRF)
(both belong to the Achenbach System of Empirically Based Assessment, ASEBA)
what does this test measure?
problem behaviors in children with either internalizing or externalizing problems
theoretical background?
a screening tool for problem behaviors, not a diagnostic tool
gives 3 composite scores:
internalizing problems
externalizing problems
overall problems
target group?
two forms:
children 2→ 3 years
children 4→ 18 years
the TRF is used for children 5→ 18 years
strengths and weaknesses?
+ reliable and valid
+ 58 languages
+ intuitive and quick detection of when children fall into the clinical range
+ omnibus test that measures wide range of abilities/behavioral disorders
subscales?
2 sections
activities (competencies, social functioning, schooling)
problem behaviors w/ clinical subscales
withdrawn (internalizing scale)
somatic complaints (internalizing scale)
anxious/depressed (internalizing scale)
social problems, thought problems, attention problems
delinquent problems (externalizing scale)
aggressive behavior (externalizing scale)
Behavioral Assessment For Children
(TRS/PRS) / BASC 2 (TRS/PRS)
what does this test measure?
behavioral assessment of children
theoretical background?
derived from exploratory and confirmatory analysis
target group?
children from 2 → 21 yrs.
strengths and weaknesses?
+ contains validity scale to detect false responses
+ contain a fake bad scale (when teachers/parents tend to rate the children high on most dimensions)
+ assessment of both negative and adaptive behaviors
+ 3 norm referenced comparisons scales
subscales?
self-report scale, parent rating scale (assesses parent-child relationship and home environment behavior), teacher rating scale (assesses school related behavior), classroom observation system, structured developmental history scale
responses: never, sometimes, often, almost always
5 composite scales, 16 primary scales, 7 content scales
Behavioral Symptoms Index (overall level of behavioral problems, gives non-specific index or pathology)
aggression, attention problems, anxiety, atypicality, depression, and somatization
4 lower-order composite scores
internalizing problems (anxiety + depression + somatization)
externalizing problems (aggression + conduct problems + hyperactivity)
school problems (attention problems + learning problems)
adaptive skills (activities of daily living + adaptability + leadership + social skills + study skills)
Childhood Autism Rating Scale (CARS)
what does this test measure?
autism symptoms
theoretical background?
single domain rating scale, should be used following an omnibus scale (following an initial screening)
target group?
Children checked for autism above 2 yrs. old
strengths and weaknesses?
+ can be used in various settings (clinics, schools)
+ can be given by teachers
+ good reliability and good validity
subscales?
total score is used to rate on a continuum from non to mild to severe autism
BASC monitor for ADHD
what does this test measure?
measures abilities of ADHD children
theoretical background?
assesses primary symptoms of the disorder
designed to facilitate treatment for those with ADHD
target group?
children aged 4→ 18 yrs. with ADHD
strengths and weaknesses?
+ good reliability and validity
+ scales for teachers and parents
subscales?
rating scales for teachers and parents
4 scales:
attention problems
hyperactivity
internalizing problems
adaptive skills
Connors Rating Scale for ADHD
what does this test measure?
ADHD
theoretical background?
designed to be one piece in the diagnostic process, not a clinical tool itself
target group?
strengths and weaknesses?
subscales?
Pediatric Behavior Rating Scale (PBRS)
what does this test measure?
early onset bipolar disorder
theoretical background?
designed to distinguish children with early onset bipolar disorder from other children
target group?
children 3→ 18 yrs.
strengths and weaknesses?
+ adequate reliability and validity
+ separate scales for teachers and parents
subscales?
rating scales for teachers and parents
9 scales:
atipicality
irritability
grandiosity
hyper-activity/impulsivity
agression
inattention
affect
interaction
total bipolar score
adaptive behavior rating scales
Vineland Adaptive Behavior Scales
what does this test measure?
adaptive skills of people who have developmental/intellectual disabilities
theoretical background?
target group?
individuals with developmental and intellectual disabilities
strengths and weaknesses?
subscales?
survey/interview form (interview with the caregiver and parent with purpose of clinician gathering important info)
parent/caregiver rating form (covers same behaviors but more objectively)
teacher rating form (same behaviors but focuses on those classroom related)
Clinical Assessment Scale for the Elderly (CASE)
what does this test measure?
measures behavior and personality
theoretical background?
adult behavior rating scale
omnibus behavior rating scale, designed to be completed by a caregiver
target group?
caregivers of persons aged 55-90 years
strengths and weaknesses?
+ takes <30 minutes to complete
subscales?
13 scales:
anxiety
cognitive competence
depression
fear of aging
mania
obsessive-compulsive
paranoia
psychoticism
somatization
substance abuse
infrequency
lie
validity
Continuous Performance Test (CPT)
what does this test measure?
vigilance, sustained and selective attention, executive control and inhibition
theoretical background?
computer-based type of behavioral test
usually simple stimuli to exclude the effects of short-term memory
abnormalities in attention can hint at different conditions like bipolar disorder and borderline personality disorder
target group?
strengths and weaknesses?
+ highly sensitive for detecting disorders of self-regulation, attention, concentration, or inhibition
+ still offers valuable insights into executive control functions
- generally does not correlate well with behavioral observations
subscales?
requires intense levels of concentration for 15-20 minutes
psychophysiological assessment examples
what does this test measure?
physical changes during some event
theoretical background?
target group?
strengths and weaknesses?
+ highly sensitive
- requires careful calibration in addition to standardized protocols
subscales?
lie detector: heart rate, respiratory rate, galvanic skin response
EEG: brain activity
Electromyographs: muscle activity
penile plethysmograph: blood rushing to penis (arousal)
Peabody Picture Vocabulary Tests
what does this test measure?
hearing vocabulary
not a good measure of intelligence
theoretical background?
target group?
people who are deaf, have neurological speech and hearing impairments, impaired motor abilities (ex. after a stroke)
2.5 yrs → 90 + yrs.
strengths and weaknesses?
+ high validity and reliability
+ good measure of vocabulary
- validity for intelligence is low
- biased against african americans from low SES and indigenous americans
- may underestimate IQ
- not a valid measure for intelligence
subscales?
228 testing plates, each with 4 pictures
four pictures are presented, child points at the one that matches the verbal description the examiner gives (“point to sleeping”)
gives standard scores (M=100, SD=15)
screening tests
test that are supposed to identify people with a certain condition or disease before this condition becomes apparent
(ex. screening for school readiness before a child goes to school)
adaptive tests
computer adaptive testing (CAT)
method for administering tests that dynamically adapt to the examinee’s performance levels, varying the difficulty depending on the pervious answers
also called tailored testing
criterion tests
test that uses test scores to generate a statement about the behavior that can be expected by a person.
score is compared to a fixed standard, used in training and rehabilitation
example: most achievement tests in school
sentence completion tests
what does this test measure?
measures underlying attitudes, motivations, and needs
theoretical background?
respondent is presented with a series of items consisting of the first few words of a sentence, task is to provide an ending
target group?
strengths and weaknesses?
subscales?
themes of the sentences are parents, upbringing, wishes, ideals, relationships
BASC 2 - SOS (student observation system)
what does this test measure?
standardized observational recording system
theoretical background?
used in any setting that has educational goals, uses direct observation in a natural environment (a classroom, etc.)
target group?
students
strengths and weaknesses?
+ unique looks at immediate antecedents and consequences
+ useful in diagnosis, treatment planning, monitoring changes and treatment effectiveness of high interrater agreement
subscales?
14 subscales of dimensions of behavior (both positive and negative)
adaptive behavior tests
tests that assess adaptive behaviors, focus on conceptual skills, social skills, and practical skills
norm-references tests
tests where the score is compared to a reference group, used to assist in diagnosis and classification