1/152
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced | Call with Kai |
|---|
No analytics yet
Send a link to your students to track their progress
Placement
Which of the following involves situations where people are assigned to different tracks, ordered in some way?
20,000
How many tests did the American Psychological Association (APA) estimate are developed every year?
Assessment
is defined as a systematic procedure for collecting information that can be used to make inferences about the characteristics of people or objects.
Speed tests
typically contain test items that are all about the same level of
difficulty
can focus on aptitude or achievement
A power test
Ratio
With which scale(s) of measurement is it possible to correctly state that a score of 80 reflects twice as much as a score of 40?
few scores at the low end and many scores grouped at the high end.
A negatively skewed distribution is one with
too difficult
If the distribution of scores on a classroom test has a strong positive skew, then for this group of students, test was most likely:
84%
Approximately what percentage of scores falls below 1 standard deviation above the mean?
0.16
Assume a correlation between two tests is 0.40 (r = 0.40). What would the coefficient of determination equal?
be decreased
If the range of one or both variables is restricted, the resulting correlation coefficient will likely
less than 0.40
If the correlation between math and writing achievement scores in the general population is 0.40, the correlation between these scores among students at MIT, Yale, and Stanford (selective and prestigious universities) would likely be:
49%
The correlation between two variables is 0.70. Using the concept of the coefficient of determination, the proportion of variance that is determined or predictable from the relationship between the two measures is:
51%
The correlation between two variables is 0.70. Using the concept of the coefficient of nondetermination, the proportion of variance that is NOT determined or predictable from the relationship between the two variables is:
Linear regression
A special mathematical procedure for predicting scores on one variable (criterion or Y) given a score on another (predictor or X) is
It uses squared raw score units.
Why would a psychologist feel that the variance might be difficult to interpret?
14
What is the range of the data 8, 10, 12, 14, 16, 18, 20, 22
0; 1
Z-scores have a mean of ___ and a standard deviation of ____.
1
What is the variance of z-scores?
He scored better than 98% of the other students.
Jimmy received a z-score of +2.0 on a math test. What do we know about Jimmy's performance, assuming that the math test scores are distributed normally?
50; 10
T-scores have a mean ____ and a standard deviation of _____.
She scored better than 2% of the other students.
Julie receives a t-score of 30 on a reading skills assessment. What can be determined from this information, assuming that the reading skills assessment scores are normally distributed?
Grade equivalents
According to the text authors, ___________ are norm-referenced derived scores that need to be interpreted with extreme caution and are only ordinal in nature.
how clearly the knowledge or skill domain being assessed is specified.
For criterion-referenced interpretations, the most important consideration is
The person with a rank of 55 had a higher score.
What can you conclude about two people with percentile rank scores of 55 and 30?
500; 100
CEEB (SAT/GRE) scores have a mean of _____ and a standard deviation of ______.
60
A z-score of 1 is equal to a T-score of
no
There is (a) _______ relationship between an individual's level on a construct and the amount of measurement error impacting their observed score.
observed score; stable test-taker characteristics
In Classical Test Theory, the X represents _______ and the T represents ________.
Content sampling
________ is/are usually considered the largest source of error in test scores.
temporal instability
Test-retest reliability is primarily sensitive to measurement error due to:
Split-half coefficient
Which reliability estimate would be preferred for a score derived from a test with heterogeneous content?
test-retest reliability
Which method of rating reliability would be appropriate for scores from a speed test?
smaller
Reliability coefficients based on a homogeneous sample would likely be ________ coefficients based on a heterogeneous sample.
73 to 77
Sally's obtained scored on a statistics exam is 75. The SEM is 2. With what confidence interval would we capture her true score 68% of the time?
practice effects
A limitation of the test-retest approach to estimating reliability is the influence of
0.90
If the reliability coefficient equals .81, the reliability index equals:
that validity is a unitary construct.
(validity) Contemporary psychometric standards emphasize
indeterminate — it depends on many factors.
According to the authors of the text, the minimum size for acceptable validity coefficients is
sensitivity
The __________ of a measure to a diagnostic condition is the ability of the test at a predetermined cut score to detect the presence of the disorder.
1.0
When deciding how many factors to retain in a factor analysis, a common approach, referred to as the Kaiser-Guttman criteria, is to examine eigenvalues and retain those
greater than ______.
can be generalized more than previously thought.
Research has shown that validity coefficients
linear regression
Which statistical procedure allows one to predict performance on one test from performance on another?
Principal factor analysis
Which factor analytic method analyzes only shared variance while excluding unique and error variance?
internal structure
Factor analysis is commonly used to collect validity evidence based on
As the validity coefficient increases, the standard error of estimate decreases.
What is the relationship between the standard error of estimate and the validity coefficient?
validity evidence based on relations to other variables
Test-criterion validity evidence is an example of:
criterion contamination
In order to avoid __________ when collecting test-criterion evidence, it is important that predictor and criterion scores be obtained independently.
the avoidance of inadvertent cues to the answers
General guidelines for writing test items include:
true-false
What item format is commonly used in both maximum performance tests and typical response tests?
the incorrect response on a multiple-choice items.
Distracters are
0.30
Your employer wants a test that will help him to select the upper 30% of employees to consider for new positions. It would be beneficial for the item difficulty index to average around
speed
On ___________ tests, measures of item difficulty largely reflects the position of the item in the test.
0.50
What is the optimal Item Difficulty Index on a test consisting of only constructed response items?
0.75
What is the approximate optimal Item Difficulty Index for a multiple-choice item with 4 choices?
0.20
A general recommendation is to use items with p values that have a range of approximately _________ around the optimal value.
0.60
On a reading comprehension exam, the proportion of examinees in the top group that answered item 5 correctly equaled 0.70 and the proportion of examinees in the bottom group that answered item 5 correctly equaled 0.10. What is the discrimination index for item 5?
relatively difficult and does not discriminate well.
A test item with a p = .30 and D = .15 is
at least some examinees and demonstrate negative discrimination
An effective distracter should be selected by
three-parameter
The ___________ IRT model takes into consideration the possibility of an examinee with no 'ability' answering the item correctly by chance.
Test Information Function
__________ illustrates the reliability of measurement at different points along the distribution.
All 3
Which of the following was described as a qualitative approach to item analysis? (Set the test aside and review at a later time; Have a colleague review the test; Allow examinees to provide feedback)
Items with a low p value are more difficult
Which of the following statements regarding p values is correct? (They can range from -1.0 to 1.0; Items with a high p value are more difficult; Items with a low p value are more difficult; Items with a p value of 1.0 were answered incorrectly by all students)
0.00; 1.0
Items with a value of _____ and ______ do not contribute to the variability of test scores.
0.85
he optimal p value for a true/false items is approximately:
Instruction in generic test-taking skills
What test preparation practices is recommended by the authors?
Preparation using multiple instructional techniques
What test preparation practices is most likely to produce increases in test scores associated with increases in mastery of the underlying domain of skills and knowledge?
Numerical Operations
The Wechsler Individual Achievement Test (WIAT-II) contains which of the following subtests? (Numerical Operations; Letter-Word Recognition; Math Fluency; Story Recall)
Woodcock-Johnson III Tests of Achievement
Which comprehensive individual achievement test battery is available in two parallel forms?
1/3
According to Stiggins & Conklin, approximately how much professional time do teachers devote to assessment related activities
Wide Range Achievement Test 4
What test is acceptable for screening purposes but not for in-depth diagnostic purposes?
Objectives should specify a narrow spectrum of knowledge and abilities.
When developing educational objectives, one should follow what general guideline?
formative evaluation
Dr. Mark addresses Susie and gives her specific feedback regarding the term paper she submitted and then allows her to resubmit her paper for grading. This initial evaluation is best described as:
Norm-referenced approach
Ms. Wilson assigns the top 10% of her students an A, the next 20% a B, and so on. This is an example of which score interpretation?
"Make informed guesses by process of elimination."
An important test-taking skill to teach students is:
3%
The No Child Left Behind Act of 2001 allows states to administer alternative assessments to _______ of their students.
Section 504 of the Rehabilitation Act of 1973
What law mandates that any institution that receives federal funds must ensure that individuals with disabilities have equal access to all programs and services provided by the institution?
1920s
In her research on the history of grading in the United States, Brookhart (2004) found that letter grades became common practice during the
aptitude test
A test designed to measure the skills, abilities, and knowledge accumulated over a lifetime would be properly called a(n)
Wechsler Intelligence Scale for Children, 4th edition
The ______________ is an example of a popular aptitude test.
less than 55
The more severe categories of moderate, severe, and profound mental retardation apply when the IQ is:
Iowa Tests of Basic Skills
The Cognitive Abilities Test, Form 6 is designed to be administered with which
achievement test?
Processing speed
The WISC-IV subtest Coding is classified by the text author as primarily a measure of:
The core subtest is seen as being inappropriate for the examinee
In what situation may a supplemental subtest be substituted for a core subtest on the WISC-IV?
Working Memory Index
Which of the following WISC-IV Indexes is a composite of the Digit Span and Letter-Number Sequencing subtests and measures abilities such as attention, concentration, and mental control?
10-12 points
According to adoption studies, what is the maximum gain in IQ points due to environmental interventions?
9
CHC theory incorporates _____ broad ability domains.
Two-Factor Theory, which hypothesizes the existence of crystallized and fluid
intelligence
Regarding research on intelligence, Cattell is best known for:
stable; good
According to the research, IQs are __________ over time and are __________ predictors of school performance.
personality
The relatively consistent patterns of thought, feeling, and behaviors that characterize each person as a unique individual define:
V index
What validity scale would you expect to be elevated if the examinee was overly careless while answering the items on the Behavior Assessment for Children - Self-Report of Personality?
14-18
The Minnesota Multiphasic Personality Inventory - Adolescents is designed for use with which age group?
MCMI-III
The ____________ contains personality and clinical syndrome scales that parallel the DSM-IV-TR diagnostic categories.
positive behaviors
A high score on the Personal Adjustment composite scale of the BASC-SRP indicates:
unconscious
The central hypothesis of projective techniques is that clients will interpret ambiguous material in a way that reveals important and often ___________ aspects of their personality.
to reject stereotypical gender roles.
A high score on the Masculinity-Femininity scale on the MMPI-2 clinical scale would indicate a tendency:
Forer effect
In the early days of personality assessment, one common validation technique was to have people take the test and then have a clinician write a description of the examinee's personality structure based on the test results. The description would then be provided to the examinee who would rate how much they agreed with the description. What is the major threat to this technique?
MCMI-III
The _________ is an example of a theoretically based contemporary objective personality test.
Conduct Problems
Which subscale of the BASC-2 Teacher Rating Scale would be indicative of the display of antisocial behavior?
Content scales
What is the new feature of the BASC-2?