1/99
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced | Call with Kai | Chat |
|---|
No analytics yet
Send a link to your students to track their progress
Appraisal can be defined as
the process of assessing or estimating attributes.
testing which is always performed in a group setting.
testing which is always performed on a single individual.
a pencil and paper measurement of assessing attributes.
the process of assessing or estimating attributes.
A test can be defined as a systematic method of measuring a sample of behavior. Test format refers to the manner in which test items are presented. The format of an essay test is considered a(n) ________
format.
subjective
objective
very precise
concise
subjective
The National Counselor Exam (NCE) is a(n) ________
test because the scoring procedure is specific.
subjective
objective
projective
subtest
objective
A short answer test is a(n) ________ test.
objective
culture-free
forced choice
free choice
free choice
The NCE and the CPCE would be examples of a(n) ________ test.
free choice
forced choice
projective
intelligence
forced choice
The _______index indicates the percentage of individuals who answered each item correctly.
difficulty
critical
intelligence
personal
difficulty
Short answer tests and projective measures utilize free response items. The NCE and the CPCE uses forced choice or so-called ________
vague
subjective
recognition
numerical
recognition
A true/false test has ____ recognition items,
similar
free choice
dichotomous
no
dichotomous
A test format could be normative or ipsative. In the normative format
each item depends on the item before it.
each item depends on the item after it.
the client must possess an IQ within the normal range.
each item is independent of all other items.
each item is independent of all other items.
A client who takes a normative test
cannot legitimately be compared to others who have taken the test.
can legitimately be compared to others who have taken the test.
could not have taken an IQ test.
could not have taken a personality test.
can legitimately be compared to others who have taken the test.
In an ipsative measure the person taking the test must compare items to one another. The result is that
an ipsative measure cannot be utilized for career guidance.
you cannot legitimately compare two or more people who have taken an ipsative test.
an ipsative measure is never a forced choice format.
an ipsative measure is never reliable.
you cannot legitimately compare two or more people who have taken an ipsative test.
Tests are often classified as speed tests versus power tests. A timed typing test used to hire secretaries would be
a power test.
neither a speed test nor a power test.
a speed test.
a fine example of an ipsative measure.
a speed test.
A counseling test consists of 300 forced response items. The person taking the test can take as long as he or she wants to answer the questions.
This is most likely a projective measure.
This is most likely a speed test.
This is most likely a power test.
This is most likely an invalid measure.
This is most likely a power test.
An achievement test measures maximum performance or present level of skill. Tests of this nature are also called attainment tests, while a personality test or interest inventory measures
typical performance.
minimum performance.
unconscious traits.
self-esteem by always relying on a Q-Sort design.
typical performance.
In a spiral test
the items get progressively easier.
the difficulty of the items remains constant.
the client must answer each question in a specified period of time.
the items get progressively more difficult.
the items get progressively more difficult.
In a cyclical test
the items get progressively easier.
the difficulty of the items remains constant.
you have several sections which are spiral in nature.
the client must answer each question in a specified period of time.
you have several sections which are spiral in nature.
A test battery is considered
a horizontal test.
a vertical test.
a valid test.
a reliable test.
a horizontal test.
In a counseling research study, two groups of subjects took a test with the same name. However, when they talked with each other they discovered that the questions were different. The researcher assured both groups that they were given the same test. How is this possible?
The researcher is not telling the truth. The groups could not possibly have taken the same test.
The test was horizontal.
The test was not a power test.
The researcher gave parallel forms of the same test.
The researcher gave parallel forms of the same test.
The most critical factors in test selection are
the length of the test and the number of people who took the test in the norming process.
horizontal versus vertical.
validity and reliability.
spiral versus cyclical format.
validity and reliability.
Which is more important, validity or reliability?
Reliability.
They are equally important.
Validity.
It depends on the test in question.
Validity.
In the field of testing, validity refers to
whether the test really measures what it purports to measure.
whether the same test gives consistent measurement.
the degree of cultural bias in a test.
the fact that numerous tests measure the same traits.
whether the test really measures what it purports to measure.
A counselor peruses a testing catalog in search of a test which will repeatedly give consistent results. The counselor
is interested in reliability.
is interested in validity.
is looking for information which is not available.
is magnifying an unimportant issue.
is interested in reliability.
Which measure would yield the highest level of reliability?
A TAT, projective test popular with psychodynamic helpers.
The WAIS-IV, a popular IQ test.
The MMPI-2, a popular personality test.
A very accurate postage scale.
A very accurate postage scale.
Construct validity refers to the extent that a test measures an abstract trait or psychological notion. An example would be
height.
weight.
ego strength.
the ability to name all men who have served as U.S. presidents.
ego strength.
Face validity refers to the extent that a test
looks or appears to measure the intended attribute.
measures a theoretical construct.
appears to be constructed in an artistic fashion.
can be compared to job performance.
looks or appears to measure the intended attribute.
A job test which predicted future performance on a job very well would
have high criterion/predictive validity
have excellent face validity
have excellent construct validity
not have incremental validity or synthetic validity
have high criterion/predictive validity
A new IQ test which yielded results nearly identical to other standardized measures would be said to have
good concurrent validity.
good face validity.
superb internal consistency.
all of the above.
good concurrent validity.
When a counselor tells a client that the Graduate Record Examination (GRE) will predict her ability to handle graduate work, the counselor is referring to
good concurrent validity.
construct validity.
face validity.
predictive validity.
predictive validity.
A reliable test is ________valid.
always
90%
not always
80%
not always
A valid test is ________reliable.
not always
always
never
80%
always
One method of testing reliability is to give the same test to the same group of people two times and then correlate the scores. This is called
a. test-retest reliability
b. equivalent forms reliability
c. alternate forms reliability
d. the split-half methods
test-retest reliability
One method of testing reliability is to give the same population alternate forms of the identical test. Each form will have the same psychometric/statistical properties as the original instrument. This is known as
test-retest reliability
equivalent or alternate forms reliability
the split-half method
internal consistency
equivalent or alternate forms reliability
A counselor doing research decided to split a standardized test in half by using the even items as one test and the odd items as a second test and then correlating them. The counselor
used an invalid procedure to test reliability.
was testing reliability via the split-half correlation method.
was testing reliability via the equivalent forms method.
was testing reliability via the inter-rater method.
was testing reliability via the split-half correlation method.
Which method of reliability testing would be useful with any easy test but not with a test of algebra problems?
a. test-retest
b. alternate forms
c. split-half
d. inter-rater/inter-observer
d. inter-rater/inter-observer
A reliability coefficient of 1.00 indicates
a lot of variance in the test.
a score with a high level of error.
a perfect score which has no error.
a typical correlation on most psychological and counseling tests.
a perfect score which has no error.
An excellent psychological or counseling test would have a reliability efficient of
a. 50
b. .90
c. 1.00
d. -.90
b. .90
A researcher working with a personality test discovers that the test has a reliability coefficient of .70 which is somewhat typical. This indicates that
70% of the score is accurate while 30% is inaccurate.
30% of the people who are tested will receive accurate scores.
70% of the people who are tested will receive accurate scores.
30% of the score is accurate while 70% is inaccurate.
70% of the score is accurate while 30% is inaccurate.
A career counselor is using a test for job selection purposes. An acceptable reliability coefficient would be ______ or higher.
.20
.55
.80
.70
.80
The same test is given to the same group of people using the test-retest reliability method. The correlation between the first and second administration is .70. The true variance (i.e., the percentage of shared variance or the level of the same thing measured in both) is
a. 70%
b. 100%
c. 50%
d. 49%
d. 49%
IQ means
a query of intelligence.
indication of intelligence.
intelligence quotient.
intelligence questions for test construction.
intelligence quotient.
_____ did research and concluded that intelligence was normally distributed like height or weight and that it was primarily genetic.
Spearman
Guilford
Williamson
Galton
Galton
Francis Galton felt intelligence was
a. a unitary faculty
b. best explained via a two factor theory
c. best explained via the person’s environment
d. fluid and crystallized in nature
a unitary faculty
J. P. Guilford isolated 120 factors which added up to intelligence. He also is remembered for his
thoughts on convergent and divergent thinking.
work on cognitive therapy.
work on behavior therapy.
work to create the first standardized IQ test.
thoughts on convergent and divergent thinking.
A counselor is told by his supervisor to measure the internal consistency reliability (i.e., homogeneity) of a test but not to divide the test in halves. The counselor would need to utilize
a. the split-half method
b. the test-retest method
c. the Kuder-Richardson coefficients of equivalence
d. cross-validation
the Kuder-Richardson coefficients of equivalence
The first intelligence test was created by
David Wechsler.
J. P. Guilford.
Francis Galton.
Alfred Binet and Theodore Simon.
Alfred Binet and Theodore Simon.
Today, the Stanford-Binet IQ test is
a. a nonstandardized meausre
b. a standardized measure
c. a projective measure
d. b and c
b. a standardized measure.
IQ stands for intelligence quotient, which is expressed by
a. CA/MA x 100
b. CA/MA x 100
c. MA/CA x 50
d. MA/CA x 100
d. MA/CA x 100
The Binet stressed age-related tasks. Utilizing this method, a 9-year-old task would be one which
only a 10-year-old child could answer.
only an 8-year-old child could answer.
50% of the 9-year-olds could answer correctly.
75% of the 9-year-olds could answer correctly.
50% of the 9-year-olds could answer correctly.
Simon and Binet pioneered the first IQ test around 1905. The test was created to
assess high school seniors in America.
assess U.S. military recruits.
discriminate children without an intellectual disability from children with an intellectual disability.
measure genius in the college population.
discriminate children without an intellectual disability from children with an intellectual disability.
Today the Stanford-Binet is used from age 2 to adulthood. The IQ formula has been replaced by the
SAS
SUDS
entropy
KR-20 formula
SAS
Most experts would agree that the Wechsler IQ tests gained popularity, as the Binet
must be administered in a group
favored the geriatric population
didn’t seem to be the best test for adults
was biased toward women
didn’t seem to be the best test for adults
The best IQ test for a 22-year-old single male would be the
WPPSI-III.
WAIS-IV.
WISC-IV.
any computer-based IQ test.The best IQ test for a 22-year-old single male would be the
WAIS-IV.
The best intelligence test for a sixth-grade girl would be the
WPPSI-IV
WAIS-IV
WISC-IV
Merrill-Plamer
WISC-IV
The best intelligence test for a kindergartner would be the
WPPSI-IV
WAIS-IV
WISC-IV
MYERS-BRIGGS TYPE TYPE INDICATOR
WPPSI-IV
The mean on the Wechsler and the Stanford-Binet Intelligence scale (SB5) is ____ and the SD is ____-
100;100
100;15 Wechsler, 16 Stanford-Binet
100;20
100;1
100;15 Wechsler, 16 Stanford-Binet
Group IQ tests like the Otis-Lennon, the Lorge-Thorndlike, and the California test of mental Abilities are popular in school settings. The advantage is that
group tests are quicker to administer
group tests are superior in terms of predicting school performance
group tests always have a higher degree of reliability
individual IQ tests are not appropriate for school children
group tests are quicker to administer
The group IQ test movement began
in 1905
with the work of Binet
with the Army Alpha and Army beta in World War 1
with freudian psychoanalysis and the psychodynamic movement
with the Army Alpha and Army beta in World War 1
In a culture-fair test
items are known to the subjects regardless of his or her culture
the test is not standardized
culture free items cannot be utilized
African Americans generally score higher than whites
items are known to the subjects regardless of his or her culture
The black versus white IQ controversy was sparked mainly by a 1969 article written by ____.
John Ertl
Raymond B. Cattell
Arthur Jensen
Robert Williams
Arthur Jensen
The MMPI-2 is
an IQ test.
a neurological test.
a projective personality test.
a standardized personality test.
a standardized personality test.
The word psychometric means
a form of measurement used by a neurologist.
any form of mental testing.
a mental trait which cannot be measured.
the test relies on a summated or linear rating scale.
any form of mental testing.
In a projective test the client is shown
something which is highly reinforcing.
something which is highly charged from an emotional standpoint.
a and b.
neutral stimuli.
neutral stimuli.
The 16 PF reflects the work of
Raymond B. Cattell
Carl Jung
James McKeen Cattell
Oscar K. Buros
Raymond B. Cattell
The Myers-Briggs type indicator reflects the work of
Raymond B. Cattell
Carl Jung
William Glasser
Oscar K buros.
Carl Jung
The counselor who favors projective measures would most likely be a
Rogerian.
strict behaviorist.
TA therapist.
psychodynamic clinician.
psychodynamic clinician.
An aptitude test is to ________ as an achievement test is to ________
what has been learned; potential
potential; what has been learned
profit from learning; potential
a measurement of current skills; potential
potential; what has been learned
Both the Rorschach and the Thematic Apperception Test (TAT) are projective tests. The Rorschach uses 10 inkblot cards while the TAT uses
a dozen inkblot cards..
verbal and performance IQ scales.
pictures
incomplete sentences.
pictures
Test bias primarily results from
a test being normed solely on white middle-class clients.
the use of projective measures.
using whites to score the test.
using IQ rather than personality tests.
a test being normed solely on white middle-class clients.
A counselor who fears the client has an organic, neurological, or motoric difficulty would most likely use the
Bender Gestalt II.
Rorschach.
Minnesota Multiphasic Personality Inventory-2.
Thematic Apperception Test.
Bender Gestalt II.
An interest inventory would be least valid when used with
a first-year college student majoring in philosophy.
a third-year college student majoring in physics.
an eighth-grade male with an IQ of 136.
a 46-year-old white male construction worker.
an eighth-grade male with an IQ of 136.
One major criticism of interest inventories is that
they have far too many questions.
they are most appropriate for dry young children
they emphasize professional positions and minimize blue-collar jobs
they favor jobs that will require a bachelors degree or higher
they emphasize professional positions and minimize blue-collar jobs
Interest inventories are positive in the sense that
they are reliable and not threatening to the test taker
they are always graded by the test taker
they require little or no reading skills
they have high validity in nearly all age brackets
they require little or no reading skills
A counselor who had an interest primarily in testing would most likely be a member of
HS-BCP.
AARC.
NASW.
ACES.
AARC.
The NCE is
an intelligence test.
an aptitude test.
a personality test.
an achievement test.
an achievement test.
The ________ are examples of aptitude tests.
O*NET Ability Profiler and the MCAT
GZTS and the MMPI-2
CPI and the MMPI-2
Strong and the LSAT
O*NET Ability Profiler and the MCAT
One problem with interest inventories is that the person often tries to answer the questions in a socially acceptable manner. Psychometricians call this response style phenomenon
standard error.
social desirability (the right way to feel in society).
cultural bias.
acquiescence.
social desirability (the right way to feel in society).
An aptitude test predicts future behavior while an achievement test measures what you have mastered or learned. In the case of a test like the ________ the distinction is unclear.
Binet
Wechsler
GRE
Bender
GRE
Your supervisor wants you to find a new personality test for your counseling agency. You should read
professional journals.
the Buros Mental Measurements Yearbook.
classic textbooks in the field as well as test materials produced by the testing company.
all of the above.
all of the above.
The standard error of measurement tells you
how accurate or inaccurate a test score is.
what population responds best to the test.
something about social loafing.
the number of people used in norming the test.
how accurate or inaccurate a test score is.
A new IQ test has a standard error of measurement (SEM) of 3. Tom scores 106 on the test. If he takes the test a lot, we can predict that about 68% of the time
Tom will score between 100 and 103.
Tom will score between 100 and 106.
Tom will score between 103 and 109.
Tom will score higher than Betty who scored 139.
Tom will score between 103 and 109.
A counselor created an achievement test with a reliability coefficient of .82. The test is shortened since many clients felt it was too long. The counselor shortened the test but logically assumed that the reliability coefficient would now
be approximately .88.
remain at .82.
be at least 10 points higher or lower.
be lower than .82.
be lower than .82.
A counselor can utilize psychological tests to help secure a ________
diagnosis if third-party payments
CPT
DSM or ICD
percentile
standard error
DSM or ICD
A colleague of yours invents a new projective test. Seventeen counselors rated the same client using the measure and came up with nearly identical assessments. This would indicate
high validity.
high reliability.
excellent norming studies.
culture fairness.
high reliability.
Counselors often shy away from self-reports since
clients often give inaccurate answers.
ACA ethics do not allow them.
clients need a very high IQ to understand them.
they are generally very lengthy.
clients often give inaccurate answers.
In most instances, who would be the best qualified to give the Rorschach Inkblot Test?
A counselor with NCC after his or her name.
A clinical psychologist.
A D.O. psychiatrist.
A social worker with LCSW after his or her name.
A clinical psychologist.
Your client, who is in an outpatient hospital program, is keeping a journal of irrational thoughts. This would be
an unethical practice based on NBCC ethical guidelines.
considered a standardized test.
an informal assessment technique.
an aptitude measure.
an informal assessment technique.
You are uncertain whether a test is intended for the population served by your not-for-profit agency. The best method of researching this dilemma would be to
contact a local APA clinical psychology graduate program.
e-mail the person who created the test.
read the test manual included with the test.
give the test to six or more clients at random.
read the test manual included with the test.
Clients should know that
validity is more important than reliability.
projective tests favor psychodynamic theory.
face validity is not that important.
a test is merely a single source of data and not infallible.
a test is merely a single source of data and not infallible.
One major testing trend is
computer-assisted testing and computer interpretations
more paper and pencil measures
to give school children more standardized tests
to train pastoral counselors to do projective testing
computer-assisted testing and computer interpretations
One future trend which seems contradictory is that some experts are pushing for
a greater reliance on tests while others want to rely on them less
social workers to do most of the testing
psychiatrists to do most of the testing
counselors to ban all computer assisted tests
a greater reliance on tests while others want to rely on them less
Most counselors would agree that
more preschool IQ testing is necessary
teachers need to give more personality tests
more public education is needed in the area of testing
the testing mystique has been beneficial to the general public
more public education is needed in the area of testing
____ would be an informal method of appraisal
IQ testing
standardized personality testing
GRE scores
a checklist
a checklist
The WAIS-IV is given to 100,000 individuals in the United States who are picked at random. A counselor would expect that
approximately 68% would score between 85 and 115
approximately 68% would score between 70 and 130
the mean IQ would be 112
50% of those tested would score 112 or above
approximately 68% would score between 85 and 115
A word association test would be an example of
a neuropsychological test
a motoric test
an achievement test
a projective test
a projective test
Infant IQ tests are
more reliable than those given later in life
more unreliable than those given later in life
not related to learning expereinices
never used
more unreliable than those given later in life
A good practice for counselors is to
always test the client yourself rather than referring the client for testing
never generalize on the basis of a single test score
stay away from culture-free tests
stay away from scoring the test yourself
never generalize on the basis of a single test score
You want to admit only 25% of all counselors to an advanced training program in psychodynamic group therapy. The item difficulty on the entrance exam for applicants would be best set at
0.0
.5 regardless of the admission requirement
1.0
.25
.25
According to public law 93-380, also known as the Buckley amendment, a 19 year old college student attending college
could view her record, which included test data
could view her daughter’s infant IQ test given at preschool
could demand a correction she discovered while reading a file
all of the above
all of the above
Lewis Terman
constructed the Weschler tests
constructed the initial Binet prior to 1910
constructed the Rorschach
americanized the Binet
americanized the Binet
In constructing a test you notice that all 75 people correctly answered item number 12. This gives you an item difficulty of
1.2
.75
1.0
0.0
1.0