Chapter 8 Assessment & testing

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/99

There's no tags or description

Looks like no tags are added yet.

Last updated 4:02 PM on 7/4/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai	Chat

No analytics yet

Send a link to your students to track their progress

100 Terms

New cards

Appraisal can be defined as

the process of assessing or estimating attributes.

testing which is always performed in a group setting.

testing which is always performed on a single individual.

a pencil and paper measurement of assessing attributes.

the process of assessing or estimating attributes.

New cards

A test can be defined as a systematic method of measuring a sample of behavior. Test format refers to the manner in which test items are presented. The format of an essay test is considered a(n) ________

format.

subjective

objective

very precise

concise

subjective

New cards

The National Counselor Exam (NCE) is a(n) ________

test because the scoring procedure is specific.

subjective

objective

projective

subtest

objective

New cards

A short answer test is a(n) ________ test.

objective

culture-free

forced choice

free choice

New cards

The NCE and the CPCE would be examples of a(n) ________ test.

free choice

forced choice

projective

intelligence

forced choice

New cards

The _______index indicates the percentage of individuals who answered each item correctly.

difficulty

critical

intelligence

personal

difficulty

New cards

Short answer tests and projective measures utilize free response items. The NCE and the CPCE uses forced choice or so-called ________

vague

subjective

recognition

numerical

recognition

New cards

A true/false test has ____ recognition items,

similar

free choice

dichotomous

New cards

A test format could be normative or ipsative. In the normative format

each item depends on the item before it.

each item depends on the item after it.

the client must possess an IQ within the normal range.

each item is independent of all other items.

New cards

A client who takes a normative test

cannot legitimately be compared to others who have taken the test.

can legitimately be compared to others who have taken the test.

could not have taken an IQ test.

could not have taken a personality test.

can legitimately be compared to others who have taken the test.

New cards

In an ipsative measure the person taking the test must compare items to one another. The result is that

an ipsative measure cannot be utilized for career guidance.

you cannot legitimately compare two or more people who have taken an ipsative test.

an ipsative measure is never a forced choice format.

an ipsative measure is never reliable.

you cannot legitimately compare two or more people who have taken an ipsative test.

New cards

Tests are often classified as speed tests versus power tests. A timed typing test used to hire secretaries would be

a power test.

neither a speed test nor a power test.

a speed test.

a fine example of an ipsative measure.

a speed test.

New cards

A counseling test consists of 300 forced response items. The person taking the test can take as long as he or she wants to answer the questions.

This is most likely a projective measure.

This is most likely a speed test.

This is most likely a power test.

This is most likely an invalid measure.

This is most likely a power test.

New cards

An achievement test measures maximum performance or present level of skill. Tests of this nature are also called attainment tests, while a personality test or interest inventory measures

typical performance.

minimum performance.

unconscious traits.

self-esteem by always relying on a Q-Sort design.

typical performance.

New cards

In a spiral test

the items get progressively easier.

the difficulty of the items remains constant.

the client must answer each question in a specified period of time.

the items get progressively more difficult.

New cards

In a cyclical test

the items get progressively easier.

the difficulty of the items remains constant.

you have several sections which are spiral in nature.

the client must answer each question in a specified period of time.

you have several sections which are spiral in nature.

New cards

A test battery is considered

a horizontal test.

a vertical test.

a valid test.

a reliable test.

a horizontal test.

New cards

In a counseling research study, two groups of subjects took a test with the same name. However, when they talked with each other they discovered that the questions were different. The researcher assured both groups that they were given the same test. How is this possible?

The researcher is not telling the truth. The groups could not possibly have taken the same test.

The test was horizontal.

The test was not a power test.

The researcher gave parallel forms of the same test.

New cards

The most critical factors in test selection are

the length of the test and the number of people who took the test in the norming process.

horizontal versus vertical.

validity and reliability.

spiral versus cyclical format.

validity and reliability.

New cards

Which is more important, validity or reliability?

Reliability.

They are equally important.

Validity.

It depends on the test in question.

Validity.

New cards

In the field of testing, validity refers to

whether the test really measures what it purports to measure.

whether the same test gives consistent measurement.

the degree of cultural bias in a test.

the fact that numerous tests measure the same traits.

whether the test really measures what it purports to measure.

New cards

A counselor peruses a testing catalog in search of a test which will repeatedly give consistent results. The counselor

is interested in reliability.

is interested in validity.

is looking for information which is not available.

is magnifying an unimportant issue.

is interested in reliability.

New cards

Which measure would yield the highest level of reliability?

A TAT, projective test popular with psychodynamic helpers.

The WAIS-IV, a popular IQ test.

The MMPI-2, a popular personality test.

A very accurate postage scale.

New cards

Construct validity refers to the extent that a test measures an abstract trait or psychological notion. An example would be

height.

weight.

ego strength.

the ability to name all men who have served as U.S. presidents.

ego strength.

New cards

Face validity refers to the extent that a test

looks or appears to measure the intended attribute.

measures a theoretical construct.

appears to be constructed in an artistic fashion.

can be compared to job performance.

looks or appears to measure the intended attribute.

New cards

A job test which predicted future performance on a job very well would

have high criterion/predictive validity

have excellent face validity

have excellent construct validity

not have incremental validity or synthetic validity

have high criterion/predictive validity

New cards

A new IQ test which yielded results nearly identical to other standardized measures would be said to have

good concurrent validity.

good face validity.

superb internal consistency.

all of the above.

good concurrent validity.

New cards

When a counselor tells a client that the Graduate Record Examination (GRE) will predict her ability to handle graduate work, the counselor is referring to

good concurrent validity.

construct validity.

face validity.

predictive validity.

New cards

A reliable test is ________valid.

always

90%

not always

80%

not always

New cards

A valid test is ________reliable.

not always

always

never

80%

always

New cards

One method of testing reliability is to give the same test to the same group of people two times and then correlate the scores. This is called

a. test-retest reliability

b. equivalent forms reliability

c. alternate forms reliability

d. the split-half methods

test-retest reliability

New cards

One method of testing reliability is to give the same population alternate forms of the identical test. Each form will have the same psychometric/statistical properties as the original instrument. This is known as

test-retest reliability

equivalent or alternate forms reliability

the split-half method

internal consistency

equivalent or alternate forms reliability

New cards

A counselor doing research decided to split a standardized test in half by using the even items as one test and the odd items as a second test and then correlating them. The counselor

used an invalid procedure to test reliability.

was testing reliability via the split-half correlation method.

was testing reliability via the equivalent forms method.

was testing reliability via the inter-rater method.

was testing reliability via the split-half correlation method.

New cards

Which method of reliability testing would be useful with any easy test but not with a test of algebra problems?

a. test-retest

b. alternate forms

c. split-half

d. inter-rater/inter-observer

New cards

A reliability coefficient of 1.00 indicates

a lot of variance in the test.

a score with a high level of error.

a perfect score which has no error.

a typical correlation on most psychological and counseling tests.

a perfect score which has no error.

New cards

An excellent psychological or counseling test would have a reliability efficient of

a. 50

b. .90

c. 1.00

d. -.90

b. .90

New cards

A researcher working with a personality test discovers that the test has a reliability coefficient of .70 which is somewhat typical. This indicates that

70% of the score is accurate while 30% is inaccurate.

30% of the people who are tested will receive accurate scores.

70% of the people who are tested will receive accurate scores.

30% of the score is accurate while 70% is inaccurate.

70% of the score is accurate while 30% is inaccurate.

New cards

A career counselor is using a test for job selection purposes. An acceptable reliability coefficient would be ______ or higher.

.20

.55

.80

.70

.80

New cards

The same test is given to the same group of people using the test-retest reliability method. The correlation between the first and second administration is .70. The true variance (i.e., the percentage of shared variance or the level of the same thing measured in both) is

a. 70%

b. 100%

c. 50%

d. 49%

New cards

IQ means

a query of intelligence.

indication of intelligence.

intelligence quotient.

intelligence questions for test construction.

intelligence quotient.

New cards

_____ did research and concluded that intelligence was normally distributed like height or weight and that it was primarily genetic.

Spearman

Guilford

Williamson

Galton

New cards

Francis Galton felt intelligence was

a. a unitary faculty

b. best explained via a two factor theory

c. best explained via the person’s environment

d. fluid and crystallized in nature

a unitary faculty

New cards

J. P. Guilford isolated 120 factors which added up to intelligence. He also is remembered for his

thoughts on convergent and divergent thinking.

work on cognitive therapy.

work on behavior therapy.

work to create the first standardized IQ test.

thoughts on convergent and divergent thinking.

New cards

A counselor is told by his supervisor to measure the internal consistency reliability (i.e., homogeneity) of a test but not to divide the test in halves. The counselor would need to utilize

a. the split-half method

b. the test-retest method

c. the Kuder-Richardson coefficients of equivalence

d. cross-validation

the Kuder-Richardson coefficients of equivalence

New cards

The first intelligence test was created by

David Wechsler.

J. P. Guilford.

Francis Galton.

Alfred Binet and Theodore Simon.

New cards

Today, the Stanford-Binet IQ test is

a. a nonstandardized meausre

b. a standardized measure

c. a projective measure

d. b and c

b. a standardized measure.

New cards

IQ stands for intelligence quotient, which is expressed by

a. CA/MA x 100

b. CA/MA x 100

c. MA/CA x 50

d. MA/CA x 100

New cards

The Binet stressed age-related tasks. Utilizing this method, a 9-year-old task would be one which

only a 10-year-old child could answer.

only an 8-year-old child could answer.

50% of the 9-year-olds could answer correctly.

75% of the 9-year-olds could answer correctly.

50% of the 9-year-olds could answer correctly.

New cards

Simon and Binet pioneered the first IQ test around 1905. The test was created to

assess high school seniors in America.

assess U.S. military recruits.

discriminate children without an intellectual disability from children with an intellectual disability.

measure genius in the college population.

discriminate children without an intellectual disability from children with an intellectual disability.

New cards

Today the Stanford-Binet is used from age 2 to adulthood. The IQ formula has been replaced by the

SAS

SUDS

entropy

KR-20 formula

SAS

New cards

Most experts would agree that the Wechsler IQ tests gained popularity, as the Binet

must be administered in a group

favored the geriatric population

didn’t seem to be the best test for adults

was biased toward women

didn’t seem to be the best test for adults

New cards

The best IQ test for a 22-year-old single male would be the

WPPSI-III.

WAIS-IV.

WISC-IV.

any computer-based IQ test.The best IQ test for a 22-year-old single male would be the

WAIS-IV.

New cards

The best intelligence test for a sixth-grade girl would be the

WPPSI-IV

WAIS-IV

WISC-IV

Merrill-Plamer

WISC-IV

New cards

The best intelligence test for a kindergartner would be the

WPPSI-IV

WAIS-IV

WISC-IV

MYERS-BRIGGS TYPE TYPE INDICATOR

WPPSI-IV

New cards

The mean on the Wechsler and the Stanford-Binet Intelligence scale (SB5) is ____ and the SD is ____-

100;100

100;15 Wechsler, 16 Stanford-Binet

100;20

100;1

100;15 Wechsler, 16 Stanford-Binet

New cards

Group IQ tests like the Otis-Lennon, the Lorge-Thorndlike, and the California test of mental Abilities are popular in school settings. The advantage is that

group tests are quicker to administer

group tests are superior in terms of predicting school performance

group tests always have a higher degree of reliability

individual IQ tests are not appropriate for school children

group tests are quicker to administer

New cards

The group IQ test movement began

in 1905

with the work of Binet

with the Army Alpha and Army beta in World War 1

with freudian psychoanalysis and the psychodynamic movement

with the Army Alpha and Army beta in World War 1

New cards

In a culture-fair test

items are known to the subjects regardless of his or her culture

the test is not standardized

culture free items cannot be utilized

African Americans generally score higher than whites

items are known to the subjects regardless of his or her culture

New cards

The black versus white IQ controversy was sparked mainly by a 1969 article written by ____.

John Ertl

Raymond B. Cattell

Arthur Jensen

Robert Williams

Arthur Jensen

New cards

The MMPI-2 is

an IQ test.

a neurological test.

a projective personality test.

a standardized personality test.

New cards

The word psychometric means

a form of measurement used by a neurologist.

any form of mental testing.

a mental trait which cannot be measured.

the test relies on a summated or linear rating scale.

any form of mental testing.

New cards

In a projective test the client is shown

something which is highly reinforcing.

something which is highly charged from an emotional standpoint.

a and b.

neutral stimuli.

New cards

The 16 PF reflects the work of

Raymond B. Cattell

Carl Jung

James McKeen Cattell

Oscar K. Buros

Raymond B. Cattell

New cards

The Myers-Briggs type indicator reflects the work of

Raymond B. Cattell

Carl Jung

William Glasser

Oscar K buros.

Carl Jung

New cards

The counselor who favors projective measures would most likely be a

Rogerian.

strict behaviorist.

TA therapist.

psychodynamic clinician.

New cards

An aptitude test is to ________ as an achievement test is to ________

what has been learned; potential

potential; what has been learned

profit from learning; potential

a measurement of current skills; potential

potential; what has been learned

New cards

Both the Rorschach and the Thematic Apperception Test (TAT) are projective tests. The Rorschach uses 10 inkblot cards while the TAT uses

a dozen inkblot cards..

verbal and performance IQ scales.

pictures

incomplete sentences.

pictures

New cards

Test bias primarily results from

a test being normed solely on white middle-class clients.

the use of projective measures.

using whites to score the test.

using IQ rather than personality tests.

a test being normed solely on white middle-class clients.

New cards

A counselor who fears the client has an organic, neurological, or motoric difficulty would most likely use the

Bender Gestalt II.

Rorschach.

Minnesota Multiphasic Personality Inventory-2.

Thematic Apperception Test.

Bender Gestalt II.

New cards

An interest inventory would be least valid when used with

a first-year college student majoring in philosophy.

a third-year college student majoring in physics.

an eighth-grade male with an IQ of 136.

a 46-year-old white male construction worker.

an eighth-grade male with an IQ of 136.

New cards

One major criticism of interest inventories is that

they have far too many questions.

they are most appropriate for dry young children

they emphasize professional positions and minimize blue-collar jobs

they favor jobs that will require a bachelors degree or higher

they emphasize professional positions and minimize blue-collar jobs

New cards

Interest inventories are positive in the sense that

they are reliable and not threatening to the test taker

they are always graded by the test taker

they require little or no reading skills

they have high validity in nearly all age brackets

they require little or no reading skills

New cards

A counselor who had an interest primarily in testing would most likely be a member of

HS-BCP.

AARC.

NASW.

ACES.

AARC.

New cards

The NCE is

an intelligence test.

an aptitude test.

a personality test.

an achievement test.

New cards

The ________ are examples of aptitude tests.

O*NET Ability Profiler and the MCAT

GZTS and the MMPI-2

CPI and the MMPI-2

Strong and the LSAT

O*NET Ability Profiler and the MCAT

New cards

One problem with interest inventories is that the person often tries to answer the questions in a socially acceptable manner. Psychometricians call this response style phenomenon

standard error.

social desirability (the right way to feel in society).

cultural bias.

acquiescence.

social desirability (the right way to feel in society).

New cards

An aptitude test predicts future behavior while an achievement test measures what you have mastered or learned. In the case of a test like the ________ the distinction is unclear.

Binet

Wechsler

GRE

Bender

GRE

New cards

Your supervisor wants you to find a new personality test for your counseling agency. You should read

professional journals.

the Buros Mental Measurements Yearbook.

classic textbooks in the field as well as test materials produced by the testing company.

all of the above.

New cards

The standard error of measurement tells you

how accurate or inaccurate a test score is.

what population responds best to the test.

something about social loafing.

the number of people used in norming the test.

how accurate or inaccurate a test score is.

New cards

A new IQ test has a standard error of measurement (SEM) of 3. Tom scores 106 on the test. If he takes the test a lot, we can predict that about 68% of the time

Tom will score between 100 and 103.

Tom will score between 100 and 106.

Tom will score between 103 and 109.

Tom will score higher than Betty who scored 139.

Tom will score between 103 and 109.

New cards

A counselor created an achievement test with a reliability coefficient of .82. The test is shortened since many clients felt it was too long. The counselor shortened the test but logically assumed that the reliability coefficient would now

be approximately .88.

remain at .82.

be at least 10 points higher or lower.

be lower than .82.

New cards

A counselor can utilize psychological tests to help secure a ________

diagnosis if third-party payments

CPT

DSM or ICD

percentile

standard error

DSM or ICD

New cards

A colleague of yours invents a new projective test. Seventeen counselors rated the same client using the measure and came up with nearly identical assessments. This would indicate

high validity.

high reliability.

excellent norming studies.

culture fairness.

high reliability.

New cards

Counselors often shy away from self-reports since

clients often give inaccurate answers.

ACA ethics do not allow them.

clients need a very high IQ to understand them.

they are generally very lengthy.

clients often give inaccurate answers.

New cards

In most instances, who would be the best qualified to give the Rorschach Inkblot Test?

A counselor with NCC after his or her name.

A clinical psychologist.

A D.O. psychiatrist.

A social worker with LCSW after his or her name.

A clinical psychologist.

New cards

Your client, who is in an outpatient hospital program, is keeping a journal of irrational thoughts. This would be

an unethical practice based on NBCC ethical guidelines.

considered a standardized test.

an informal assessment technique.

an aptitude measure.

an informal assessment technique.

New cards

You are uncertain whether a test is intended for the population served by your not-for-profit agency. The best method of researching this dilemma would be to

contact a local APA clinical psychology graduate program.

e-mail the person who created the test.

read the test manual included with the test.

give the test to six or more clients at random.

read the test manual included with the test.

New cards

Clients should know that

validity is more important than reliability.

projective tests favor psychodynamic theory.

face validity is not that important.

a test is merely a single source of data and not infallible.

New cards

One major testing trend is

computer-assisted testing and computer interpretations

more paper and pencil measures

to give school children more standardized tests

to train pastoral counselors to do projective testing

computer-assisted testing and computer interpretations

New cards

One future trend which seems contradictory is that some experts are pushing for

a greater reliance on tests while others want to rely on them less

social workers to do most of the testing

psychiatrists to do most of the testing

counselors to ban all computer assisted tests

a greater reliance on tests while others want to rely on them less

New cards

Most counselors would agree that

more preschool IQ testing is necessary

teachers need to give more personality tests

more public education is needed in the area of testing

the testing mystique has been beneficial to the general public

more public education is needed in the area of testing

New cards

____ would be an informal method of appraisal

IQ testing

standardized personality testing

GRE scores

a checklist

New cards

The WAIS-IV is given to 100,000 individuals in the United States who are picked at random. A counselor would expect that

approximately 68% would score between 85 and 115

approximately 68% would score between 70 and 130

the mean IQ would be 112

50% of those tested would score 112 or above

approximately 68% would score between 85 and 115

New cards

A word association test would be an example of

a neuropsychological test

a motoric test

an achievement test

a projective test

New cards

Infant IQ tests are

more reliable than those given later in life

more unreliable than those given later in life

not related to learning expereinices

never used

more unreliable than those given later in life

New cards

A good practice for counselors is to

always test the client yourself rather than referring the client for testing

never generalize on the basis of a single test score

stay away from culture-free tests

stay away from scoring the test yourself

never generalize on the basis of a single test score

New cards

You want to admit only 25% of all counselors to an advanced training program in psychodynamic group therapy. The item difficulty on the entrance exam for applicants would be best set at

0.0

.5 regardless of the admission requirement

1.0

.25

New cards

According to public law 93-380, also known as the Buckley amendment, a 19 year old college student attending college

could view her record, which included test data

could view her daughter’s infant IQ test given at preschool

could demand a correction she discovered while reading a file

all of the above

New cards

Lewis Terman

constructed the Weschler tests

constructed the initial Binet prior to 1910

constructed the Rorschach

americanized the Binet

100

New cards

In constructing a test you notice that all 75 people correctly answered item number 12. This gives you an item difficulty of

1.2

.75

1.0

0.0

1.0