1905
1917
1939 – 1945
Alfred Binet
WW1 – Psychological Testing for Military
WW2 – Developed more tests (IQ Test, Personality) Distinction between testing and assessment.
Psychological assessment
as the gathering and integration of psychology - related data for the purpose of making a psychological evaluation that is accomplished through the use of tools such as tests, interviews, case studies, behavioral observation, and specially designed apparatuses and measurement procedures.
Psychological testing
as the process of measuring psychology-related variables by means of devices or procedures designed to obtain a sample of behavior.
Collaborative psychological assessment
the assessor and assessed may work as “partners” from initial contact through final feedback.
Therapeutic psychological assessment
therapeutic self-discovery and new understandings are encouraged throughout the assessment process.
Dynamic assessment
refers to an interactive approach to psychological assessment that usually follows a model of evaluation, intervention of some sort, evaluation.
Test
may be defined simply as a measuring device or procedure. Administration and interpretation of a test score.
Psychological Test
refers to a device or procedure designed to measure variables related to psychology.
Tests may differ with:
Content, Format, Administration Procedures, Scoring and Interpretation Procedures
Content
subject matter of the test will, of course, vary with the focus of the particular test.
Format
pertains to the form, plan, structure, arrangement, and layout of test items as well as to related considerations such as time limits.
Also used to refer to the form in which a test is administered: computerized, pencil-and-paper, or some other form.
Administration procedures
demonstration of various kinds of tasks on the part of the assesses as well as trained observation of an assesses performance.
Score
code or summary statement, usually but not necessarily numerical in nature, that reflects an evaluation of performance on a test, task, interview, or some other sample of behavior.
Scoring
is the process of assigning such evaluative codes or statements to performance on tests, tasks, interviews, or other behavior samples.
Cut Score
(also referred to as a cutoff score or simply a cutoff) is a reference point, usually numerical, derived by judgment and used to divide a set of data into two or more classifications.
Psychometric soundness
when referring to how consistently and how accurately a psychological test measures what it purports to measure.
Psychometrics
the science of psychological measurements.
Psychometrist and Psychometrician
both terms referring to a professional who uses, analyzes, and interprets psychological test data.
Utility
refers to the usefulness or practical value that a test or assessment technique has for a particular purpose.
Interview
a method of gathering information through direct communication involving reciprocal exchange.
Interviews differ with regard to many variables such as their:
purpose, length, and nature.
Portfolio
As samples of one’s ability and accomplishment, may be used as a tool of evaluation.
Case history data
refers to records, transcripts, and other accounts in written, pictorial, or other form that preserve archival information, official and informal accounts, and other data and items relevant to an assesses.
Case study
report or illustrative account concerning a person or an event that was compiled on the basis of case history data.
Groupthink
arises as a result of the varied forces that drive decision-makers to reach a consensus.
Behavioral observation
as it is employed by assessment professionals, may be defined as monitoring the actions of others or oneself by visual or electronic means while recording quantitative and/or qualitative information regarding the actions.
naturalistic observation
observe behavior of humans in a natural setting—that is, the setting in which the behavior would typically be expected to occur.
A role-play test
is a tool of assessment wherein assesses are directed to act as if they were in a particular situation. Assesses may then be evaluated with regard to their expressed thoughts, behaviors, abilities, and other variable.
Local processing
Scoring may be done on-site
Central processing
conducted at some central location
Teleprocessing
test-related data may be sent to and returned from this central facility by means of phone lines by mail, or courier.
Simple scoring report
mere listing of score or scores, extended scoring report, which includes statistical analyses of the test taker’s performance.
Interpretive report
which is distinguished by its inclusion of numerical or narrative interpretive statements in the report.
Some contain relatively little interpretation and are limited to calling the test user’s attention to certain scores that need to be focused on.
Consultative report
usually written in language appropriate for communication between assessment professionals, may provide expert opinion concerning analysis of the data.
Integrative report
will employ previously collected data (such as medication records or behavioral observation data) into the test report.
Parties
in the assessment enterprise include developers and publishers of tests, users of tests, and people who are evaluated by means of tests.
Test developers and publishers
create tests or other methods of assessment.
The test user
Psychological tests and assessment methodologies are used by a wide range of professionals, including clinicians, counselors, school psychologists, human resources personnel, consumer psychologists, experimental psychologists, social psychologists, the list goes on.
The test taker
Having all taken tests, we all have had firsthand experience in the role of ___.
psychological autopsy
may be defined as a reconstruction of a deceased individual’s psychological profile on the basis of archival records, artifacts, and interviews previously conducted with the deceased assessee or with people who knew him or her.
Society at large
The societal need for “organizing” and “systematizing” has historically manifested itself in such varied questions
Educational settings
You are probably no stranger to the many types of tests administered in the classroom. As mandated by law, tests are administered early in school life to help identify children who may have special needs
achievement test
which evaluates accomplishment or the degree of learning that has taken place.
diagnostic test
refers to a tool of assessment used to help narrow down and identify areas of deficit to be targeted for intervention.
informal evaluation
as a typically nonsystematic assessment that leads to the formation of an opinion or attitude.
Clinical settings
These tools are used to help screen for or diagnose behavior problems.
Group testing
is used primarily for screening—that is, identifying those individuals who require further diagnostic evaluation.
Counseling settings
Assessment in a counseling context may occur in environments as diverse as schools, prisons, and government or privately owned institutions. Regardless of the particular tools used, the ultimate objective of many such assessments is the improvement of the assessed in terms of adjustment, productivity, or some related variable. Measures of social and academic skills and measures of personality, interest, attitudes, and values are among the many types of tests that a counselor might administer to a client.
Business and military settings
wide range of achievement, aptitude, interest, motivational, and other tests may be employed in the decision to hire as well as in related decisions regarding promotions, transfer, job satisfaction, and eligibility for further training.
Governmental and organizational
credentialing One of the many applications of measurement is in governmental licensing, certification, or general credentialing of professionals.
How Are Assessments Conducted?
Responsible test users have obligations before, during, and after a test or any measurement procedure is administered.
China (2200 BCE)
testing first came in, for selecting applicants for governmental jobs.
The content of the test is designed to measure proficiency in music, archery, horsemanship, writing, arithmetic, agriculture, geography, civil law, military, rites, and ceremonies.
Greco-Roman writings
attempted to classify people in terms of personality types. It is measured by abundance or deficiency in bodily fluids (blood or phlegm).
Middle Ages
it is a question of who is in league with the devil and tests are used to measure such.
Renaissance
psychological assessment in the modern sense started to begin.
Christian Von Wolf
in 18th century, he anticipated psych as a science and psych measurement as a specialty.
Charles Darwin
argued that chance variation would be selected by nature according to adaptability and survival. It spurred scientific interest in individual differences.
This aroused interest, admiration, and good deal of enmity. Sparked experimentation in animals. Compared consciousness level between animals and humans.
Francis Galton
half cousin of Darwin, became an influencer in the field of measurement of psych related variables.
He aspired to classify people according to their natural gifts and to understand their “deviation from an average”.
Credit with contributing to the development of contemporary measurement tools like questionnaires, rating scale and self-report inventories (he worked with sweet peas)
o Pioneered the use of statistical concept central to psych experimentation and testing (coefficient of correlation).
Karl Pearson
followed Galton and developed the product-moment correlation technique.
Had an anthropometric Laboratory
anthropometric Laboratory (1852)
to measure some variable like height, weight and etc.
Wilhelm Max Wundt
established first experimental psychology laboratory (1852) founded in Leipzig, Germany.
It also stressed the importance of assessment. Focused on the similarities rather than differences.
James McKeen-Cattell
student of Wundt, completed a doctoral dissertation that dealt with individual differences with regards to reaction time. Met Galton
Coined the term “mental test”
Charles Spearman
originated the concept of test reliability and building the framework for statistical technique of factor analysis.
Victor Henri
worked with Alfred Binet on papers suggesting how mental test measure higher mental processes.
Emil Kraepelin
experimented word association technique as a formal test.
Lightner Witmer
cited as the “little known founder of clinical psychology.
The nineteenth-century
much of the nineteenth-century testing that is psychological in nature involved the measurement of sensory abilities, reaction time and the like
The Twentieth Century
The measurement of Intelligence, etc.
Alfred Binet and Victor Henri (1895)
published several articles for the measurement of memory and social comprehension.
Alfred Binet and Theodore Simon
publish a 30-item “measuring scale of intelligence” to identify Paris school children with intellectual disability.
go through many revisions and translations and in the process of it, it launches both the intelligence testing movement and the clinical testing movement
David Wechsler
clinical psychologist at Bellevue NY, introduced a test designed to measure adult intelligence. originally called “Wechsler-Bellevue Intelligence Scale” then revised and renamed “Wechsler Adult Intelligence Scale” (WAIS)
David Wechsler
intelligence was “the aggregate or global capacity of the individual to act purposely, to think rationally and to deal effectively with his environment”
Group intelligence test
is a response to the military’s need for an efficient method of screening the intellectual ability of WW1 recruits.
Robert S. Woodworth
developed a measure of adjustment and emotional stability that could be administered quickly and effectively in groups of recruits
Personal Data Sheet
test is disguised for its true purpose, never went beyond the experimental stages because of the treaty of peace.
Woodworth Psychoneurotic Inventory
a personality test for civilian use that was based in the Personality Data Sheet. It is the first widely used as a self-report measure of personality.
Self-report
a process whereby assesses themselves supply assessment-related information by responding to questions, keeping a diary, or self-monitoring thoughts and behavior. There are shortcomings of the self-report method, and it was deemed that an alternative method is needed.
Projective Tests
an individual is assumed to “project” onto some ambiguous stimulus to his or her unique needs, fears, hopes and motivation. The ambiguous stimuli may be inkblot, a drawing, photograph or etc
Rorschach Inkblot test by Hermann Rorschach
it is a series of inkblots. When pictures are used as a projective stimulus, respondents are typically asked to tell a story about the picture which is then analyzed in terms of what needs and motivations the respondents may be trying to project onto the ambiguous pictures.
Thematic Apperception Test (Henry
Murray & Christiana D. Morgan)
Series of pictures and presented and test taker tells a story about the picture.
Academic Tradition
The scholars like Galton, Wundt and etc. uses the tools of assessment to help advance knowledge and understanding of human and animal behavior.
Applied Tradition
dates back to ancient China and the examinations developed there is to help select applicants for various positions on the basis of merit
Contemporary test
recognizes the need for cultural sensitivity in the development and use of tools.
Culture
defined as the “socially transmitted behavior patterns, beliefs and products of work of a particular population, community or group of people”. it prescribes behaviors and ways of thinking.
Collectivistic culture
value conformity, cooperation, interdependence and striving toward group goals (Eastern cultures).
Individualistic culture
values traits that focuses on self-reliance, autonomy, independence, uniqueness, and competitiveness (Western Cultures).
Henry H. Goddard
raised questions about how meaningful such tests are when used with people from various cultural and language backgrounds.
Block design test
tests to immigrants and found most immigrants from various nationalities to be mentally deficient when tested, blinded by the cultural factor, as a result those who failed at deported o He deemed immigrants as “feebleminded”.
His research fueled the fires of an ongoing nature-nurture debate about what intelligence tests actually measure.
Culture-specific tests
created to deal with the impact of language and culture on tests of mental ability, in essence, they isolated cultural variables.
Eugenics
science of improving the qualities of a breed through intervention with factors related to heredity.
Verbal Communication
Examiner and examinee must speak the same language
Especially tricky with infrequently used vocabulary or unusual idioms employed o Translator may lose nuances of translation or give unintentional hints toward more desirable answer
Also requires understanding of culture.
Nonverbal Communication and Behavior
Different between cultures o Ex.) meaning of not making eye contact Body movement could even have physical cause
Psychoanalysis
Freud’s theory of personality and psychological treatment which stated that symbolic significance is assigned to many nonverbal acts. Timing tests in cultures not obsessed with speed o Lack of speaking could be reverence for elders.
Tests and Group Membership
Tests of job ability should be basis for hiring not physical attributes.
Ethics
a body of rules thought to be good for society as a whole.
Law
a body of principles of right, proper, or good conduct.
Measurement
The act of assigning numbers or symbols to characteristics of things according to rules.
Scale
A set of numbers whose properties model empirical properties of the objects to which the numbers are assigned
Nominal Scales
These scales involve classification or categorization based on one or more distinguishing characteristics, where all things measured must be placed into mutually exclusive categories
Ordinal Scales
Also permits classification. Rank ordering on some characteristic. Have no absolute zero point.