1/168
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced |
---|
No study sessions yet.
Mode
It is the most frequently occurring score in a distribution.
Standard Score
This is defined as a raw score that has been converted from one scale to another scale, where the latter scale has some arbitrarily set mean and standard deviation.
Historical Testing Purpose
The earliest testing can be traced back to Han Dynasty in China, with the purpose to determine the individuals who can obtain government positions.
Z-Score Calculation
If the mean score is 2, and Standard Deviation is 2, the z-score for a raw score of 10 would be calculated as 4.
Psychological Assessment
In the case of being referred for evaluation tools to answer a referral question, the process conducted is called Psychological Assessment.
T-Score
This is a standard score with a Mean of 50 and a Standard Deviation of 10.
Average Calculation
The average of the numbers 23, 4, 5, 8, 16, 12, 6, 8 is 10.25.
Aptitude Test
When asked to take a test for the purpose of identifying potential to learn or acquire a specific skill, the type of test taken is an Aptitude Test.
Test User Role
As human resources personnel conducting tests to job applicants, you function as the Test User.
Negative Correlation Example
A negative correlation is illustrated by the example: As the internet speed decreases, the number of people admitted due to hypertension increases.
Personal Data Sheet
During World War I, Robert Woodworth developed the Personal Data Sheet used to measure adjustment and emotional stability among soldiers.
Measure of Correlation
To correlate the passing and failing scores of Riverdale High students with males and females, Junas should use the Point-Biserial Correlation.
Level Tests Administration
Maria, a Registered Psychometrician, can only administer Level B and Level C tests.
Norm-Referenced Testing
The evaluation of individual scores by comparing them to the scores of the whole class is called Norm-Referenced Testing.
Properties of Scales
The properties of scales include Magnitude, Equal Intervals, and Divisibility, but not Absolute Zero.
Ordinal Scale
Ranking characters according to preference is an example of an Ordinal scale.
Stratified Sampling
Stratified sampling reduces bias.
Random Sampling Limitation
Random sampling is often not recommended due to the reduction of giving the same chance to every member of the population of being included in the sample.
Purposive sampling
Involves arbitrarily selecting some sample because it is believed that it will be the best to represent the population.
Incidental Sampling
Done due to budgetary limitations or other constraints.
Kurtosis
Defined as the steepness of a distribution in its center.
Peaked kurtosis
Referred to as Leptokurtic.
Scatterplot
A simple graph of the coordinate points for values of the X-variable and the Y variable, providing a quick indication of the direction and magnitude of the relationship between variables.
Z-score of 1
Indicates that the student scored greater than or equal to 84% of her class.
Reliability
Refers to the consistency in measurement.
Random Error
An error attributed to unexpected variations affecting the test scores.
Criterion-Related Validity
Evidence of validity based on the correlation between test scores and other measures.
Test-taker Variable
A variable that may affect the reliability of scores based on the test-taker's condition.
Convergent Evidence/Validity
Established when calculating correlations with tests measuring an identical construct and related constructs.
Test-Retest Reliability
Reliability established by administering the same test to the same sample after a period of time.
Central Tendency Error
A rating error where test-takers tend to choose the middle option due to a tendency to play safe.
Halo Effect
A rating error committed when the rater has a preconceived positive view about the person being rated.
Split-Half Reliability
Refers to reliability obtained by correlating two pairs of scores obtained from equivalent halves of a single test administered once.
Estimates
Used to determine the number of items needed to attain a desired level of reliability.
Health Error
An error attributed to the test-taker's health condition affecting test performance.
Validity
The extent to which a test measures what it purports to measure.
Content Validity
The degree to which test items represent the content they are intended to measure.
Construct Validity
The extent to which a test measures a theoretical construct or trait.
Systematic Error
An error that consistently affects test scores in a particular direction.
Divergent Validity
Evidence established when tests measuring different constructs show low correlations.
Predictive Validity
The extent to which a test predicts future performance.
Interpersonal Variable
A variable that may affect the reliability of scores based on interactions with others.
Spearman-Brown Prophecy
Used to determine the number of items needed to attain a desired level of reliability.
Brogden-Cronbach-GIesser Formula
A formula used to calculate the dollar amount of a utility gain resulting from the use of a particular selection instrument under specified conditions.
False Positive
When the assessment tool says not to hire an applicant driver but he or she would have been rated as qualified.
Inter-Scorer Reliability
Type of reliability calculated to identify the degree of agreement of consistency between two or more scorers (judges or raters) with regard to a particular measure.
KR20
Method for estimating the internal consistency of a test with right or wrong answers, considering varying difficulty.
Absolute Cut Score
The cut score used when whoever gets a score of 75 on a 100-item quiz on Biochemistry will receive a sweet yes.
Test Development Steps
The correct order is (i) Test Conceptualization, (ii) Test Construction, (iii) Test Tryout, (iv) Test Revision, (v) Item Analysis.
Trinitarian View of Validity
Refers to Face Validity, Content Validity, and Construct Validity.
Face Validity
An issue when a test does not appear to measure what it actually intends to measure.
Item Difficulty Index
If 20 out of 40 students got the item correct, the item difficulty index is 0.50.
Floor Effect
Phenomenon when test-takers score very low due to very hard questions.
Item Difficulty Comparison
If Item # 1 has an Item Difficulty Index of 0.70, Item # 2 has an Item Difficulty Index of 0.30 and Item # 3 has an Item Difficulty of 0.50, then Item # 3 is more difficult than Item # 2.
IPSATIVE scoring
An approach to test scoring where the test-taker's responses are interpreted relative to the measured strength of the other traits for that test-taker.
Item Discrimination Index
If calculated as -1, it means both the high scorers and low scorers got the item wrong.
Parallel Forms Method
Method of reliability used to measure sources of error variance associated with time-sampling.
Optimal Item Difficulty Level
0
Test Performance Data
It refers to the test performance data of a particular group of test-takers that are designed for use as a reference when evaluating test scores.
Source of Error Variance
All of the above (Test Construction, Test Administration, Test Scoring and Interpretation).
Optimal Range of Item Discrimination Index
0.30 - 0.70
Scaling Method for Brand Preference
Paired-Comparison.
Item Reservoir
It is the reservoir from which items will or will not be drawn for the final version of the test.
Multiple-choice Format Elements
The following elements are necessary in this format except one: Root option.
Selected-Response Format Example
Matching-Item.
Experts vs. Laypeople on Academic Intelligence
Experts put emphasis on Social Aspects; Laypeople put emphasis on Motivation.
Depth of Knowledge Format
Constructed-Response Item: Essay.
Heritability of Intelligence
He was the first person to publish findings on the Heritability of Intelligence, thus framing the contemporary nature-nurture debate.
Computer-Administered Test Adaptation
Item Branching.
Advantage of Computer Adaptive Testing
It reduces the phenomenon arising from the diminished utility of a tool of assessment in distinguishing test-takers at the low end of the ability.
Binet-Simon Test Purpose
To determine which students did not learn effectively from regular classroom instruction so they could be given remedial work.
Coined the Term Mental Test
James McKeen Cattell.
Expert Opinion Method
The method of asking experts for their opinions on the inclusion of items in a test.
Concurrent Validity
A type of validity that assesses the correlation between a test and a criterion measure taken at the same time.
Assimilation
A basic mental operation where new information is incorporated into existing cognitive schemas.
Accommodation
A basic mental operation that involves changing cognitive schemas to fit new information.
Generalization
The application of a learned response to similar stimuli.
Diffusion
The spread of cultural beliefs and social activities from one group to another.
Standard Normal Distribution
In a standard normal distribution, a z-score of ± 2 constitutes 95% of the data.
Acquiescence
A rating error related to the rater's tendency to agree on all personality items regardless of the content of the item.
Social Desirability
The tendency of respondents to answer questions in a manner that will be viewed favorably by others.
Theory of Multiple Intelligence
Howard Gardner's theory that proposes the existence of various types of intelligences.
Speed Test
A specific type of test that requires the examinee to complete as many items as possible, containing items of uniform and generally simple levels of difficulty.
Ratio Scales
A scale of measurement that has a true zero and allows for the comparison of absolute magnitudes of the numbers.
Crystallized Intelligence
Acquired skills and knowledge that are dependent on exposure to a particular culture as well as on formal and informal education.
Fluid Intelligence
Nonverbal, relatively culture-free cognitive ability that is independent of specific instruction.
Predeterminism
The belief that one's abilities are dictated by genetic inheritance and cannot be enhanced by learning or intervention.
Wechsler Tests
A series of standardized tests designed to measure intelligence and cognitive ability in adults and children.
Flynn Effect
The phenomenon of population's experience of IQ gain over time, necessitating the routine standardization of IQ tests.
Army Beta Test
A test administered to foreign-born recruits with a poor grasp of English during World War 1, involving tasks such as maze completion and picture completion.
Culture-Free Tests
Tests designed to minimize the influence of culture on the evaluation procedure.
Culture-Fair Test
A test that aims to measure intelligence without cultural bias.
Black Intelligence Test of Cultural Homogeneity
A test that is least likely to be classified as a Culture-Fair Test.
Culture Fair
It is when a test incorporates the vocabulary, concepts, traditions, knowledge, and feelings associated with a particular culture.
Power Test
A type of test where items increase in difficulty as one answers the test. The aim is to identify the highest level of difficulty that the test-taker can take.
Operational Definition
Defining terms based on observable characteristics and how it is used in the study.
Benton Visual Retention Test
A neuropsychological test used in the assessment of brain damage. It consists of nine geometric figures that the subject is simply asked to copy.
APGAR Test
It is everybody's first test, which is a simple and rapid method of evaluating newborn infants.
IQ Calculation
Calculate the IQ of Juan if his chronological age is 12, while his mental age is 15.
Regression Analysis
A statistical mode used to predict the values of a dependent or a response variable based on values of at least one independent or exploratory variable.