PSYC3302 - Complete terms

studied byStudied by 2 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 218

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

219 Terms

1
Psychometrics
The science concerned with evaluating the attributes of psychological tests and the procedures used to estimate and evaluate the attributes of tests. Involves reliabilty and validity.
New cards
2
Psychological construct 
An unobservable hypothetical entity that is used to represent a pattern of psychologically related phenomena. Nature is abstract.
New cards
3
Tests or indicators used to represent a construct 
Should be __linked theoretically__ to the definition of the construct.
New cards
4
Psychological test
A systematic procedure for comparing the behavior of two or more people.
New cards
5
Criteria of tests
  1. Tests involve behavioural samples of some kind.

  2. The behavioural samples must be collected in some systematic way.

  3. The purpose of tests is to compare behaviours of two ore more people.

New cards
6
Inter-individual differences
__Comparing__ the cognition/behaviour of different people.
New cards
7
Intra-individual differences
Measure changes in the behaviour of the same people across time.
New cards
8
Types of tests

1: Content (e.g., skill, personality, attitudes)

2: Types of responses (e.g., multiple-choice, open-ended)

3: Administration procedure (e.g., individual vs. group)

4: Intended purpose (criterion vs. norm referenced)

5: Time constraints (speed vs. power)

New cards
9
Criterion referenced tests
Involve a __cut-off score__ that specifies whether someone has achieved a sufficient level of the skill or capacity. Used in contexts where a decision must be made about someone’s __skill level__.
New cards
10
Norm referenced test
Involve comparing a person’s score with that of a __reference sample__. Reference sample is considered to be __representative__ of the population of interest. Compared against the average score.
New cards
11
Scaling
The way in which numerical values are assigned to psychological attributes.
New cards
12
Properties of Identity

Differentiate categories of people who share a psychological feature (do and don’t). Numbers may be used to label the categories - do not have any inherent meaning or true mathematical value.

  1. Categories must be mutually exclusive.

  2. Categories should be exhaustive.

  3. All people classified within a given category must be identical with respect to the attribute of interest.

New cards
13
Properities of Order
More informative - Give the person with the highest level of an attribute a rank of 1, or alternatively, a rank of N (i.e., where N equals the total sample size). 
New cards
14
Properties of Quantity
Reflects the ability of numerals to provide information about the magnitude of differences between people. Numerals used in measurment are real numbers that indicate the amount of something. Can compare cases with each other in a meaningful way.
New cards
15
The number zero
Some are absolute, whereas others are relative (or arbitrary).
New cards
16
Additivity
Implies that the unit size of a measurement does not change as the units are being counted. When perfectly satisified is known as __conjoint measurement__.

Rarely satisfied in psychology.
New cards
17
Counting
Is a necessary but not sufficient condition for measurement.
New cards
18
Measurement
The assignment of numerals to objects or events according to rules. Numbers can be __assigned__ to __represent__ the quantities of psychological attributes.

Includes Steven’s four levels.
New cards
19
Nominal scales
Symbols or numerals that have a property of identity are used to label observations in which behaviors have been sorted into categories according to some psychological attribute.

Has identity.
New cards
20
Ordinal scales
Links observations of behaviour thought to reflect qualitative differences in amounts of an attribute to symbols or numerals that have the __property of order.__

Has identity and order.
New cards
21
Interval scales 
A constant distance between each of the units. However, there is no meaningful zero point.

Has identity, order and quantity.
New cards
22
Ratio scales
Same properites as interval but can say that a case has, say, 37% more of the attribute of interest than another case.

Has identity, order, quantity and absolute zero.
New cards
23
Dispersion
Represents the amount of variability in the data. Includes range, SD and variance.
New cards
24
Correlation
Standardized representation of the __association__ between two variables

Can range from -1.0 to 1.0
New cards
25
Composite Variables
Test __scores__ based on the __sum__ of __two__ or more items. Also called sum scores.
New cards
26
The variance of a composite score is a function of

1. The variance associated with the individual items.

2. The correlations amongst the items.
New cards
27
Composite variables - Variance
knowt flashcard image
New cards
28
Binary Items
Dichotomous items - correct or incorrect answer.

Responses are scored 0 or 1.
New cards
29
Binary items - Variance
The variance of a dichotomously scored item is maximized when half of the people score 1 and the other half score 0.
The variance of a dichotomously scored item is maximized when half of the people score 1 and the other half score 0.
New cards
30
Relative interpretations

Based on the analysis of data.

1: Knowing the mean of the distribution of scores

2: Knowing the standard deviation of the distribution of scores

New cards
31
Abstract interpretations
Based on the __theoretically relevant characteristics__ of the body of research which supports the test scores as valid indicators of a psychological construct.
New cards
32
Limitations of raw scores
Need to be __converted__ into standardized scores which incorporate information about the mean and standard deviation. More precise method.
New cards
33
*Z* scores
Have a mean of zero and a *SD* of one.

__Useful__ for the purposes of rendering inherently meaningless raw scores into easily __interpretable relative scores__.
Have a mean of zero and a *SD* of one.

__Useful__ for the purposes of rendering inherently meaningless raw scores into easily __interpretable relative scores__.
New cards
34
T scores

Involves rescaling z scores so that the converted scores have a different mean and standard deviation.

Mean is always 50 and the standard deviation is always 10.

Used in MMPI.

  1. Convert the raw scores into z-scores

  2. Convert the z-scores using the following formula: T = z(10)+50

New cards
35
Percentile ranks 
Indicate the __percentage__ of scores that are __equal to__ or __below a specific test score__.

Assumes that the distribution of raw scores is perfectly normal.
New cards
36
Probability samples
Use procedures that ensure a representative sample.
New cards
37
Random sampling
Type of probability sample, as you would expect a random sample from the population to be representative of the population.

Rarely happens in practice.
New cards
38
Closed-ended
Questions ask respondents to choose from a fixed set of alternatives. For example, yes/no questions.
New cards
39
Open-ended questions
Do not provide respondents with any response alternatives. Given space to respond in their own words.

This may take a narrative form (i.e., long-answer) or short-answer
New cards
40
Rating Scales
The most widely used response format such as the Likert scale. Likert scales use words to ‘anchor’ the numerical ratings. Most surveys use between 4 and 7 points.

Respondents are not capable of making the distinctions between adjacent points in a rating scale with 11 or more points.

6-7 points is best → 5 points is the bare mimimum for a continously measured variable.
New cards
41
Reliability
Pertinent to the __consistency__ of measurement.

The degree of correspondence between __observed scores__ and __true scores__. Theoretically, the correlation between observed scores and true scores.

A __necessary but not sufficient condition for validity__.

The __discrepancy__ between observed scores and true scores is considered to be due to __measurement error__.

Opposite to measurement error.

A property of test scores (from samples), not tests, per se.
New cards
42
Classical Test Theory 
A measurement theory that __defines the conceptual basis of reliability__.

Specifies procedures for estimating the reliability of scores derived from a psychological test or instrument.
New cards
43
True scores
A __hypothetical score__ devoid of measurement error.

Conceived in the context of a particular test or instrument – not a construct. Not “construct scores”.

\
May be “perfect”, but this is __only true in the context of measurement error__ associated with data derived from a particular test or instrument. We assume they are devoid of measurement error.
New cards
44
Observed scores
The scores we obtain __from tests__ or instruments.

We want the ______ scores to be as close to their corresponding true scores as possible.

\
Want a __large (positive) correlation__ between observed scores and true scores 

Want observed scores and error scores to be __uncorrelated__.

If observed scores and error scores are correlated highly, it means they are measuring the same process: error.
New cards
45
An observed score is
True score + measurment error.
True score + measurment error.
New cards
46
Error scores
Should have a __mean of zero__ - Should be just as many people that should have an observed score that is too large as too small (and the magnitudes should be the same).

\
Error scores should be a __random process__ - should not correlate with anything (except possibly their corresponding observed scores).

\
Error scores should be __uncorrelated with true scores__ - “extraneous” error related factors should affect people equally.
New cards
47

1. Reliablity is the ratio of true score variance to observed variance
The ratio of SS Effect to SS Total

Conceptually, in the reliability case, it is the ratio of SS True to SS Observed
The ratio of SS Effect to SS Total

Conceptually, in the reliability case, it is the ratio of SS True to SS Observed
New cards
48

2. Reliability is the (squared) correlation between observed scores and true scores
When you __square the reliability index__, you get a conceptual estimation of reliability.

Conceptual - it is not possible to know what the true scores are, so we’ll never be able to estimate the actual correlation.
When you __square the reliability index__, you get a conceptual estimation of reliability.

Conceptual - it is not possible to know what the true scores are, so we’ll never be able to estimate the actual correlation.
New cards
49

3. Reliability is the lack of error variance
Subtract the ratio of error variance to observed variance by 1 to place in the same context of reliability (rather than error).
Subtract the ratio of error variance to observed variance by 1 to place in the same context of reliability (rather than error).
New cards
50

4. Reliability is the lack of correlation between observed scores and error scores.
If reliability is the correlation between true scores and observed scores, then it is necessarily the case that it is the absence of a correlation between observed scores and error scores.
If reliability is the correlation between true scores and observed scores, then it is necessarily the case that it is the absence of a correlation between observed scores and error scores.
New cards
51
Parallel tests
Two tests are considered ____ if they are __identical__ to each other __psychometrically__, but __differ__ in the actual __items__ that make up each test.

Tau equivalence - a person’s true score on one test would be expected to be __identical__ on the other test. Assumes equal error variances between the two tests.
New cards
52
Reliability guidelines
According to Nunnally.

Exploratory Research = .70 or higher

Basic Research = .80 or higher

Applied Cases = .90 or even .95 or higher
According to Nunnally.

Exploratory Research = .70 or higher

Basic Research = .80 or higher

Applied Cases = .90 or even .95 or higher
New cards
53
Validity
Represents the degree to which the observed scores from a test __actually represent__ the attribute (or construct) of interest.
New cards
54
Alternative forms reliability
Same as parallel forms.

Carry-over effects of concern - would have to wipe the memory of test taker
New cards
55
Test-retest reliability
Create only __one test__, but administer it on __two different occasions__.

The same items are presented on both occasions, so the true scores should represent the same construct. Assume the construct of interest to be a stable one.

All other assumptions associated with parallel forms apply.
New cards
56
Equal error variances
If you had people sit the test under the same conditions (e.g., same room, same time of day), you might be able to assume that responses are affected by error to the same degree.
New cards
57
Test-retest interval
All other things equal, the __magnitude of the interval__ between the two testing sessions will affect the magnitude of the correlation between the scores.

Also known as ‘__stability coefficient__’.
New cards
58
Internal-Consistency Reliability
P__ractical alternative__ to the alternative forms procedure and the test-retest procedure. Respondents complete __one form of the test__ → diff items = diff forms.

\
1\. The __length__ of the test - A longer test (more items) will be found to yield more reliable scores than a shorter test.

2\. The degree of __consistency__ between the parts/items in the test.
New cards
59
Spearman-Brown formula
Makes an __adjustment__ to the internal consistency reliability estimate to reflect the actual length of the test.
New cards
60
Split-half issues
You get a different reliability estimate depending on how you specify the two halves of the test.

Very unlikely to __get the same correlation__ as in the first split-half example.
New cards
61
Cronbach’s alpha
A reliability formula that represents the reliability of __all possible split-halves__.

The ratio of true score variance to total variance, which is how reliability has been defined at the theoretical level.
New cards
62
Cronbach’s Alpha Assumptions
  1. Indicators (items) are essentially tau-equivalent - each item is an equally strong indicator of the true score scores (may differ by constant).

  2. Each item’s error term is uncorrelated with every other item’s error term - For items that are more alike, a positive correl exists (Cronbach can’t deal with this so it pretends it doesn’t exist).

  3. Error scores are uncorrelated with the true scores.

  4. Items used to generate a composite score measure only one attribute or construct - unidimensionality.

New cards
63
Standardized Coefficient Alpha
Should be applied to scores that have been converted from raw scores to standardized scores.
New cards
64
Kuder-Richardson 20 formula
Introduced to estimate the internal consistency reliability associated with composite scores based on dichotomously scored items.

Cronbach’s alpha formula works just fine on dichotomously scored items.
New cards
65
Parallel assumptions
Equal true score variance, equal means, equal error variance.

Most strict.
New cards
66
Tau-Equivalent
Equal true score variance, equal means, unequal error variances.
New cards
67
Essentially Tau-Equivalent
Equal true score variance, unequal means, unequal error variances.
New cards
68
Congeneric
Unequal true score variance (but all greater than zero), unequal means, unequal error variances.

Least strict.
New cards
69
Cronbach’s alpha is a lower-bound estimate

If the assumption of essential tau-equivalence is not satisfied, then Cronbach’s alpha will tend to underestimate reliability.

Substantial in smaller scales.

The larger the sample size, the greater the confidence.

  • 0.7 = Sample of 400.

  • 0.9 = Sample of 100.

New cards
70
Relationship between number of items and reliability
When the mean inter-item correlation is .30, it can be observed that the level of Cronbach’s alpha rises relatively rapidly from 2 items to about 8 items. 

After about 15 items, the increase is much more gradual and arguably not sufficiently beneficial in most cases.
When the mean inter-item correlation is .30, it can be observed that the level of Cronbach’s alpha rises relatively rapidly from 2 items to about 8 items. 

After about 15 items, the increase is much more gradual and arguably not sufficiently beneficial in most cases.
New cards
71
Sample Homogeneity
A __more homogenous__ sample will yield lower reliability estimates than a __heterogeneous__ sample.

This is because greater homogeneity implies less variance. Less variance implies smaller inter-item correlations, all other things equal.
New cards
72
Coefficient omega (*ω*) 
A modern approach to estimating internal consistency reliability. Introduced by McDonald (1999)

Sometimes referred to as McDonald’s omega. Unlike coefficient alpha, it does not assume essential tau-equivalence.

A lower-bound estimate of reliability.
New cards
73
Standard error of measurement
Represents the amount of error “around” a __point estimate__ in standard deviation form.

A point-estimate is one’s “best guess” of what a person’s score is on the test.

A __confidence interval__ can be estimated around a point-estimate - reflects a range of values that is often interpreted as a range in which the true score is likely to fall.
New cards
74
Is high internal consistency all we want?
While high levels of reliability are desired, they are __not the only consideration__.

Tests with extremely high levels of internal consistency may be __extremely narrow__ in breadth.
New cards
75
Observed score correlation
The correlation you get based on the data you have. Will be compromised to the degree to which there is measurement error in your data.

“Compromised” (or __attenuated__) to the degree that the scores are associated with __less than perfect reliability__.
New cards
76
True score correlation
A hypothetical correlation you can estimate, if you know the reliabilities associated with the scores. Not compromised by measurement error.
New cards
77
R max
The maximum possible correlation between two variables is equal to the square root of the product of their respective reliabilities.
The maximum possible correlation between two variables is equal to the square root of the product of their respective reliabilities.
New cards
78
Correction for Attenuation Formula
What the correlation would have been had the measures provided perfectly reliable scores?

The ratio of the observed correlation to the square root of the product of the reliabilities (i.e., *r*max).
What the correlation would have been had the measures provided perfectly reliable scores?

The ratio of the observed correlation to the square root of the product of the reliabilities (i.e., *r*max).
New cards
79
Validity
The degree to which a test measures __what it is supposed to measure__. M__ost important__ issue in psychological measurement

Or the degree to which evidence and theory support the interpretations of test scores entailed by the proposed uses of a test.

Related to the __proposed uses__ of the scores.
New cards
80
Construct validity 
The degree to which test scores can be interpreted as __reflecting__ a particular psychological __construct__.
New cards
81
Test content
Represents the match between the __actual content__ of the test and the __content that should be__ included in the test.

The __description__ of the nature of the construct should __help define__ the appropriate content of the test.

Includes construct and face validity.
New cards
82
Content Validity
When the items __cover the entire breadth__ of the construct. Items can __not exceed the boundaries__ of the construct.

Constructs such as personality __suffer from a lack of clear boundaries__.

Assessed by experts in field.
New cards
83
Domain sampling theory
If 110 questions in an exam represent a __random assortment__ of material covered across the entire unit, __the correlation__ between performance on a 110 final exam and a 1500 final exam would likely __be very high__.
New cards
84
Standard Error of Estimate
Allows for finding validity of a short form vs a long form of a particular test.
New cards
85
Face Validity
Face validity represents the degree to which the items associated with a measure __appear__ to be related to the construct of interest.

__Not crucial__ from a fundamental psychometric perspective More of a __practical consideration__.

Assessed by non-experts.

People can also respond in a way that they think is __most advantageous__ for them.
New cards
86
Factorial Validity 
The internal structure of a test - its design and what the test actually measures.

1\. The number of dimensions measured by a test.

2\. Whether the items of a test are related to the dimensions of interest.

3\. Whether the dimensions of interest are related to each other.

The ______ (hypothesized structure) of a test can be tested using various __quantitative techniques__, such as factor analysis.

Determining whether the scores of a measure correspond to the __number and nature of dimensions__ of theorized dimensions that underlie the construct of interest.

Can determine what factors “load” onto a dimension of interest.
New cards
87
Response Processes
Should be a __close match__ between the psychological processes that the respondents __actually use__ when completing a measure and the process that they __should use__.

Cheating is an example behaviour that seriously comprises a researcher’s capacity to interpret the scores as valid indicators of performance.
New cards
88
Convergent validity/evidence
The degree to which test scores __are correlated__ with tests of __similar__ constructs. Should be a __positive correlation__ between your self-reported scores and the rater-reported scores (convergent validity).
New cards
89
Discriminant validity/evidence
The degree to which test scores are __uncorrelated__ with tests of unrelated constructs.

It helps to know what a construct __is not__ in the process of its validation. Constructs should not correlate with everything under the sun.
New cards
90
Concurrent Validity Evidence
Observed when scores from one measure correlate in a theoretically meaningful way with the scores of another measure considered to be the “__gold standard__”.

Does not have to based on measures administered __precisely at the same time__ - but the time period __should be very close__.

N__ot particularly impressive__.
New cards
91
Predictive Validity Evidence
The degree to which test scores are correlated with relevant variables that are measured at a __future point in time__.

V__ery impressive__ - relatively rare.
New cards
92
Consequential Validity
S__ocial/personal consequences__ associated with using a particular test.
New cards
93
Criterion Validity
The observation of an association between a psychometric measure and a relevant outcome variable, such as different groups. 

Sometimes referred to as ‘criterion groups validity’.

The __oldest form of validity__.

For example, diagnosed with a clinical disorder versus not diagnosed with a clinical disorder
New cards
94
Induction-Construct Development Interplay
A measure is developed solely from an __inductive perspective__ (essentially through discovery).

Lot of refinement in the model along the way.

Eg. Dictionary example about personality.
New cards
95
Measurement as Theory
The primary objective of validation research is to offer a theoretical explanation of the processes that lead up to the measurement outcome.

Measurement considered a __fundamental theory development__ end in its own right.

Hope/assume that the measures we use yield scores that are valid indicators of the constructs of interest.

Constructs and __well articulated theories__ play a primary role in measurement and in psychology more generally.
New cards
96
Factor analysis

A data analytic technique used to help determine the number and nature of dimensions associated with the scores derived from a test.

  1. Helps us clarify the number of factors within a set of items (or indicators).

  2. Helps us determine the nature of the associations among the factors.

  3. Helps us determine which items are linked to which factor, which facilitates the interpretation of those factors.

New cards
97
Unidimensional test
Consists of items which all measure one, single factor.

Each item is linked to one attribute only.

The degree to which each indicator is linked to the attribute can be estimated with __factor analysis__ or __component analysis__.
New cards
98
Multidimensional test (uncorrelated)
Consists of items which measure two or more dimensions which are unrelated to each other.

N__o link__ between the two dimensions (components/ factors). You would calculate a __separate score__ for each dimension

U__nacceptable__ to calculate a __total score__
New cards
99
Multidimensional test (correlated)
Consists of items which measure two or more dimensions which are correlated with each other (positively or negatively).
New cards
100
Higher-order model.
An overall (global) component is included, instead of a link between the two dimensions.
New cards

Explore top notes

note Note
studied byStudied by 369 people
936 days ago
5.0(5)
note Note
studied byStudied by 3 people
189 days ago
4.0(1)
note Note
studied byStudied by 4 people
32 days ago
5.0(1)
note Note
studied byStudied by 10 people
690 days ago
5.0(1)
note Note
studied byStudied by 42 people
616 days ago
5.0(1)
note Note
studied byStudied by 329 people
694 days ago
5.0(2)
note Note
studied byStudied by 16 people
726 days ago
5.0(1)
note Note
studied byStudied by 423 people
709 days ago
5.0(1)

Explore top flashcards

flashcards Flashcard (36)
studied byStudied by 4 people
786 days ago
5.0(1)
flashcards Flashcard (63)
studied byStudied by 2 people
13 days ago
5.0(1)
flashcards Flashcard (24)
studied byStudied by 864 people
185 days ago
4.5(18)
flashcards Flashcard (77)
studied byStudied by 50 people
25 days ago
5.0(1)
flashcards Flashcard (266)
studied byStudied by 8 people
819 days ago
5.0(1)
flashcards Flashcard (39)
studied byStudied by 49 people
902 days ago
5.0(1)
flashcards Flashcard (36)
studied byStudied by 1 person
172 days ago
5.0(1)
flashcards Flashcard (45)
studied byStudied by 6 people
824 days ago
5.0(1)
robot