RM1 W4: Reliability, Sampling & Ethics

0.0(0)

Studied by 0 people

Call with Kai

Knowt Play

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/49

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

50 Terms

New cards

3 types of reliability:

Across time
Across raters
Across items within the test

New cards

Random error

We may not get identical scores when repeating a measure on the same person

Participant-driven error: mood, hunger, fatigue
Environment-driven error: temperature, noise, time of the day

New cards

Reliability across time (2 procedures)

Test-retest reliability
Parallel-forms reliability

New cards

Test-retest reliability

The extent to which scores on identical measures correlate with e/o when administered at two different times

New cards

General procedure

Administer the test
Get the results
Have the interval/time gap
Repeat step 1&2
Correlate both results

New cards

r-value

The more reliable; same scores observed again and again, the higher the r-value (from 0.00-1.00)

New cards

Limitation

The completion of the 1st test may influence one’s knowledge in completing the subsequent tests

Eg. it is easier to complete an IQ test the 2nd time when we already know the questions

New cards

To fix that limitation, we may use:

Parallel-forms reliability

New cards

Parallel-forms reliability

The extent to which scores on similar, but not identical, measures correlate with e/o when administered at two different times

New cards

General procedure

Administer the test (form A)
Get the results
Have the interval/time gap
Administer the other test (form B)
Get the results
Correlate both results

New cards

Limitation

Expensive to create double the number of tests, and difficult to make sure the two tests are really equivalent

New cards

Reliability estimates are only meaningful when:

the construct does not change over time

New cards

Example: measuring children’s IQ at age 5 and 10

Low reliability (r= 0.5) may not be meaningful as the difference in IQ scores at age 5 vs 10 could be due to changes in intelligence

New cards

Why are changes in intelligence not considered random error?

Because intelligence is the construct we are measuring, and there is a genuine change in the construct between age 5 and 10 that is not due to random error

New cards

Example: BFLM scale administered to a 1 year r/s couple and a 10 year r/s couple

Low reliability (r= 0.3) could be due to the fact that love can change after 10 years

New cards

Time interval

Choose a time interval that makes sense, depending on CONTEXT and what we are measuring

New cards

Example: time interval for BFLM scale

3 day time interval > found low TRR > more likely due to random error (as love cannot change in such a short period) > consider to revise

New cards

When is appropriate to use short/long intervals?

Long: when the construct is more resistant to change eg. personality

Short: when the construct is more susceptible to change eg. moods

New cards

Possible limitation of short intervals

Susceptible to low TRR

Eg. 5 min interval between eating a plate of Hokkien mee and low consistency in taste could be due to fullness

New cards

Two ways to improve TRR & PFR:

Revise your measurement: remove subjective questions, or make them more specific to avoid multiple interpretations —> increase resistance to random error

Administer the measure more times (across the day) and aggregate the scores together

over a series of measurements, the inconsistencies in scores caused by random error should be averaged to 0

New cards

Reliability across raters; inter-rater reliability

The extent to which the ratings of one or more judges correlate with e/o

New cards

Why do we need inter-rater reliability (IRR)?

Observer error may arise as raters may differ in moods, attention, motivation, interests etc

New cards

3 ways to improve low IRR

Train your raters and provide clearer guidelines for ratings
Revise your scale (similar to TRR & PFR)
Have more no. of raters before aggregating the scores

the overestimates and underestimates caused by observer error should be averaged to 0

New cards

Reliability across items within the test; internal consistency

The extent to which the scores of the different items on a scale correlate with e/o, thus measuring the true score

New cards

Why do we need internal consistency?

Most measures consist of >1 item to fulfil content validity, and each item is assumed to measure a part of the total construct

New cards

2 ways to calculate internal consistency

Split-half reliability
Cronbach’s alpha

New cards

Split-half reliability

The extent to which scores between two halves of a scale are correlated

New cards

Procedure example: Odd-even order

One half consists of items 1&3, the other half consists of items 2&4
If the scores are similar for both halves, then the scale has good SHR

New cards

Limitation of using SHR

The 2 halves/versions may not really be equivalent

New cards

To fix that limitation, we may use:

Cronbach’s alpha

New cards

Cronbach’s alpha

An estimation of the average correlation among all the items on the scale = the average of all possible SHR outcomes

New cards

What is a suitable Cronbach’s alpha score for an acceptable scale?

>0.7

New cards

Relationship between reliability and validity

Reliability is a pre-requisite for validity; a measure must first be reliable then it can be valid, but a measure can be reliable without being valid

New cards

Step 4a

Select your sample

New cards

Selection bias

When the sampling method favours the selection of some individuals over other

New cards

2 categories of Sampling methods:

Probability sampling
Non-probability sampling

New cards

When to use either sampling methods?

Probability sampling: when the entire population is known, sampling occurs through an unbiased and equal-chance selection process

Non-probability sampling: when the population is not completely known, sampling is guided by common sense or convenience, while maintaining representativeness and avoiding bias

New cards

5 types of probability sampling methods:

Random sampling
Systematic sampling
Stratified random sampling
Proportionate stratified sampling
Cluster sampling

New cards

2 types of non-probability sampling methods:

Convenience sampling
Quota sampling

New cards

When to use convenience vs quota sampling?

Convenience: when it doesn’t matter if there is any group differences in the variable of interest eg. males or females will produce the same results, hence numbers do not matter
Quota: when you believe there are established group differences in your variable of interest eg. males and females will produce different results, hence numbers matter

New cards

Is this probability or non-probability sampling?

Non-probability - not everyone has an equal chance of being selected because this survey is only conducted on one day, those who did not take mrt/bus that day were ntot able to take part

New cards

Which sampling method is more employed?

Nonprobability, due to its convenience and much lower cost &

Probability sampling requires every individual in a population to be known and to have equal chance of being selected, which is difficult

New cards

What is the standard sample size for accurate generalisation?

>50 individuals from each group

New cards

Step 4b

Prepare for ethics approval

New cards

4 elements of ethical procedure involving human participants:

No harm
Informed consent
Privacy and confidentiality
Debriefing

New cards

No harm

Researchers are to avoid harming subjects, and to minimise harm when it is foreseeable and unavoidable

Eg. Little Albert experiment was super harmful

New cards

Informed consent

Researchers must obtain consent from participants

typical containing: the purpose of the research & expected duration, study procedure, any prospective research benefits, potential risks, limits of confidentiality, contact details, and participants rights to decline and withdraw from the research

New cards

Privacy and confidentiality

Researchers are obligated to take precautions to protect confidential information

Practising anonymity for publication: ensuring subjects’ names are not directly associated with any information or measurements obtained from them

New cards

Debriefing

Researchers must quickly let participants know about the nature, results, and conclusions of the research, especially necessary when there is a cover story

New cards

The role of The Institutional Review Board (IRB)

Reviews all research w.r.t. human treatment and their approval must be obtained before conducting any real research