Course Title: PY2501 Research Methods and Data Analysis - Questionnaire Design: Items and Scales
Instructor: Prof Andrew Schofield, based on slides by Prof Adrian Burgess
Week 3: Theory of measurement
Week 8: From items to scales
Week 9: Assessing quality of scales and their meaning
Don’t reinvent the wheel: Utilize existing reliable and valid scales when possible.
Reliability and Validity: Critical yet challenging to achieve
Topics Covered:
Transition from items to scales
Creation of items
Scaling responses
Understanding multivariate scales
Psychological traits are multidimensional; fear consists of three components:
Physiological: Example indicators include GSR (Galvanic Skin Response), ECG (Electrocardiogram)
Behavioral: Fight, flight, freeze responses
Cognitive/Affective: Subjective emotional states
Some phenomena require subjective inquiry through direct questions.
Sources for Item Development:
Psychological Theory: Start with robust theoretical frameworks.
Systematic Search: Includes literature review and expert consultation
Define a psychological trait. For example, extroversion:
Questions might include:
"Do you enjoy going to parties?" (Yes/No)
Reversed statements such as "I like to sit quietly and read".
Ensure all questions align with the trait definition, avoiding irrelevant questions about transient states.
Semantic Links:
Explore related terms in dictionaries or web resources.
Expert Consultation: Engage focus groups and informants for input.
Items should encompass the entire spectrum of the domain (e.g., fear of spiders includes behavioral, physiological, cognitive/affective dimensions).
Example items:
“I feel an urge to run away at the sight of a spider.”
Single Focus: Avoid composite questions (e.g., "I like parties and dancing").
Readability: Strive for an 8-year reading age.
Cultural Sensitivity: Consider potential translation or cultural misunderstandings.
Definition: Transform qualitative attributes into quantitative measures.
Two Approaches to Scaling:
Comparative Scaling
Direct Estimation
Techniques include:
Paired Comparison
Thurstone’s Method
Steven’s Scaling
Guttman Scaling
Identifies the key aspects that determine lecturer quality based on preferences.
Assumptions about latent psychological measurement scales and normally distributed weighting estimates.
Pick a small set of relevant items.
Have judges compare each pair.
Convert preferences into z-scores.
Average z-scores for final scaling.
Assess pairs of lecturer attributes for importance.
Example: 60% preference yields a z-score of 0.26.
Final scores convert z-scores to a scaled weight reflecting importance.
Participants provide direct ratings of their experiences (e.g., fear levels).
Various ways to quantify fear:
Dichotomous
Rating scales
Visual analogue scales
Common formats to gauge responses to fear levels through various comparative phrases.
Considerations for how to ask about fear across different timeframes.
Balance between too few and too many response categories.
Importance of clarity and context for adjectives used.
Understand implications of number choices and consistency.
Efficiency of direct estimation versus the reliability of comparative scaling.
Topics covered in the first quiz.
Improves accuracy and reliability, achieving interval level measurement.
Factors affecting observations and reliability calculations.
Errors can cancel out over repeated measurements.
Challenges of repeated questions in psychological contexts due to memory effects.
Discusses random error distributions in psychological measurement.
Rating scales often viewed as ordinal but can be treated as interval under certain conditions.
Theory supporting the treatment of ordinal data as interval data.
Enables precise statistical analysis and inferences.
Importance of item correlation for scale validity and internal consistency.
Correlation as a measure of variance and reliability of item sets.
Ranges of correlation coefficients indicating strength and shared variance.
Homogeneity in items influences scale reliability.
Correlated items validate the formation of a scale.
Methods such as item-total correlation, split-half reliability, and Cronbach’s Alpha.
Thresholds for assessing internal consistency of scales.
Topics covered in the second quiz.
Understanding constructs like personality and intelligence, which consist of multiple factors.
Identifying potential multiple latent factors through correlation inspection.
Statistical approach to explore latent variable representation in scale measurements.