Reliability and Validity
Introduction
- Personal update: Instructor mentions having been sick, leading to class schedule adjustments.
- Acknowledgment of being behind in the course material due to this illness.
Curriculum Adjustments
- Discussion of curriculum changes:
- Removal of one week's topics: Probability and Z-scores.
- Reasoning: Z-scores not needed for writing assignments, but basic understanding of probability is essential.
- Key concept in probability for psychology is the p-value:
- Definition: The p-value indicates the statistical significance of results, where any p-value < 0.05 suggests that results are statistically supported and unlikely to be due to chance.
Reliability and Validity Overview
- Today's focus:
- Concepts of reliability and validity.
- Preparation for the next writing assignment related to reliability.
Psychological Constructs
- Definition of a construct:
- Summarized as what is being measured in psychology (e.g., anxiety defined as excessive worry in absence of stressors).
- Importance of conceptualization and operationalization:
- Conceptualization: The definition of the construct.
- Operationalization: The method used to measure the construct.
- Potential discrepancies between theoretical definitions and practical measurements can occur.
- Illustration of reliability and validity:
- Ensures accurate measurement of constructs.
Psychological Models and Constructs
- Template for a psychological model:
- Items that contribute to a construct (e.g., personality traits measured in assessments).
- Larger measures (e.g., BFI or IPIP) contain multiple items to capture one variable (e.g., extroversion).
- Example of constructs and their predictive capability:
- Conscientiousness as a predictor of GPA, GRE scores, and job performance.
- Hypothesized relationships:
- Positive association between conscientiousness and GPA, GRE scores.
- Possible negative association with actual job performance.
- Implication: High academic performance does not necessarily correlate with job efficacy.
Key Definitions
- Reliability:
- Defined as the consistency of test scores across different testing occasions, versions of the test, or by different raters.
- Types of reliability:
- Test-retest reliability: Similar scores when the same test is given at different times.
- Example: Pretest and posttest in class.
- Parallel forms reliability: Different versions of the same test yield similar scores.
- Internal consistency: Consistency of scores across items within a test that measures the same construct.
- Inter-rater reliability: Consistency between different raters evaluating the same participant's behavior.
- Validity:
- The degree to which a test measures what it is intended to measure.
- Types of validity:
- Content-related validity: Coverage of domain of the construct.
- Criterion-related validity: Correlation with outcome variables.
- Construct-related validity: Relationship with other similar and dissimilar constructs.
Examples of Reliability and Validity
- Reliability Example:
- Measurement could be consistently incorrect (e.g., scales that read 5 pounds heavy).
- Validity Example:
- High reliability with low validity if a measure predicts unrelated outcomes, such as the weight measure predicting anxiety levels.
Diagrams and Visuals for Understanding Reliability and Validity
- Bull's-eye model of validity and reliability:
- Illustrate hitting the target of measurement with various outcomes:
- Accurate and reliable measurement (hits the center).
- Reliable but inaccurate (hits consistently but not at the target).
- Valid but inconsistent (hits the target on average but lacks consistency).
Philosophical Implications of Reliability and Validity
- Importance of reliability and validity before conducting any analyses:
- If constructs aren't reliably measured, inferred conclusions based on these measures may be invalid or misleading.
- Example: Reliably measuring weight as a proxy for anxiety leads to inaccurate conclusions regarding anxiety levels.
Cronbach's Alpha • Reliability Assessment
- Internal consistency is assessed using Cronbach's alpha:
- Symbol: α (alpha symbol).
- Desired range for reliability:
- Acceptable: α > 0.70.
- Strong: 0.80 - 0.89.
- Very strong: α ≥ 0.90.
- Potential issues in calculating Cronbach's alpha include:
- Incorrect data leading to scores exceeding 1;
- Short measures yielding surprisingly high alphas.
Example Data Analysis in Jamovi
- Overview of statistical software used for data analysis (Jamovi).
- Steps to compute reliability:
- Inputting items and reverse coding when necessary.
- Procedures to check outputs for reliability and internal consistency measures.
Writing Assignments Related to Reliability and Validity
- Structure of upcoming assignments and their relation to reliability and validity concepts discussed in class:
- Preparation of papers includes assessment of correlation, mean scores, and considering reliability coefficients in interpretations.
- Need for proper exemplification of reliable but invalid measures.
- Instructions for incorporating outputs and statistics in assignments (e.g., graphs, results).
- APA formatting considerations for reporting results (e.g., spacing with symbols, etc.).
Testing Hypotheses
- Interpretation of results based on reliability and validity assessments:
- Discussion on behaviors that were observed and measured.
- Evaluating correlation coefficients between constructs and their implications for construct validity.
Conclusion
- Summary of core topics and procedures.
- Instructions for next week: peer review and writing workshop.
- Closing remarks due to backtracking on previously covered material.
Appendix
- Example reverse-coded items and related details:
- Reverse coding must be consistent with directions of scoring.
- Recommendations for future explorations of construct validity and correlations among constructs.