Replication Crisis in Psychology (1)
The Replication Crisis in Psychology and the Open Science Movement
Overview of the Crisis
The replication crisis refers to recurring challenges in replicating psychological studies, raising questions about the reliability of scientific findings.
The Open Science Movement aims to increase transparency and accessibility of research processes and findings in psychology.
Key Events in the Crisis
2011: Significant Papers and Claims
Daryl Bem's ESP Study: Published a paper in the Journal of Personality and Social Psychology (JPSP) claiming evidence for extrasensory perception (ESP).
The significant effects were criticized for not correcting the familywise error rate, leading to excessive false positives (Type I errors).
Data Fabrication and Faux Findings
Diederik Stapel Case: Discovered in 2011 for fabricating data.
Overlapping issues in scientific methods resulted in major findings failing replication, including:
Ego depletion
Embodied cognition
Power posing
Behavioral priming
Open Science Collaboration (2015)
Attempted replicating 100 studies from top psychology journals:
Only 39% of original studies were successfully replicated.
25% replication success rate in social psychology versus 50% in cognitive psychology.
Effect sizes were frequently overestimated in original studies.
Challenges to Replicability
Sampling Variability: Results can vary significantly due to differing sample characteristics.
Hidden Moderators: Variables like cultural context and demographics affect outcomes.
Low Statistical Power: Contributes to difficulties in replicating findings, with false positives potential in original studies and false negatives in replication attempts.
Publication Bias in Academia
Bias Towards Significant Findings
Academic journals preferentially publish:
Significant findings over non-significant ones.
Novel findings with surprising outcomes,
Non-significant findings often end up in a researcher’s "file drawer" and never see publication.
Academic Incentives
Academia encourages the pursuit of "flashy" results to secure promotions, jobs, and media coverage.
This incentivization leads to questionable research practices and increases the prevalence of false positives in published literature.
Error Types and Consequences
False Positives and False Negatives
Type I Errors: Incorrectly claiming an effect exists when it doesn't (e.g., p < .05).
Type II Errors: Failing to recognize an effect that actually exists.
A study by Simmons, Nelson, & Simonsohn (2011) illustrated how easily false positives can be obtained through statistical manipulation.
Implications of False Findings
Health research can wrongly indicate effectiveness of treatments, leading to wasted resources and erosion of credibility in psychological science.
Increasing false positives creates skepticism around published research.
Methodological Challenges in Research
Questionable Research Practices
Researchers may engage in practices that go against scientific integrity, motivated by the need to validate results statistically.
Decisions in study conduct, from sample size to handling outliers, impact the chance of achieving significant results.
Data Collection Techniques
Commonly, researchers collect data, analyze, and if results are not significant, continue data collection until achieving p < .05。
Frequent testing to reach significance increases the likelihood of false positives.
Solutions to Reduce False Positives
Proposed Changes in Research Practices
Prespecified Termination Rules: Researchers should determine data collection rules beforehand.
Minimum Observations: Recommends at least 20 observations per cell in experimental designs.
Disclosure Requirements: Full disclosure of all variables used, including details of failed manipulations and changes in analyses.
Importance of Open Science
Open practices can help eliminate selective reporting and increase transparency, improving the credibility of psychological research and findings.
The Role of Hypothesis Development
Bad vs. Good Hypotheses
Bad Hypothesis Example: Non-directional claims that lack clarity.
Good Hypothesis Example: Directional, stating clear expectations regarding performance discrepancies (e.g., cultural comparisons) with process models included.
Preregistration and Transparency
Researchers encouraged to preregister hypotheses and methodologies to ensure rigorous examinations.
The use of public repositories for findings helps mitigate the file drawer problem.
Advanced Statistical Methods
Importance of Effect Size
Cohen's guidelines on effect sizes indicate necessary sample sizes for detecting small effects.
Use tools like G*Power to determine statistical power.
Exploratory vs. Confirmatory Research
Distinction made between exploratory (theoretical) work and confirmatory studies; both serve important roles but should be labeled correctly to avoid misrepresentation.
Behavioral Priming as a Case Study
Mechanisms and Outcomes
Behavioral (or social) priming influence outcomes without conscious awareness, illustrated by the elderly walking study.
It was replicated only under certain expectations.
Research Design Considerations
Substantial study designs across various subjects, indicating that significant effects require robust methodologies. Attention needed to verify findings across diverse populations.
Final Thoughts on Science and Innovation
Emphasis on the equivalence of replication and innovation in the scientific process.
Psychological research is evolving, embracing a more open scientific culture, supported by social media for broader dissemination and engagement.