LY

Replication Crisis in Psychology (1)

The Replication Crisis in Psychology and the Open Science Movement


Overview of the Crisis

  • The replication crisis refers to recurring challenges in replicating psychological studies, raising questions about the reliability of scientific findings.

  • The Open Science Movement aims to increase transparency and accessibility of research processes and findings in psychology.


Key Events in the Crisis

2011: Significant Papers and Claims

  • Daryl Bem's ESP Study: Published a paper in the Journal of Personality and Social Psychology (JPSP) claiming evidence for extrasensory perception (ESP).

    • The significant effects were criticized for not correcting the familywise error rate, leading to excessive false positives (Type I errors).

Data Fabrication and Faux Findings

  • Diederik Stapel Case: Discovered in 2011 for fabricating data.

  • Overlapping issues in scientific methods resulted in major findings failing replication, including:

    • Ego depletion

    • Embodied cognition

    • Power posing

    • Behavioral priming

Open Science Collaboration (2015)

  • Attempted replicating 100 studies from top psychology journals:

    • Only 39% of original studies were successfully replicated.

    • 25% replication success rate in social psychology versus 50% in cognitive psychology.

    • Effect sizes were frequently overestimated in original studies.


Challenges to Replicability

  • Sampling Variability: Results can vary significantly due to differing sample characteristics.

  • Hidden Moderators: Variables like cultural context and demographics affect outcomes.

  • Low Statistical Power: Contributes to difficulties in replicating findings, with false positives potential in original studies and false negatives in replication attempts.


Publication Bias in Academia

Bias Towards Significant Findings

  • Academic journals preferentially publish:

    1. Significant findings over non-significant ones.

    2. Novel findings with surprising outcomes,

  • Non-significant findings often end up in a researcher’s "file drawer" and never see publication.

Academic Incentives

  • Academia encourages the pursuit of "flashy" results to secure promotions, jobs, and media coverage.

  • This incentivization leads to questionable research practices and increases the prevalence of false positives in published literature.


Error Types and Consequences

False Positives and False Negatives

  • Type I Errors: Incorrectly claiming an effect exists when it doesn't (e.g., p < .05).

  • Type II Errors: Failing to recognize an effect that actually exists.

  • A study by Simmons, Nelson, & Simonsohn (2011) illustrated how easily false positives can be obtained through statistical manipulation.

Implications of False Findings

  • Health research can wrongly indicate effectiveness of treatments, leading to wasted resources and erosion of credibility in psychological science.

  • Increasing false positives creates skepticism around published research.


Methodological Challenges in Research

Questionable Research Practices

  • Researchers may engage in practices that go against scientific integrity, motivated by the need to validate results statistically.

  • Decisions in study conduct, from sample size to handling outliers, impact the chance of achieving significant results.

Data Collection Techniques

  • Commonly, researchers collect data, analyze, and if results are not significant, continue data collection until achieving p < .05。

  • Frequent testing to reach significance increases the likelihood of false positives.


Solutions to Reduce False Positives

Proposed Changes in Research Practices

  • Prespecified Termination Rules: Researchers should determine data collection rules beforehand.

  • Minimum Observations: Recommends at least 20 observations per cell in experimental designs.

  • Disclosure Requirements: Full disclosure of all variables used, including details of failed manipulations and changes in analyses.

Importance of Open Science

  • Open practices can help eliminate selective reporting and increase transparency, improving the credibility of psychological research and findings.


The Role of Hypothesis Development

Bad vs. Good Hypotheses

  • Bad Hypothesis Example: Non-directional claims that lack clarity.

  • Good Hypothesis Example: Directional, stating clear expectations regarding performance discrepancies (e.g., cultural comparisons) with process models included.

Preregistration and Transparency

  • Researchers encouraged to preregister hypotheses and methodologies to ensure rigorous examinations.

  • The use of public repositories for findings helps mitigate the file drawer problem.


Advanced Statistical Methods

Importance of Effect Size

  • Cohen's guidelines on effect sizes indicate necessary sample sizes for detecting small effects.

  • Use tools like G*Power to determine statistical power.

Exploratory vs. Confirmatory Research

  • Distinction made between exploratory (theoretical) work and confirmatory studies; both serve important roles but should be labeled correctly to avoid misrepresentation.


Behavioral Priming as a Case Study

Mechanisms and Outcomes

  • Behavioral (or social) priming influence outcomes without conscious awareness, illustrated by the elderly walking study.

  • It was replicated only under certain expectations.

Research Design Considerations

  • Substantial study designs across various subjects, indicating that significant effects require robust methodologies. Attention needed to verify findings across diverse populations.


Final Thoughts on Science and Innovation

  • Emphasis on the equivalence of replication and innovation in the scientific process.

  • Psychological research is evolving, embracing a more open scientific culture, supported by social media for broader dissemination and engagement.