Scientific Method in Psychology: Overview
- The scientific method is broadly similar across the sciences (biology, chemistry, materials science) and also applies to psychology and the social sciences.
- Goal: maximize objectivity and accuracy, recognizing that researchers are human and not perfectly objective.
- Core process (cyclical and iterative):
- Form a research question of interest.
- Develop predictions or hypotheses about how the world works.
- Ground these in prior research and the current state of the field.
- Test hypotheses by collecting data via rigorous methods.
- Analyze the data to draw conclusions about how the world works.
- Communicate findings so others can use, critique, or replicate them.
- Before conducting research, ethical and respectful study of people is required (human subjects research).
- Foundational ethics frameworks and governance:
- Belmont Report (developed in the 1970s) guides ethical research with human participants.
- Federal oversight comes from bodies such as the U.S. Department of Health and Human Services (HHS).
- Institutional Review Board (IRB): reviews all research at a given institution to ensure ethical conduct and protection of human rights.
- Guidelines apply to both federally funded and non-federally funded research at universities and research institutes.
- Core ethical pillars (brief): informed consent, confidentiality, and debriefing.
- Informed consent: participants must understand what they are getting into; information presented at an appropriate developmental level; participants can withdraw at any time without penalty.
- Confidentiality: data are kept private; responses may be anonymized or identified only by codes; data breach protections.
- Special concerns for vulnerable populations (infants, children, prisoners, pregnant people and unborn fetuses).
- Debriefing: after participation, researchers disclose the study’s purpose and methods; particularly important when deception is used to prevent bias (e.g., social desirability bias).
- Deception and social desirability bias:
- Deception may be used to reduce bias, but must be followed by thorough debriefing.
- Social desirability bias: participants may report what they think sounds better rather than what they actually think/feel.
- Ethical considerations in lifespan developmental research:
- Children require additional protections; legal guardians provide consent for minors; assent (child agreement) is sought when possible.
- Distress-inducing procedures are carefully considered and justified as part of everyday experiences (e.g., attachment studies).
- Pregnant people and unborn fetuses require added protections to safeguard health and well-being.
- Historical context: ethical guidelines exist because past research failures were harmful; current guidelines aim to prevent such abuses.
- Reconnecting to the scientific method in practice:
- Start with a broad topic of interest; identify gaps in knowledge; ensure feasibility and ethics; replicate where possible to strengthen evidence.
- Understand different study designs and how they impact validity.
Key Concepts in Validity and Study Design
- Validity: the extent to which a study or measure reflects what it intends to measure.
- Ecological validity: the accuracy with which findings reflect real-life processes and contexts.
- Internal validity: the likelihood that observed effects reflect a true causal relationship rather than confounds.
- Trade-off: some designs improve ecological validity but reduce internal validity, and vice versa. Different questions may require different balances.
- Only one type of design can establish causality: the experimental design.
- Descriptive research: aims to describe what is happening (sometimes treated as a third category alongside correlational and experimental designs).
- Correlational designs:
- Study two factors as they occur naturally, without manipulation.
- High ecological validity, but low internal validity (correlation does not imply causation).
- Possible third variables or bidirectional influences.
- Experimental designs:
- Manipulate an independent variable (IV) and measure a dependent variable (DV).
- Conducted under controlled conditions (often in a lab).
- Higher internal validity, enabling causal inferences; ecological validity can be lower due to artificial settings.
- Variables: IV (manipulated) and DV (measured).
- Control of extraneous variables to isolate the effect of the IV.
- Random assignment:
- Participants are randomly assigned to two or more groups to balance individual differences across groups.
- Example: IV = eating before an exam (eat vs. do not eat); DV = exam performance.
- Random assignment helps ensure that confounding variables are evenly distributed across groups (see the simulation sketch at the end of this section).
- Field experiments vs laboratory experiments:
- Field experiments: conducted in natural settings; greater ecological validity but less experimental control.
- Lab experiments: conducted in controlled environments; higher internal validity but potentially lower ecological validity.
- Interventions and longitudinal designs:
- Intervention programs can be tested in experiments or quasi-experimental designs.
- Longitudinal studies track the same participants over time to observe change.
- Quasi-experiments and natural experiments:
- Quasi-experiments: lack random assignment; groups naturally differ (e.g., different cultures or policy changes).
- Natural experiments: events outside the researcher’s control are used to study effects (e.g., policy changes, or the COVID-19 pandemic as a naturally occurring disruption).
- Mixed-methods and meta-analysis:
- Mixed designs combine elements from different designs to suit the question.
- Meta-analysis statistically combines results (effect sizes) from multiple studies to estimate overall effects and improve generalizability.
- Cross-cultural and diverse sampling:
- Aim to move beyond WEIRD samples to improve representativeness.
- The WEIRD acronym (extremely common in psychology) stands for Western, Educated, Industrialized, Rich, and Democratic; such samples have also historically been predominantly white, and researchers are increasingly cautious about overgeneralizing from them.
- Strategies to diversify samples include oversampling underrepresented groups, collaborating across institutions, and using national datasets for representative samples.
- Cross-cultural research cautions:
- Avoid lumping diverse groups into broad labels (e.g., “Asians,” “Latinos”) as if they are homogeneous.
- Historically, white European American samples often served as a default comparison group, which is now discouraged.
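The contrast between correlational and experimental logic can be made concrete with a small simulation. The sketch below is illustrative only: the variables (hours_slept, ate_breakfast, exam_score), the effect sizes, and the choice of sleep as the confound are all hypothetical and not from the session. It shows how a third variable can produce a correlation between eating before an exam and exam scores even when eating itself has no effect, and how random assignment of the IV breaks that link.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 10_000

# --- Correlational (observational) data: a third variable drives both ---
# Hypothetical confound: people who sleep more are both more likely to eat
# before the exam and more likely to score well; eating itself does nothing.
hours_slept = rng.normal(7, 1, n)
ate_breakfast = (hours_slept + rng.normal(0, 1, n)) > 7          # choice influenced by sleep
exam_score = 60 + 3 * hours_slept + rng.normal(0, 5, n)          # score influenced by sleep only

r = np.corrcoef(ate_breakfast.astype(float), exam_score)[0, 1]
print(f"Observational correlation (eating vs. score): r = {r:.2f}")  # nonzero, yet not causal

# --- Experimental data: random assignment to eat / not eat ---
# Randomly assigning the IV makes it independent of the confound, so any
# group difference in the DV reflects the IV itself (here, no true effect).
assigned_to_eat = rng.permutation(np.repeat([True, False], n // 2))
exam_score_exp = 60 + 3 * hours_slept + rng.normal(0, 5, n)

diff = exam_score_exp[assigned_to_eat].mean() - exam_score_exp[~assigned_to_eat].mean()
print(f"Experimental group difference (eat minus no-eat): {diff:.2f}")  # close to zero
```

Because random assignment makes group membership statistically independent of the confound, the simulated group difference hovers near zero, matching the true (null) causal effect of eating in this toy example.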
Developmental Designs: How to Study Change Over Time
- Cross-sectional design:
- Recruit participants of different ages at the same time.
- Pros: quick, relatively inexpensive; provides a snapshot across ages.
- Cons: cohort effects—age groups may differ due to historical/cultural experiences rather than developmental processes.
- Example discussed: COVID-19 pandemic affecting children at different ages could confound age-related development with pandemic experiences.
- Longitudinal design:
- Follow the same group of individuals over time.
- Pros: controls for cohort effects; reveals developmental trajectories.
- Cons: time-consuming, expensive; cross-generational generalizability limited (e.g., findings may not generalize to future cohorts); attrition can be an issue.
- Cross-sequential design:
- Combines cross-sectional and longitudinal approaches.
- Recruit cohorts at different ages and follow each for a shorter period (a schematic sketch appears at the end of this section).
- Pros: balances time and cohort concerns.
- Cons: more complex logistics and analysis.
- Microgenetic design:
- Intensive, moment-to-moment study of change over a short period when a developmental change is occurring.
- Focuses on mechanisms and processes driving change (e.g., puberty, rapid skill acquisition).
- Typically produces a large amount of data over a brief window.
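To make the timing-based designs easier to compare, here is a minimal schematic sketch; the birth cohorts (2010, 2013, 2016) and testing years (2020, 2022, 2024) are invented purely for illustration. Reading down one column of the printed grid is a cross-sectional comparison, reading across one row is longitudinal, and using the whole grid is the cross-sequential design.

```python
# Hypothetical cross-sequential plan: three birth cohorts, each tested three times.
birth_years = [2010, 2013, 2016]        # cohorts recruited at different starting ages
testing_years = [2020, 2022, 2024]      # shared measurement occasions

print("Cohort | " + " | ".join(f"age in {y}" for y in testing_years))
for birth in birth_years:
    ages = [year - birth for year in testing_years]
    print(f"{birth}   | " + " | ".join(f"{age:11d}" for age in ages))

# One column  -> different ages measured at the same time (cross-sectional).
# One row     -> the same cohort followed over time (longitudinal).
# Whole grid  -> cross-sequential: age and cohort effects can be disentangled.
```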
Sampling and Representativeness in Developmental Research
- Participant selection and representativeness:
- Researchers often focus on a narrower age range or specific cultural groups relevant to the question.
- Samples should be representative of the population of interest; no single study can include every individual.
- Within-group diversity matters (e.g., even within Chinese Americans there are varied experiences).
- Historical biases in samples:
- Early psychology studies often used male college students and white, middle-to-upper-class participants; now recognized as problematic and insufficient for generalization.
- WEIRD populations and beyond:
- WEIRD: Western, Educated, Industrialized, Rich, Democratic; criticized for not representing global diversity.
- Researchers respond by oversampling underrepresented groups, collaborating across universities, using large national datasets (e.g., NICHD data), and conducting cross-cultural research (see the weighting sketch at the end of this section).
- Cross-cultural research practices:
- Use culturally appropriate measurement and language; avoid assuming identical constructs map across cultures.
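When an underrepresented group has been deliberately oversampled, a standard companion step at analysis time (not covered in the session itself) is to weight estimates back toward the population proportions. The sketch below is a minimal illustration with invented group labels, scores, and population shares; it simply contrasts an unweighted sample mean with a population-weighted mean.

```python
# Hypothetical example: Group B is 10% of the population but was oversampled
# to 50% of the study so that its own estimates are stable.
population_share = {"A": 0.90, "B": 0.10}
sample_scores = {
    "A": [72, 75, 78, 74, 71, 77, 73, 76, 75, 74],   # 10 participants
    "B": [60, 62, 65, 61, 63, 64, 62, 66, 60, 63],   # 10 participants (oversampled)
}

all_scores = [s for scores in sample_scores.values() for s in scores]
unweighted_mean = sum(all_scores) / len(all_scores)

# Weight each group's mean by its share of the population, not of the sample.
group_means = {g: sum(v) / len(v) for g, v in sample_scores.items()}
weighted_mean = sum(population_share[g] * group_means[g] for g in group_means)

print(f"Unweighted sample mean:   {unweighted_mean:.1f}")  # pulled toward the oversampled group
print(f"Population-weighted mean: {weighted_mean:.1f}")    # closer to the population value
```

Real projects derive survey weights from the sampling design and census benchmarks; the point here is only that oversampling changes the composition of the sample, not of the population.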
Data Collection Methods and Measurement in Developmental Research
- Surveys and questionnaires:
- Structured measures with fixed items; typically self-report, administered on paper or online.
- Risks: social desirability bias; responses may be influenced by wording and context.
- Structured vs unstructured interviews:
- Structured: fixed questions; easier to compare across participants.
- Unstructured: flexible, clinical approach; tailored to participant; useful for capturing individual differences, especially with children.
- Downsides: harder to compare; potential for confirmation bias or leading questions.
- Observational methods:
- Naturalistic observations: in participants’ natural environments (e.g., video-recorded family dinners at home); participants know they are observed, with data kept confidential.
- Structured observations: conducted in labs or controlled environments; scenarios are set up and video-recorded for later coding.
- Behavior coding: trained researchers code specific behaviors from video or live observation.
- Attachment research example (structured observation):
- Mary Ainsworth’s Strange Situation procedure is used to assess the quality of a child’s attachment to the primary caregiver.
- Typical procedure: the caregiver and child play; a stranger enters; the caregiver leaves the room and later returns; the child’s reactions to separation and reunion are coded (e.g., crying, clinging, ease of being comforted).
- Physiological and biological measures:
- EEG (electroencephalography), recorded from electrodes placed on the head (often in a cap);
- MRI (brain imaging);
- Saliva or blood samples (e.g., to measure hormone levels).
- These measures can provide objective data about physiological processes related to development.
- Practical and ethical considerations for measures:
- Measures must be valid (measure what they are intended to measure).
- Some measures (especially surveys) can be influenced by wording and administration; validity concerns arise if items don’t reflect the construct well.
- Reliability: consistency of measurements over time (test-retest) and across observers (inter-rater reliability).
- Reliability specifics:
- Test-retest reliability: stability of a measure across time when the underlying trait is stable.
- Inter-rater reliability: agreement among different coders or observers when coding behavior (a minimal computational sketch follows this list).
- Data privacy and ethics in measurement:
- Ensure confidentiality and control access to sensitive data; deidentify data with codes instead of names.
- Be mindful of potential harms from data disclosure (e.g., sensitive personal or legal information).
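Both reliability indices described above can be estimated in a few lines. In the sketch below the questionnaire scores and behavior codes are fabricated for illustration; test-retest reliability is estimated as the Pearson correlation between time-1 and time-2 scores, and inter-rater reliability as simple percent agreement between two coders (published work often reports Cohen's kappa instead, which corrects for chance agreement).

```python
import numpy as np

# --- Test-retest reliability: the same participants measured twice ---
time1 = np.array([10, 14, 9, 21, 17, 12, 15, 19])   # hypothetical questionnaire scores
time2 = np.array([11, 13, 10, 20, 18, 13, 14, 18])  # same people, a few weeks later
test_retest_r = np.corrcoef(time1, time2)[0, 1]
print(f"Test-retest reliability (Pearson r): {test_retest_r:.2f}")

# --- Inter-rater reliability: two coders rate the same video clips ---
coder_a = ["cry", "cling", "play", "cry", "play", "cling", "play", "cry"]
coder_b = ["cry", "cling", "play", "play", "play", "cling", "play", "cry"]
agreement = np.mean([a == b for a, b in zip(coder_a, coder_b)])
print(f"Inter-rater agreement (proportion): {agreement:.2f}")
```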
From Data to Analysis: Reading and Reporting Results
- After data collection, the next step is data analysis.
- The instructor briefly notes that statistical analysis is a large topic and not covered in depth in this session.
- In practice, analysis involves choosing appropriate statistical tests based on the design (correlational, experimental, longitudinal, etc.), checking assumptions, and interpreting results in light of validity and reliability concerns, as illustrated in the sketch below.
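As one concrete instance of matching a test to a design, the minimal sketch below applies an independent-samples t-test to the earlier hypothetical experiment (participants randomly assigned to eat vs. not eat before an exam). The scores are simulated, and the use of scipy is simply an assumption about available tooling, not something specified in the session.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)

# Hypothetical exam scores for the two randomly assigned groups.
ate_before_exam = rng.normal(78, 8, 40)      # IV level 1
no_food_before_exam = rng.normal(74, 8, 40)  # IV level 2

# Independent-samples t-test: appropriate for comparing two independent
# groups on a continuous DV in a between-subjects experimental design.
t_stat, p_value = stats.ttest_ind(ate_before_exam, no_food_before_exam)
print(f"t = {t_stat:.2f}, p = {p_value:.3f}")
```

A within-subjects or longitudinal design would instead call for a paired test or a repeated-measures/multilevel model; the general point is that the design determines the test.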
Key Takeaways and Practical Implications
- Always ground questions in existing literature and identify knowledge gaps that are feasible and ethically permissible to study.
- Consider both internal and ecological validity when designing a study; sometimes you trade one for the other depending on the research question.
- Understand the strengths and limitations of different designs: correlational designs for ecological validity, experiments for causal inference, and longitudinal approaches for studying development over time.
- Always plan sampling carefully to ensure representativeness and consider WEIRD biases; diversify samples when possible.
- Use multiple methods of data collection to triangulate findings (surveys, interviews, observations, physiological measures) where appropriate.
- Ethics are not optional: obtain informed consent/assent, protect confidentiality, minimize risk, and provide debriefing; special protections exist for vulnerable populations.
- When reporting results, link back to feasibility, ethical considerations, and the broader implications for practice, policy, and further research.
Equations, Symbols, and Notable Formulas (LaTeX)
- Correlation vs causation concept: $r \neq \text{causation}$
- Independent variable (IV) and dependent variable (DV) relationship (conceptual): $\text{DV} = f(\text{IV}, \text{controls})$
- Random assignment concept (probabilistic grouping): $P(\text{Group } g) = \frac{1}{k}, \text{ for } g = 1, \ldots, k$
- Validity definitions (conceptual forms): $\text{Ecological Validity} = \text{real-world applicability of findings}$; $\text{Internal Validity} = \text{confidence that observed effects are due to the IV}$
- Cross-sectional vs longitudinal notation (conceptual): $\text{Cross-sectional}$: multiple ages at one time; $\text{Longitudinal}$: same individuals over time
- Microgenetic design (concept): $\text{Microgenetic design} \rightarrow \text{intense data during rapid development}$
- WEIRD acronym (conceptual representation): $\text{WEIRD} = \{\text{Western}, \text{Educated}, \text{Industrialized}, \text{Rich}, \text{Democratic}\}$