Statistical Research Vocabulary and Study Design Notes

Observational vs Experimental Research

Primary purpose of statistical research: answer a research question (examples provided):
- Does aspirin reduce the risk of heart attacks?
- Is one brand of fertilizer more effective at growing roses than another?
- Is fatigue as dangerous to a driver as the influence of alcohol?
Involves investigation of the relationship between at least two variables.
Proper study design ensures the resulting data is reliable and accurate.

Observational Research (Non-experimental)

The researcher merely observes what is currently happening or what has happened in the past (records) and tries to draw conclusions based on these observations.
Little or no interaction with subjects.
No manipulation is performed – do not try to change anything about subjects.
No cause and effect relationship can be determined.
Typically occurs in natural settings.
Data collected is often qualitative, but can be quantitative or both.
Can be expensive and time consuming.
Prevalent in Social Sciences and Marketing (Business).
Examples:
- Surveys (telephone, mailed questionnaire, personal interview).
- Case studies.

Experimental Research (aka Experiment)

The researcher manipulates one of the variables and tries to determine how the manipulation influences other variables.
Interaction with subject usually occurs; modifications on subject occur.
May occur in unnatural settings (labs or classrooms).
Can identify a cause-and-effect relationship.
Used in sciences such as Sociology, Psychology, Biology, Chemistry, Physics, and Medicine.
Examples:
- Clinical trials of new medications or treatments.
- Lab experiments.

The Variables

Independent Variable (aka Explanatory Variable)
- Is the variable or thing that is being manipulated or changed by the researcher.
- In a sense, this variable EXPLAINS what is happening in the experiment.
- Often represented by the letter x.
Dependent Variable (aka Response Variable)
- Is the result (outcome) of a treatment or manipulation of the independent variable.
- Variable about which the researcher is asking a question.
- Often represented by the letter y.

The Setup

Experimental Units
- Single object or individual to be measured.
- Also called Subjects.
Treatments
- Different values of the explanatory variables.
- Typically have at least 2 treatments.
Control Group
- Group of subjects that do NOT receive any treatment or receive a PLACEBO (neutral or fake treatment).
- Provides a baseline for comparison.
Treatment Group
- Group of subjects that receive a treatment.
- May have multiple treatment groups (one for each treatment).
Random Assignment
- Subjects are randomly assigned to a group prior to the start of the experimental research study.
Blinding
- Subjects do not know to which group they are assigned (BLINDING) to prevent or limit the effects of “suggestion.”
- Researcher and subjects do not know which group is which (DOUBLE BLINDING) to prevent or limit bias in the research.

Caution!!! Lurking Variables

Lurking variables are variables not included as part of the research, but influence or affect the interpretation of the relationship.
May be unknown variables and thus not controlled.

Examples (Classification Practice from the Transcript)

Scenario items to classify observational vs experimental:
- Subjects randomly assigned to two groups; one group given an herb and the other a placebo; after a year, number of respiratory tract infections compared.
- Classification: Experimental (random assignment; placebo control).
- GHC President wants to know the average age of students enrolling in Fall 2020; uses SCORE student records to determine the age of each student.
- Classification: Observational (uses existing records; no manipulation).
- A researcher stands at the entrance to see if the color of the automobile a person drives is related to running red lights.
- Classification: Observational (no manipulation; seeking association).
- A dentist wants to research the effects of a new dental material used to make fillings.
- Classification: Experimental (testing a new material).

Scenario 1 (Aspirin and Heart Attacks) – Page 13 Transcript

Study design: Four hundred men aged 50–84 recruited and randomly divided into two groups; one group takes aspirin, the other a placebo; one pill per day for three years; final count of men who had heart attacks.
Identify the variables:
- Explanatory (independent) variable: ext{Treatment assigned} = ext{aspirin vs placebo}
- Response (dependent) variable: ext{occurrence/count of heart attacks in each group}
- Treatments: ext{Aspirin}, ext{Placebo}
Blinding: The men do not know which pill they take, so this is a blinded study for participants; text specifies blinding of participants, which corresponds to single-blind.
- Note: The transcript does not specify whether researchers are blinded; thus it is not necessarily double-blind based on the given information.

Scenario 2 (Smell and Learning) – Page 14 Transcript

Study design: Subjects complete mazes multiple times while wearing masks; mazes repeated three times with floral-scented masks and three times with unscented masks; assignment to wear the floral mask first or last three trials is random; recorded metrics include maze completion time and the subject’s impression of the mask’s scent (positive, negative, neutral).
Identify the variables:
- Explanatory variable: ext{Mask scent condition} = ext{floral vs unscented}
- Response variables:
- Primary: ext{Time to complete the maze (in time units)}
- Secondary: ext{Impression of scent} ext{ ∈ \{positive, negative, neutral\}}
- Treatments: The two scent conditions (floral vs unscented) applied across trials; the random assignment of floral mask to the first three vs last three trials is a design choice to control for order/fatigue effects.

Key Takeaways for Study Design

Observational studies cannot establish causation due to potential lurking variables and lack of controlled manipulation.
Experimental studies actively manipulate an independent variable to observe effects on a dependent variable, enabling causal inferences when well-controlled.
Random assignment helps ensure groups are comparable and reduces selection bias.
Blinding minimizes bias:
- Blinding (single) = participants are unaware of group assignment.
- Double blinding = both participants and researchers are unaware of group assignment.
Control groups provide a baseline to compare against treatment groups.
Lurking variables can confound results if not identified and controlled for; they are variables not included in the study design but that influence the outcome.
In practice, many real-world studies blend observational and experimental elements, require careful interpretation, and consider ethical, philosophical, and practical implications (e.g., feasibility, consent, risk considerations).

Formulas and Notation Recap

Independent variable (explanatory): X
Dependent variable (response): Y
Treatments are values or levels of the independent variable (e.g., X = ext{Aspirin}, X = ext{Placebo}).
For a given study, the relationship of interest can be summarized in data such as counts, means, probabilities, etc., often requiring statistical inference to determine if observed differences are statistically significant.