Statistical Research Vocabulary and Study Design Notes
Observational vs Experimental Research
- Primary purpose of statistical research: answer a research question (examples provided):
- Does aspirin reduce the risk of heart attacks?
- Is one brand of fertilizer more effective at growing roses than another?
- Is fatigue as dangerous to a driver as the influence of alcohol?
- Involves investigation of the relationship between at least two variables.
- Proper study design ensures the resulting data is reliable and accurate.
Observational Research (Non-experimental)
- The researcher merely observes what is currently happening or what has happened in the past (records) and tries to draw conclusions based on these observations.
- Little or no interaction with subjects.
- No manipulation is performed – do not try to change anything about subjects.
- No cause and effect relationship can be determined.
- Typically occurs in natural settings.
- Data collected is often qualitative, but can be quantitative or both.
- Can be expensive and time consuming.
- Prevalent in Social Sciences and Marketing (Business).
- Examples:
- Surveys (telephone, mailed questionnaire, personal interview).
- Case studies.
Experimental Research (aka Experiment)
- The researcher manipulates one of the variables and tries to determine how the manipulation influences other variables.
- Interaction with subject usually occurs; modifications on subject occur.
- May occur in unnatural settings (labs or classrooms).
- Can identify a cause-and-effect relationship.
- Used in sciences such as Sociology, Psychology, Biology, Chemistry, Physics, and Medicine.
- Examples:
- Clinical trials of new medications or treatments.
- Lab experiments.
The Variables
- Independent Variable (aka Explanatory Variable)
- Is the variable or thing that is being manipulated or changed by the researcher.
- In a sense, this variable EXPLAINS what is happening in the experiment.
- Often represented by the letter x.
- Dependent Variable (aka Response Variable)
- Is the result (outcome) of a treatment or manipulation of the independent variable.
- Variable about which the researcher is asking a question.
- Often represented by the letter y.
The Setup
- Experimental Units
- Single object or individual to be measured.
- Also called Subjects.
- Treatments
- Different values of the explanatory variables.
- Typically have at least 2 treatments.
- Control Group
- Group of subjects that do NOT receive any treatment or receive a PLACEBO (neutral or fake treatment).
- Provides a baseline for comparison.
- Treatment Group
- Group of subjects that receive a treatment.
- May have multiple treatment groups (one for each treatment).
- Random Assignment
- Subjects are randomly assigned to a group prior to the start of the experimental research study.
- Blinding
- Subjects do not know to which group they are assigned (BLINDING) to prevent or limit the effects of “suggestion.”
- Researcher and subjects do not know which group is which (DOUBLE BLINDING) to prevent or limit bias in the research.
Caution!!! Lurking Variables
- Lurking variables are variables not included as part of the research, but influence or affect the interpretation of the relationship.
- May be unknown variables and thus not controlled.
Examples (Classification Practice from the Transcript)
- Scenario items to classify observational vs experimental:
- Subjects randomly assigned to two groups; one group given an herb and the other a placebo; after a year, number of respiratory tract infections compared.
- Classification: Experimental (random assignment; placebo control).
- GHC President wants to know the average age of students enrolling in Fall 2020; uses SCORE student records to determine the age of each student.
- Classification: Observational (uses existing records; no manipulation).
- A researcher stands at the entrance to see if the color of the automobile a person drives is related to running red lights.
- Classification: Observational (no manipulation; seeking association).
- A dentist wants to research the effects of a new dental material used to make fillings.
- Classification: Experimental (testing a new material).
Scenario 1 (Aspirin and Heart Attacks) – Page 13 Transcript
- Study design: Four hundred men aged 50–84 recruited and randomly divided into two groups; one group takes aspirin, the other a placebo; one pill per day for three years; final count of men who had heart attacks.
- Identify the variables:
- Explanatory (independent) variable: ext{Treatment assigned} = ext{aspirin vs placebo}
- Response (dependent) variable: ext{occurrence/count of heart attacks in each group}
- Treatments: ext{Aspirin}, ext{Placebo}
- Blinding: The men do not know which pill they take, so this is a blinded study for participants; text specifies blinding of participants, which corresponds to single-blind.
- Note: The transcript does not specify whether researchers are blinded; thus it is not necessarily double-blind based on the given information.
Scenario 2 (Smell and Learning) – Page 14 Transcript
- Study design: Subjects complete mazes multiple times while wearing masks; mazes repeated three times with floral-scented masks and three times with unscented masks; assignment to wear the floral mask first or last three trials is random; recorded metrics include maze completion time and the subject’s impression of the mask’s scent (positive, negative, neutral).
- Identify the variables:
- Explanatory variable: ext{Mask scent condition} = ext{floral vs unscented}
- Response variables:
- Primary: ext{Time to complete the maze (in time units)}
- Secondary: ext{Impression of scent} ext{ ∈ \{positive, negative, neutral\}}
- Treatments: The two scent conditions (floral vs unscented) applied across trials; the random assignment of floral mask to the first three vs last three trials is a design choice to control for order/fatigue effects.
Key Takeaways for Study Design
- Observational studies cannot establish causation due to potential lurking variables and lack of controlled manipulation.
- Experimental studies actively manipulate an independent variable to observe effects on a dependent variable, enabling causal inferences when well-controlled.
- Random assignment helps ensure groups are comparable and reduces selection bias.
- Blinding minimizes bias:
- Blinding (single) = participants are unaware of group assignment.
- Double blinding = both participants and researchers are unaware of group assignment.
- Control groups provide a baseline to compare against treatment groups.
- Lurking variables can confound results if not identified and controlled for; they are variables not included in the study design but that influence the outcome.
- In practice, many real-world studies blend observational and experimental elements, require careful interpretation, and consider ethical, philosophical, and practical implications (e.g., feasibility, consent, risk considerations).
- Independent variable (explanatory): X
- Dependent variable (response): Y
- Treatments are values or levels of the independent variable (e.g., X = ext{Aspirin}, X = ext{Placebo}).
- For a given study, the relationship of interest can be summarized in data such as counts, means, probabilities, etc., often requiring statistical inference to determine if observed differences are statistically significant.