Notes on Sunlight vs. Tomato Yield – Experimental Design
Question Framing and Variable Identification
Example framing from the transcript: think in terms of the question, "What is the effect of blank on blank?" to identify the independent and dependent variables.
The specific scenario: you currently have tomato plants in an area with shading for two hours per day, yielding an average of 30 tomatoes per plant.
Research question derived: what is the effect of more sun on tomato crop yield?
Independent variable (what you change): amount of sunlight (hours of sun per day).
Dependent variable (what you measure): tomato crop yield (tomatoes per plant).
Clear statement from the transcript: more sun is expected to increase tomato yield (the hypothesis).
Independent and Dependent Variables
Independent variable: more sun (in hours of sunlight per day).
Dependent variable: tomato crop yield (tomatoes per plant).
Note: The transcript emphasizes that the independent variable is what you are changing, and the dependent variable is what you are measuring.
Example baseline: area partially blocked by shade during 2 hours/day yields 30 tomatoes per plant on average (used as context for design).
Hypothesis
Primary hypothesis: putting plants in an area with more sun will increase tomato yield.
The hypothesis reflects the assumed positive relationship between sunlight and yield.
Controls in the Experiment
Purpose of controls: to isolate the effect of sunlight by ruling out other explanations.
Positive control: a condition you already know will work (should yield a positive result).
Transcript baseline positive control: shade for 2 hours/day area yields 30 tomatoes per plant on average.
Other potential positive controls: multiple positive controls may exist in practice, all expected to produce known results.
Negative control: a condition where you expect no effect due to the variable being tested, used to show the effect isn’t due to other factors (placebo/water effect).
Transcript example: no sunlight (remove all sunlight) is a negative control, where growth/yield should be minimal or zero; if any yield occurs, it would suggest an invalid result.
Purpose of controls: relate outcomes back to the independent variable (sunlight) and the dependent variable (yield).
Experimental Group and Design Rationale
Experimental group: the condition where you increase sunlight to test the effect on yield.
Specific experimental variations proposed in the transcript:
Shade for one hour a day (i.e., reducing shade from 2 to 1 hour, allowing more sun).
Shade for zero hours a day (i.e., no shade, full sun).
The core idea: keep all other factors constant while varying the amount of sunlight.
Variables kept constant (control variables):
Soil type/quality
Tomato variety
Watering schedule
Fertilizer type and amount
The only change between groups: the amount of sunlight exposure.
Expected outcome: test whether increasing sunlight beyond the baseline 2 hours of shade improves tomato yield.
Experimental Setup and Practical Details
Current baseline condition: two hours of shade per day with average yield 30 tomatoes per plant.
Experimental condition options (relative to baseline):
Shade for one hour per day (less shade, more sun).
Shade for zero hours per day (no shade, maximum sun).
Measurement plan: record tomato yield per plant for each group and compute averages (the transcript uses the phrase "on average" for yield).
Replication and measurement: while not explicitly detailed in the transcript, the language implies taking measurements across plants and averaging to obtain a representative yield per condition.
Data interpretation approach: compare yields from the experimental groups to the positive and negative controls to assess whether sunlight changes drive yield differences.
Mathematical/Quantitative Representations
Simple variable definitions:
Let denote hours of sunlight per day.
Let denote tomato yield per plant (tomatoes per plant).
Baseline/positive control reference: ar{Y}_{S,baseline} = 30, where baseline corresponds to two hours of shade (i.e., limited sun).
Negative control expectation: i.e., no sunlight leads to little or no yield.
General linear relationship (conceptual model, for thinking, not stated explicitly in transcript):
with b > 0 indicating yield increases with more sun (simplified assumption).
Hypothesis in formal terms (optional):
Null hypothesis: $$H_0: b = 0\