Notes on Sunlight vs. Tomato Yield – Experimental Design

Question Framing and Variable Identification

  • Example framing from the transcript: think in terms of the question, "What is the effect of blank on blank?" to identify the independent and dependent variables.

  • The specific scenario: you currently have tomato plants in an area with shading for two hours per day, yielding an average of 30 tomatoes per plant.

  • Research question derived: what is the effect of more sun on tomato crop yield?

  • Independent variable (what you change): amount of sunlight (hours of sun per day).

  • Dependent variable (what you measure): tomato crop yield (tomatoes per plant).

  • Clear statement from the transcript: more sun is expected to increase tomato yield (the hypothesis).

Independent and Dependent Variables

  • Independent variable: more sun (in hours of sunlight per day).

  • Dependent variable: tomato crop yield (tomatoes per plant).

  • Note: The transcript emphasizes that the independent variable is what you are changing, and the dependent variable is what you are measuring.

  • Example baseline: area partially blocked by shade during 2 hours/day yields 30 tomatoes per plant on average (used as context for design).

Hypothesis

  • Primary hypothesis: putting plants in an area with more sun will increase tomato yield.

  • The hypothesis reflects the assumed positive relationship between sunlight and yield.

Controls in the Experiment

  • Purpose of controls: to isolate the effect of sunlight by ruling out other explanations.

  • Positive control: a condition you already know will work (should yield a positive result).

    • Transcript baseline positive control: shade for 2 hours/day area yields 30 tomatoes per plant on average.

    • Other potential positive controls: multiple positive controls may exist in practice, all expected to produce known results.

  • Negative control: a condition where you expect no effect due to the variable being tested, used to show the effect isn’t due to other factors (placebo/water effect).

    • Transcript example: no sunlight (remove all sunlight) is a negative control, where growth/yield should be minimal or zero; if any yield occurs, it would suggest an invalid result.

  • Purpose of controls: relate outcomes back to the independent variable (sunlight) and the dependent variable (yield).

Experimental Group and Design Rationale

  • Experimental group: the condition where you increase sunlight to test the effect on yield.

  • Specific experimental variations proposed in the transcript:

    • Shade for one hour a day (i.e., reducing shade from 2 to 1 hour, allowing more sun).

    • Shade for zero hours a day (i.e., no shade, full sun).

  • The core idea: keep all other factors constant while varying the amount of sunlight.

  • Variables kept constant (control variables):

    • Soil type/quality

    • Tomato variety

    • Watering schedule

    • Fertilizer type and amount

  • The only change between groups: the amount of sunlight exposure.

  • Expected outcome: test whether increasing sunlight beyond the baseline 2 hours of shade improves tomato yield.

Experimental Setup and Practical Details

  • Current baseline condition: two hours of shade per day with average yield 30 tomatoes per plant.

  • Experimental condition options (relative to baseline):

    • Shade for one hour per day (less shade, more sun).

    • Shade for zero hours per day (no shade, maximum sun).

  • Measurement plan: record tomato yield per plant for each group and compute averages (the transcript uses the phrase "on average" for yield).

  • Replication and measurement: while not explicitly detailed in the transcript, the language implies taking measurements across plants and averaging to obtain a representative yield per condition.

  • Data interpretation approach: compare yields from the experimental groups to the positive and negative controls to assess whether sunlight changes drive yield differences.

Mathematical/Quantitative Representations

  • Simple variable definitions:

    • Let SS denote hours of sunlight per day.

    • Let YY denote tomato yield per plant (tomatoes per plant).

  • Baseline/positive control reference: ar{Y}_{S,baseline} = 30, where baseline corresponds to two hours of shade (i.e., limited sun).

  • Negative control expectation: YS=00,Y_{S=0} \approx 0, i.e., no sunlight leads to little or no yield.

  • General linear relationship (conceptual model, for thinking, not stated explicitly in transcript):

    • Y=a+bS,Y = a + bS, with b > 0 indicating yield increases with more sun (simplified assumption).

  • Hypothesis in formal terms (optional):

    • Null hypothesis: $$H_0: b = 0\