stats

Statistics for Advanced Experimental Design

Key Formulas and Concepts

Mean

Sample Mean: ( x̄ = (Σ xᵢ) / n )Where:(n) = size of the sample(xᵢ) = individual sample values

Summarizes entire sample
Might provide an estimate of entire population’s true mean
Unreliable w/ high variance/outliers

Standard Deviation

Sample Standard Deviation: ( S = √(Σ (x - x̄)² / (n - 1)) )
- Aka: closeness/variability of data to the mean
Larger standard deviation indicates more variability, which implies that other variables influence the dependent variable
Smaller standard deviation indicates clustering around the mean, suggesting direct effects from the independent variable
- better justification that x causes y
± 1s = 68% of data
± 2s = 95% of data
± 3s = 99% of data

Standard Error of the Mean

( SEM = S / √n )
Indicates how well the sample mean matches the true population mean.
- determine confidence in data collected in sample
- estimates how much the sample mean is likely to vary in repeated experiments

Chi-Square Test

( χ² = Σ ((O - E)² / E) )Where:(O) = observed results(E) = expected results
- Sum of ((observed - expected) squared) / expected
- Degrees of freedom (df) = # of categories - 1
  - compare calculated value to critical value (p value) in chi-square distribution table

Degrees of Freedom

Calculated as: For Chi-Square: ( df = k - 1 ) (k = number of categories)

Important Statistical Insights

Statistics are essential for:

Understanding sample data meaning
Drawing conclusions from data
Supporting scientific arguments
Estimating data reliability
Effectively communicating findings

Advanced Statistical Tests

T-Test

A statistical test comparing means between two groups.
- determine if observed effect reflect true population characteristics rather than sampling error (is it significant or due to chance/random ?)

Types:

Independent T-Test - Compares separate groups (e.g., control vs. treatment).
Paired T-Test - Compares the same group before/after treatment.

Hypotheses:

Null Hypothesis: No significant difference exists.
Alternative Hypothesis: Significant difference exists based on control vs. treatment.

Significance level: p = 0.05 → p < 0.05 then there is a significant difference → reject null hypothesis

ANOVA (Analysis of Variance)

Used when comparing means of more than two groups.

Helps determine if differences are due to chance. (like T-test)

Hypotheses:

Null Hypothesis: No significant difference among groups.
Alternative Hypothesis: Significant difference exists.

p-value

p < 0.05 → reject null + significant difference exists

Chi-Square Test

Compares observed data to expected data.

determines if differences are due to chance
represented by χ²
only used w/ numerical data / counting frequencies
- NOT percentages or proportions

Hypotheses:

Null Hypothesis: No significant difference between observed and expected values.
Alternative Hypothesis: Significant differences exist.

Interpreting Results:

If calculated ( χ² ) is less than critical value (p < 0.05): Accept null hypothesis.
If greater than critical value (p > 0.05): Reject null hypothesis.