BIO 182: Scientific Method and Statistics
Introduction to the Scientific Method
The lecture focuses on the scientific method and its application in biological studies.
The course leverages speculative fiction, particularly science fiction, to stimulate curiosity about biological concepts.
Speculative Fiction: Often used to explore the human condition and ask 'what if' questions, aiming to entertain and critique society.
The use of interactive technologies like Dreamscape aims to make learning engaging.
Importance of Statistics in Biology
Statistics is crucial for biologists to analyze and validate data.
Historical context:
In the 18th century, nations began keeping better demographic information.
Early statistics development linked with gambling, as both fields involve making predictions based on limited information.
Variability in biological data presents challenges, leading to the regular use of statistics in biological research.
Variability in Biological Studies
Living organisms exhibit significant variability across many biological dimensions.
Example: Variability in puppies of the same family.
Implication: Single family variability suggests broader variability across species and populations.
Measures of Central Tendency
Central Tendency refers to measures providing a central or typical value for a dataset.
Mean:
Defined as the average value. Calculated by:
Median:
The middle value when data points are arranged in order. For datasets with an odd number of points, it is the central number.
Mode:
The value that appears most frequently in a dataset.
Example with imaginary data:
For seven puppies, values such as weights might have a mean of 91.7 grams, median of 83 grams, and mode being 75 grams.
Data Analysis Approaches
Critical Questions for Data Analysis:
What is the best question to ask?
What method is optimal for data collection?
Which analytical approach should be used for the question at hand?
Measures of Variability
Due to high variability in living organisms, measures of variability become essential.
Range:
Difference between maximum and minimum values.
Interquartile Range:
The difference between the upper and lower quartiles of the data.
Standard Deviation (SD):
Measures the amount of variation around the mean.
Low SD indicates most values are close to the mean; high SD indicates a wider spread of values.
Calculated as:
where (x_i) are individual samples, (\bar{x}) is the mean, and (N) is the number of samples.Variance:
The average of the squared differences from the mean.
Visualizing Data Distribution
Many biological traits tend to follow a Normal Distribution curve
Normal Distribution is expected in various biological measurements.
Example: Height of students in a class likely forms a normal distribution curve.
Distribution percentages based on standard deviations from the mean:
68% of data lies within 1 standard deviation.
95% within 2 standard deviations.
99.7% within 3 standard deviations.
Data Analysis Practice
Example Question:
What would the mean weight be for a second litter of puppies?
The total weight of a given dataset is summed and divided by the count of the data points.
Tools for Data Analysis:
Encouragement to use Excel or Google Sheets for conducting statistical analysis.
Importance of utilizing tutorial resources available online for mastering data analysis in labs.
Conclusion and Additional Remarks
Some major course concepts may be encountered in labs before being formally introduced in lectures.
Laboratories are designed to integrate various biological concepts (evolution, ecology, genetics), potentially causing initial confusion but increasing understanding over time.
Encouragement to engage with provided learning resources and seek help when needed, ensuring fruitful learning experiences throughout the course.