Basic Statistics for Behavioral Sciences - Chapter 5
Chapter Five: Measures of Variability: Range, Variance, and Standard Deviation
1. Introduction to Measures of Variability
Author: Gary W. Heiman
Edition: 5th edition of "Basic Statistics for the Behavioral Sciences"
Focus: This chapter reviews how to measure variability in statistical data using different techniques, primarily focused on range, variance, and standard deviation.
2. Statistical Notation
New Statistical Notation:
Symbol:
Represents the sum of all squared scores (Xs).
Example: For scores 2, 3, and 4, calculation is done as follows:
Symbol:
Represents the square of the sum of all scores (Xs).
Example: For scores 2, 3, and 4:
Important Note: Solve the expressions within the parentheses prior to conducting additional calculations.
3. Theory: Clump vs. Spread
Concept:
Before comparing two groups statistically, it is essential to summarize groups to their essential elements.
Use measures of central tendency to encapsulate the "clump" of scores in the center of a distribution (i.e., mean, median, mode).
Acknowledgment that data points are not always identical and can diverge from the central tendency, necessitating measures to describe spread or variability among scores.
4. Measures of Variability
Definition: Measures of variability quantify the extent to which scores in a distribution differ from each other. These include methods like range, variance, and standard deviation.
5. Understanding Different Distributions
Illustration: Three distributions share the same mean (X = 6) but vary in their spread:
Sample A: Scores distributed tightly around the mean.
Sample B: Scores spread out more widely (0, 2, 6, 10, 12).
Sample C: All scores identical (6, 6, 6, 6, 6).
6. Normal Distribution Curves
Discussion:
Variations of the normal curve distribute scores differently while maintaining a mean, highlighting the significance of variance and standard deviation in understanding data spread.
7. Range
Definition: The range indicates the distance between the maximum and minimum scores within a distribution.
Formula:
Advantages:
Simplicity of computation.
Disadvantages:
Caused by only two extreme scores, which may not be representative of the data set as a whole.
8. Distance and Deviation
Definition of Distance: In mathematics, distance between two numbers calculated using subtraction; example given where the distance between 7 and 12 is 5 (12 - 7 = 5).
Definition of Deviation: The distance of an individual score from the mean. Same computational process applies. If the mean is 10 and a score is 12, the deviation is:
Remark: When calculating deviations for all scores in a distribution, the total sum will yield 0.
9. Variance and Standard Deviation
Goal: Variance and standard deviation serve as measures of how scores deviate from the mean.
Calculating Deviations: A method to analyze how many numbers deviate from the mean.
Acknowledgment that raw deviations won't yield a usable average if summed directly.
Example with scores 2, 3, and 4: Mean is 3, deviations are: (2-3) = -1, (3-3) = 0, (4-3) = +1, resulting in a sum that equals zero.
Workaround: Square deviations to avoid losing negative values; summing squared deviations provides a better measure of variability.
10. Sample Variance
Sample Variance Definition: The average of the squared deviations around the mean.
Pros: Squaring deviations eliminates the zero-sum problem.
Cons: Squaring can exaggerate deviations, leading to representations in squared units.
11. Sample Standard Deviation
Concept: The sample standard deviation is the square root of the sample variance, thereby preserving the same units as the original data.
12. Interpretation of Standard Deviation
Summary: Standard deviation quantifies the average deviation from the mean, indicating score consistency and dispersion.
A larger standard deviation implies greater spread in data.
13. Normal Distribution Characteristics
In a normal distribution:
Approximately 34% of scores lie within one standard deviation of the mean.
Cumulatively, about 68% of scores fall between -1 and +1 standard deviation from the mean.
14. Application of Standard Deviation in Real Life
Example: Given the average score of 86.8, an individual scoring below this might have a deviation of 11 points, reflecting normal distribution probabilities.
Example with standard deviation to express data ranges (between 75.3 and 98.2).
15. Estimating Population Variance and Standard Deviation
Challenges: Direct calculation of population variance is rarely feasible; often rather than definitive, they represent estimations.
Importance of understanding degrees of freedom when calculating estimates to account for sample size.
16. Degrees of Freedom
Analogy: Using an elevator metaphor where if two people are present, one’s actions are easily identified (farting analogy). With more individuals, identification becomes random.
Definition: Degrees of freedom refer to how many scores are free to vary in statistics. In calculations, adjust by dropping one sample from the total (N-1).
17. Estimation Bias in Sample Variability
Overview: Sample variance tends to underestimate true population variance due to inherent sampling biases. Adjusting estimates requires random scores and acknowledgment of the last deviations constrained to zero.
18. Unbiased Estimates and Formulas
Formulations:
Employing N-1 yields unbiased estimators of variance and standard deviation for the population.
Note: Definitional formulas are presented simply for understanding without practical application.
19. Real-World Example Integration
Application: In practical assessments, average consumer responses and standard deviations display underlying variability in reactions.
Conclusion that comprehending standard deviation allows behavioral scientists to substantiate claims regarding the impact and efficacy of interventions.
20. Studying Variability via Surveys
Example: Surveys gauging extroversion among friends—analysis of Likert scales can delineate variability in social behaviors.
Understanding responses assists in determining central tendencies vs. averaged responses.
21. Implications for Research Reporting
Publishing Results: When drafting research articles, descriptive statistics like means and standard deviations should be integrated in the Method section, framing participant demographics.
Example provided for a sample of first-year students including gender breakdown and smoking statistics, illustrating demographic diversity.