Measures of Central Tendency
Measures of Central Tendency
Definition
Measures of Central Tendency: A statistical measure that identifies a single value, attempting to represent the center point or typical value of a dataset.
Types of Measures
The three primary measures of central tendency are:
Mean
Median
Mode
Mean
Definition:
The mean is the arithmetic average of a set of values. It is calculated by summing all values and dividing by the number of values.
Example:
Given scores: 5, 12, 20, 16, 15, 23, 10, 18, 7, 11
When to use the mean:
The mean is appropriate for interval or ratio-scaled data that is not skewed.
Distribution Skewness
Skewed distributions:
Positive Skew: Longer tail on the right side.
Negative Skew: Longer tail on the left side.
Median
Definition:
The median is the middle value in a dataset when arranged in ascending order, effectively splitting the dataset into two equal halves.
Example 1:
Given data: 32, 41, 56, 34, 28, 67, 49, 37, 52
Step 1: Arrange in ascending order: 28, 32, 34, 37, 41, 49, 52, 56, 67
Step 2: The median is 41 (5th position in a 9-value dataset).
Example 2:
Data: 4.5, 2.8, 5.6, 9.2, 3.5, 6.7, 3.9, 8.4
Step 1: Ascending order: 2.8, 3.5, 3.9, 4.5, 5.6, 6.7, 8.4, 9.2
Step 2:
Median:
When to use the median:
The median can be used for datasets that are skewed (positively or negatively).
Mode
Definition:
The mode is the value that appears most frequently within a dataset.
Bimodal: If there are two modes.
Multimodal: If there are more than two modes.
Example:
Given data: 12, 15, 13, 12, 14, 17, 16, 12, 13, 19
Mode: 12 (occurs 3 times).
Dataset: 3.4, 2.2, 3.5, 3.4, 2.2, 2.6, 2.1, 3.9, 2.2, 3.4
Modes: 3.4 and 2.2
Dataset: 105, 200, 159, 110, 225, 170, 115, 250, 285, 190
Mode: Does not exist (no number repeats).
When to use the mode:
The mode is mainly used with nominal and ordinal scaled data.
Comparison of Measures
Skewness Effects:
Left-Skewed (Negative Skewness): Mode > Median > Mean
Right-Skewed (Positive Skewness): Mean > Median > Mode
Conclusion
Understanding these measures helps in better data analysis and interpretation.
Determine which measure is appropriate based on the data characteristics and distribution.