Numerical Measures in Business Statistics
Numerical Measures in Business Statistics
Outline and Learning Outcomes
Outline:
What are measures?
Measures of Data Center and Location
Measures of Data Variation
Learning Outcomes: Students will be able to:
Compute and describe data using the mean, median, mode, and weighted mean.
Compute and describe data using range, interquartile range, variance, and standard deviation.
Compute and use z-score and the coefficient of variation to describe data.
Apply the Empirical Rule and Tchebysheff's theorem.
Measures of Center and Location
Measure:
A quantitative value that describes a particular characteristic of a dataset.
Useful for summarizing and interpreting data.
Allows complex datasets to be distilled into a single useful number.
Key Terms: Parameter and Statistic
Parameter:
A measure computed from the entire population.
This quantitative value is constant if the population does not change.
Usually denoted with a Greek character (e.g., for population mean, for population standard deviation).
Statistic:
A measure computed from a sample of a population.
Will vary based on the specific sample taken.
Usually denoted with a Roman character (e.g., for sample mean, for sample standard deviation).
Mean (Average)
The most commonly used measure of central tendency.
Helps to describe the center of a dataset.
=average(range)in Excel.
Population Mean ()
A parameter.
Formula:
: Population mean (pronounced "mu")
: Population size
: individual value of the variable
Example #1: Sales prices: . For .
Example #2: Sales prices: . For .
Sample Mean ()
A statistic.
Formula:
: Sample mean (pronounced "x-bar")
: Sample size
: individual value of the variable
Example #1: Cars sold daily: . For .
cars
Median
Another center measure, less impacted by outliers than the mean.
Divides a data array into two equal halves.
=median(range)in Excel.To Compute the Median:
Sort the data in ascending order.
Calculate the median's index ().
For population data index ():
For sample data index ():
If index is not an integer: Round up to the next integer. The median is the value at this rounded index position.
If index is an integer: The median is the average of the values in index positions and .
Example #1 (Odd N): Prices: . ()
Sorted:
Index: . Round up to .
Median is the value: .
Example #2 (Odd N): Cars sold: . (Sorted, )
Index: . Round up to .
Median is the value: cars.
Example #3 (Even N): Cars sold: . (Sorted, )
Index: . (Integer)
Median is the average of values at positions and .
Values: and . Median cars.
Using Mean and Median Together
Comparing the mean and median can reveal insights about the dataset's distribution, especially regarding extreme values:
Mean > Median: The dataset is skewed to the right (positive skew), indicating the presence of high extreme values.
Mean < Median: The dataset is skewed to the left (negative skew), indicating the presence of low extreme values.
Mean = Median: The dataset is evenly spread and symmetric.
Mode
Another measure of central location, though not as common as mean or median.
Counts the most frequent value(s) in the dataset.
A dataset can have multiple modes (bimodal, multimodal) or no mode.
Note: The mode need not be in the center and may not always reflect the center of the data set; interpretation requires care.
Example #1 (Single Mode): Daily car sales:
The value appears 3 times, which is more than any other value. Mode = .
Example #2 (Multiple Modes): Song plays:
The values and both appear 3 times. Modes = .
Center Considerations:
Datasets with no value(s) occurring more frequently than others have no mode.
The mode may not necessarily be close to the mean or median (e.g., for dataset , mode is 3, median is 8, mean is 8.4).
Other Location Measures
Weighted Mean
Used when some data values are more important or occur more frequently than others.
Also known as weighted average.
Weighted Mean for a Population ()
Formula:
: The weight of the data value
: The data value
Weighted Mean for a Sample ()
Formula:
: The weight of the data value
: The data value
Example #1 (Income):
Worker 1: 50 days, .
Worker 2: 20 days, .
Worker 3: 30 days, .
Example #2 (Antique Store):
Cust 1: 5 items, .
Cust 2: 3 items, .
Cust 3: 4 items, .
Percentiles
Bin data into equally spaced percentage bins.
Can answer questions like: