Distribution of Means, Z Tests, and Confidence Intervals
Review of Populations, Samples, and Research Objectives
Population vs. Sample:
A population is the entire group that a researcher is interested in (e.g., all depressed people).
A sample is a smaller subset of that population (e.g., a sample of depressed people).
Because it is usually not feasible to study an entire population, researchers u8ose samples to make inferences about the larger group.
Statistics vs. Parameters:
Parameters are numerical values that describe a population.
Statistics are numerical values that describe a sample.
Researchers use statistics to estimate population parameters.
Objective of Inferences:
The primary goal is for samples to represent the population accurately so that results can be claimed about the population as a whole.
Sampling Error and Estimation
Defining Sampling Error:
This refers to the difference between the values actually existing in the population and the values observed in a sample.
It is the likelihood that sample values (such as the Mean, , and Standard Deviation, ) will differ from the actual population parameters.
It does not imply that mistakes were made in data collection or analysis; it is a natural byproduct of sampling.
Managing Sampling Error:
In real-world scenarios, the exact amount of sampling error is unknown because the population parameters are rarely known.
Therefore, researchers estimate the average amount of sampling error present in the data.
Illustration of Sampling Error:
Population (NY College Students):
Total population size () = .
Population Parameters: Mean Age = , Mean IQ = , Gender = Female, Male.
Sample #1 (U. Albany Students, ):
Sample Statistics: Mean Age = , Mean IQ = , Gender = Female, Male.
Sample #2 (Ithaca College Students, ):
Sample Statistics: Mean Age = , Mean IQ = , Gender = Female, Male.
Sample #3 (NYU Students, ):
Sample Statistics: Mean Age = , Mean IQ = , Gender = Female, Male.
The Distribution of Means
Conceptual Overview:
When samples of more than one individual are taken, the comparison distribution for the null hypothesis shifts from a distribution of individual scores to a distribution of means.
Also referred to as the "Sampling Distribution of Means" or "Distribution of Sample Means."
These means tend to "pile up" around the actual population mean ().
Characteristics of the Distribution of Means:
Mean of the Distribution of Means: The mean of the sampling distribution is identical to the mean of the population of individuals ().
Impact of Sample Size ():
Larger sample sizes lead to sample means that are closer to the population mean on average.
Large samples are generally better representatives of the population than small samples.
Variance of the Distribution of Means ( or ):
Calculated as the variance of the population divided by the number of individuals in each sample: .
Standard Deviation of the Distribution of Means:
This is the square root of the variance of the distribution of means.
The Standard Error of the Mean (SEM)
Definition: The standard deviation of the sampling distribution of means is formally called the Standard Error of the Mean (SEM).
Function:
It reflects the accuracy with which sample means estimate the population mean.
it represents the average deviation (sampling error) of sample means from the population mean ().
Mathematical Relationship and Sample Size:
As sample size () increases, the SEM decreases, meaning the sample mean becomes a better estimate of the population mean.
If the SEM is large, sample means will tend to differ significantly from one another, and many will not be accurate representations of .
It answers the practical question: "If I had taken another sample, how different would my result be?"
Hypothesis Testing with Z Tests
Applicability: Z tests are used when population parameters (population mean and standard deviation) are known.
The Procedure:
A sample mean is compared to a known null population mean using the SEM (adjusted for sample size) as the standard of measure.
Z-Test Formulas:
Standardizing the mean:
Where
Z Test Example: College XYZ IQ Scores:
Known Population Parameters: Mean () = , Standard Deviation () = .
Hypotheses:
Null Hypothesis (): (No difference).
Alternative Hypothesis (): (Difference exists).
Scenario 1: Small Sample ():
Observed Mean () = .
Alpha Level () = (two-tailed).
Critical Value () = .
Calculation:
Conclusion: 1.47 < 1.96. Since the observed is smaller than the critical value, the null hypothesis is not rejected. The result is inconclusive.
Scenario 2: Large Sample ():
Observed Mean () = .
Calculation:
(or )
Conclusion: 3.33 > 1.96. Since the observed is larger than the critical value, the null hypothesis is rejected in favor of the alternative. The class mean appears to come from a population with a mean higher than .
Confidence Intervals (CI)
Purpose: An alternative to the "all-or-none" decision of hypothesis testing. It establishes a range of values around a sample mean where the population mean is likely to reside.
Calculation Components:
Uses the sample mean ().
Uses the standard error (SEM).
Uses a specific level of confidence (typically or ).
Confidence Levels and Z Scores:
For a CI, the Z score is .
For a CI, the Z score is .
Formula:
Lower Limit:
Higher Limit:
Confidence Interval Examples
Example: IQ of 25 Children:
Given: , Variance of population = , .
Step 1: Calculate SEM:
Variance of D of M =
Confidence Interval:
Interval: to
Confidence Interval:
Interval: to
Observations on Intervals:
The CI is wider than the CI. A larger range is required to be more confident that the interval contains the population mean.
Effect of Sample Size:
As increases, SEM decreases, making the confidence interval smaller (more precise).
Example at \ confidence: If , the range is to . If , the range narrows to to .
Interpretation:
It is technically more accurate to say we are calculating a range of values that contains the true population mean based on the sample mean, rather than being "\ confident this specific interval contains it."
SPSS Application and Data Visualization
Error Bar Graphs:
Used to visualize confidence intervals for different categories.
Path: Graph -> Error Bar -> Simple -> Define.
Interpreting Overlap:
Nonsignificant Results: Confidence intervals around the means will typically fall within each other's limits (they overlap significantly).
Significant Results: The confidence interval around one mean does not overlap with the confidence interval of the other mean.
Variables Examined in Class:
Independent Variable (IV) vs. Dependent Variable (DV).
Examples:
Family_VisitbySex,Current_GPAbySex.
Scholarly Research Examples
Trypophobia and Comfort Levels (Pipitone & DiMattina, 2020):
Study involved trypophobic images with three manipulated versions (original, scrambled, phase).
Sample size: participants.
Results (Error bars representing CIs):
Phase explained the most variance in comfort ().
Amplitude explained \ variance.
Interaction of phase and amplitude explained .
Kinship and Fertility in Iceland (Helgason et al., 2008):
Analyzed all known Icelandic couples born between and .
Found a significant positive association between kinship and fertility.
Greatest reproductive success observed for couples related at the level of third and fourth cousins.
Used confidence intervals across seven intervals of kinship level to measure:
Total number of children.
Number of children who reproduced.
Number of grandchildren.
Mean life expectancy of children.