HPCS4031: Research Methods in Psychology - Lecture 7 and 8: t-test
p-value vs. Alpha Level
Establishing an Alpha Level
An alpha level is the probability of rejecting the null hypothesis when it is actually true.
For example, if a significant difference is observed between two groups, this could be attributed to random chance (noise) rather than a systematic difference caused by the independent variable (IV).
Statistics is fundamentally about probability; thus, nothing can ever be 100% certain.
In psychology, the common practice is to set the alpha level at 0.05 (5%):
This signifies that we accept a 5% risk of identifying a false significant effect (Type I error).
Alternative alpha levels:
More conservative: 0.01 (1%)
More liberal: 0.10 (10%) (rarely used)
Distribution of Sample Means
The distribution of sample means is divided into two sections:
Sample means that are likely to be obtained if the null hypothesis (H0) is true.
These sample means are close to the value specified by the null hypothesis and are consistent with it.
Sample means that are very unlikely to be obtained if H0 is true.
These sample means are far from the value specified by the null hypothesis and lead to its rejection.
Alpha Level Explained
The alpha level or level of significance is a probability value that defines what is meant by "very unlikely" in hypothesis testing.
Commonly used alpha levels include:
0.05 (5%)
0.01 (1%)
0.001 (0.1%)
Example with α = 0.05:
The most unlikely 5% of sample means (the extreme values in the tails) are separated from the most likely 95% (the central values). These extreme values make up the critical region.
Critical Region
Definition:
The critical region consists of the extreme sample values that are unlikely based on the established alpha level if the null hypothesis is true.
Boundaries:
Determined by the alpha level.
Decision Rule:
If sample data fall within this critical region, the null hypothesis is rejected.
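As a rough illustration of how the alpha level sets the boundaries, the sketch below assumes the distribution of sample means is normal and expressed in z-scores. That is an assumption made for illustration only; a t-test would use the t distribution instead. The function name `critical_boundaries` is hypothetical.

```python
from statistics import NormalDist

def critical_boundaries(alpha):
    """Return the (lower, upper) z boundaries of a two-tailed critical region."""
    # Split alpha equally between the two tails of a standard normal curve
    # (assumed here; not specified in the notes).
    upper = NormalDist().inv_cdf(1 - alpha / 2)
    return -upper, upper

lower, upper = critical_boundaries(0.05)
print(f"alpha = 0.05: reject H0 if z < {lower:.2f} or z > {upper:.2f}")
```

With α = 0.05 the boundaries come out at about ±1.96; a smaller alpha such as 0.01 pushes them further into the tails, which makes the critical region harder to reach.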
Type I and Type II Errors
Type I Error:
Occurs when the null hypothesis is rejected even though it is actually true.
Probability of Type I error is denoted by the alpha level (α).
Type II Error:
Occurs when the null hypothesis is retained even though it is actually false.
Probability of Type II error is referred to as Beta (β).
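The link between alpha and the Type I error rate can be checked with a small simulation. The sketch below is illustrative only and rests on assumptions that are not in the notes: both groups are drawn from the same normal population (so H0 is true by construction), the population standard deviation is known to be 1, and a z-test is used rather than a t-test. The function name, sample size, number of trials, and seed are all hypothetical choices.

```python
import random
from statistics import NormalDist, mean

def type_i_error_rate(alpha=0.05, n=20, trials=2000, seed=1):
    """Estimate how often a true H0 is rejected at the given alpha level."""
    rng = random.Random(seed)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)   # two-tailed boundary
    std_error = (2 / n) ** 0.5                     # sigma = 1 in both groups (assumed)
    rejections = 0
    for _ in range(trials):
        # Both groups come from the same population, so any difference is noise.
        group1 = [rng.gauss(0, 1) for _ in range(n)]
        group2 = [rng.gauss(0, 1) for _ in range(n)]
        z = (mean(group1) - mean(group2)) / std_error
        if abs(z) > z_crit:
            rejections += 1                        # a Type I error
    return rejections / trials

rate = type_i_error_rate()
print(f"Proportion of false rejections: {rate:.3f}")
```

Because H0 is true in every simulated study, each rejection is a Type I error, and the proportion of rejections should land close to the chosen alpha of 0.05.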
Comparing p-value to Alpha Level
p-value:
Represents the probability of obtaining a difference between groups at least as large as the one observed if the null hypothesis is true, that is, if only chance (random noise) is at work.
Connection between t-values and p-values: larger absolute t-values yield smaller p-values.
For instance, a p-value of 0.063 means that, if the null hypothesis were true, a difference at least this extreme would occur about 6.3% of the time by chance alone.
Given α = 0.05 (5%), this p-value leads to retaining the null hypothesis, since 6.3% exceeds the 5% risk of a Type I error we are willing to accept.
Rules for Hypothesis Testing
If p >= α, retain the null hypothesis (H0: m1 = m2).
If p < α, reject the null hypothesis in favor of the alternative hypothesis (H1: m2 > m1).
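The two rules can be written as a tiny function. This is a minimal sketch; the name `decide` and the returned strings are hypothetical labels, not part of any statistics package.

```python
def decide(p_value, alpha=0.05):
    """Apply the rule: retain H0 if p >= alpha, reject H0 if p < alpha."""
    if p_value < alpha:
        return "reject H0"
    return "retain H0"

print(decide(0.063))        # p is above alpha = 0.05
print(decide(0.03))         # p is below alpha = 0.05
print(decide(0.03, 0.01))   # same p, but a more conservative alpha
```

The last call shows that the same p-value can lead to a different decision under a stricter alpha, which is why the alpha level has to be fixed before looking at the result.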
SPSS Output for Independent Samples Test
Independent Samples Test Results:
t = -2.157
df = 8
Sig. (2-tailed) = 0.071
Mean Difference = -0.32000
Std. Error Difference = 0.14832
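The t value in this output is simply the mean difference divided by its standard error. A quick check, using only the two numbers reported above (the variable names are illustrative):

```python
# Values copied from the SPSS output above.
mean_difference = -0.32000
std_error_difference = 0.14832

# t = signal / noise = mean difference / standard error of the difference
t = mean_difference / std_error_difference
print(f"t = {t:.3f}")
```

The result rounds to the t = -2.157 shown in the output.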
Calculating p-value Manually
After calculating the t-statistic, reference a t-distribution table to look up the corresponding p-value.
Requirements:
Degrees of freedom (df)
One-tailed or two-tailed test designation.
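A table only brackets the p-value between two columns. As a hedged alternative, the sketch below approximates the two-tailed p-value by numerically integrating the Student t density with Simpson's rule, using only the Python standard library. The function names and the number of integration steps are hypothetical choices; in practice a statistics package would normally be used.

```python
import math

def t_pdf(x, df):
    """Density of the Student t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def two_tailed_p(t, df, steps=2000):
    """Approximate P(|T| >= |t|) with Simpson's rule (steps must be even)."""
    upper = abs(t)
    h = upper / steps
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area_zero_to_t = total * h / 3
    # The central area is twice the area from 0 to |t|; the tails are the rest.
    return 1 - 2 * area_zero_to_t

p = two_tailed_p(-2.157, 8)
print(f"two-tailed p = {p:.3f}")
```

For t = -2.157 with df = 8 the approximation comes out near 0.063. A larger absolute t leaves less area in the tails and therefore gives a smaller p-value.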
Degrees of Freedom (df) Explained
For sample variance, the calculation uses n - 1.
This reflects that, once the sample mean is known, only n - 1 of the n data points can vary freely; the last point is determined by the other n - 1 points.
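Thus, for a sample of n scores, the equation for degrees of freedom is: $df = n - 1$.
Illustratively, with n = 3 data points and a mean of 20 (total = 60), $df = 3 - 1 = 2$: once any two scores are known, the third is fixed because the three scores must sum to 60.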
Independent Samples t-test and Degrees of Freedom Calculation
For independent measures t-statistic:
$df = df_1 + df_2 = (n_1 - 1) + (n_2 - 1) = n_1 + n_2 - 2$.
Example:
If both samples (n1, n2) are 5, then:
$df = (5 - 1) + (5 - 1) = 8$.
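The degrees-of-freedom arithmetic is easy to script. A minimal sketch, with hypothetical function names:

```python
def df_single_sample(n):
    """Degrees of freedom for one sample of n scores."""
    return n - 1

def df_independent(n1, n2):
    """Degrees of freedom for an independent-measures t-test."""
    # Each sample contributes its own n - 1.
    return (n1 - 1) + (n2 - 1)

print(df_single_sample(3))    # the n = 3 illustration
print(df_independent(5, 5))   # two samples of 5
```

Two samples of 5 give df = 8, which is the value needed when looking up the t distribution for this example.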
One-Tailed vs. Two-Tailed Tests
For example, in a comparison of online vs face-to-face learning, a two-tailed test is typically appropriate.
To convert a two-tailed p-value to a one-tailed p-value, divide it by 2, provided the observed difference is in the predicted direction.
A one-tailed test is viewed as more liberal because it makes a significant result easier to obtain, which is why it is sometimes regarded as "cheating".
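The conversion can be sketched as follows. The function name and the `in_predicted_direction` flag are hypothetical; the flag covers the case where the observed difference runs opposite to the prediction, in which halving the p-value would be wrong.

```python
def one_tailed_p(two_tailed_p, in_predicted_direction=True):
    """Convert a two-tailed p-value to a one-tailed p-value."""
    half = two_tailed_p / 2
    # If the effect is in the wrong direction, the one-tailed p is the
    # remaining area, which is always larger than 0.5.
    return half if in_predicted_direction else 1 - half

p_one = one_tailed_p(0.071)
print(f"one-tailed p = {p_one:.4f}")
```

A two-tailed p of 0.071 is not significant at α = 0.05, but its one-tailed counterpart of 0.0355 is, which illustrates why the one-tailed test is regarded as the more liberal choice.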
Between-Subjects/Independent-Measures Design
Goals of Independent-Measures Research Study
To evaluate the mean difference between two populations or treatment conditions.
Null Hypothesis:
H0: $\mu_1 - \mu_2 = 0$ (no difference between the population means).
Alternative Hypothesis:
H1: $\mu_1 - \mu_2 \neq 0$ (indicating there is a mean difference).
Independent Measures t Formula Explanation
The independent-measures t statistic evaluates a hypothesis concerning the difference between two population means.
Notable components:
The formula includes the mean difference from the sample data ($M_1 - M_2$).
Expected difference derived from the null hypothesis: $\mu_1 - \mu_2 = 0$.
Standard Error and Its Role
Standard error: Indicates the expected amount of error between the sample mean difference ($M_1 - M_2$) and the population mean difference ($\mu_1 - \mu_2$).
Two sources of error exist when evaluating the t statistic for independent measures:
From sample one.
From sample two.
Conclusion for t-statistic Calculation
To calculate the t-statistic accurately:
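Utilize $t = \frac{(M_1 - M_2) - (\mu_1 - \mu_2)}{S_{difference}}$, where $S_{difference}$ is the standard error of the mean difference and factors in the variance from both samples.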
Example of Difference Evaluation
Analysis of significant differences must account for both signal (effect of treatment) and noise (measurement error, individual variations, etc.).
For instance, a course mode comparison reveals means of 3.06 and 3.38 respectively:
A difference of 0.32 could stem from either signal (actual treatment effects) or noise (random variations).
The data support either the null hypothesis (H0: m1 = m2) or the alternative hypothesis (H1: m2 > m1).
Testing Procedures by Hand
Compute the t statistic, $t_{obs} = \frac{M_1 - M_2}{S_{difference}}$, where signal (the mean difference) over noise (the standard error) yields the t statistic.
Check whether the two sample sizes are equal or unequal:
If equal, use the basic formula for the standard error.
If unequal, pool the two sample variances, weighting each by its degrees of freedom, to obtain an unbiased measure of standard error.
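A minimal sketch of the hand calculation, assuming the pooled-variance form of the standard error (each sample variance weighted by its degrees of freedom). The function name `independent_t` and the two small data sets are made up for illustration and are not taken from the lecture.

```python
import math
from statistics import mean, variance

def independent_t(sample1, sample2):
    """Return (t, df) for an independent-measures t-test with pooled variance."""
    n1, n2 = len(sample1), len(sample2)
    df1, df2 = n1 - 1, n2 - 1
    # Pooled variance: weight each sample variance by its degrees of freedom.
    pooled_var = (df1 * variance(sample1) + df2 * variance(sample2)) / (df1 + df2)
    # Standard error of the mean difference (the "noise").
    std_error = math.sqrt(pooled_var / n1 + pooled_var / n2)
    # Under H0 the expected population difference is 0.
    t = (mean(sample1) - mean(sample2)) / std_error
    return t, df1 + df2

# Hypothetical scores, chosen so the arithmetic is easy to follow by hand.
group_a = [1, 2, 3, 4, 5]
group_b = [3, 4, 5, 6, 7]
t_obs, df = independent_t(group_a, group_b)
print(f"t({df}) = {t_obs:.3f}")
```

Here both groups have a sample variance of 2.5, so the pooled variance is 2.5, the standard error is 1, and the mean difference of -2 gives t = -2 with df = 8. When the two sample sizes are equal, pooling gives the same standard error as simply adding the two variance terms.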
Steps for Example Observations
State the hypotheses clearly before running the test.
Calculate the variances, the mean difference, and the standard error step by step to obtain the t statistic.
Interpret the outcome by comparing the calculated t to the critical t-values (e.g., for a two-tailed test with df = 8 at α = 0.05, the critical t is ±2.306).
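The comparison against the critical value can be sketched as below. The function name is hypothetical, and the critical value must still come from a t table for the relevant df and alpha; it is not computed here.

```python
def t_decision(t_obs, t_critical):
    """Two-tailed decision: reject H0 only if |t| exceeds the critical value."""
    return "reject H0" if abs(t_obs) > t_critical else "retain H0"

# t = -2.157 from the SPSS output, critical t = 2.306 (two-tailed, alpha = 0.05).
print(t_decision(-2.157, 2.306))
```

Because |-2.157| is smaller than 2.306, the obtained t does not fall in the critical region and H0 is retained, which agrees with a two-tailed p-value above 0.05.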
Summary of Testing Decisions
Compare the obtained t with the critical values to decide whether to retain or reject H0, based on the critical region defined by the alpha level.
The decision therefore depends on the significance level, which is set at the outset of the study.
Student Example for Practice
A study investigating sleep effects on academic performance used samples of n = 20 (music group) and n = 15 (quiet group), with mean recall scores of 12 and 16, respectively:
Using the sample variances for the two groups, evaluate whether the difference between the group means is statistically significant.
Practice Multiple-Choice Questions
A researcher conducts a study and obtains a p-value of . If the alpha level was set to , what is the correct statistical decision?
A) Reject the null hypothesis, as the result is statistically significant.
B) Retain the null hypothesis, as the p-value is greater than the alpha level.
C) Reject the alternative hypothesis, as the p-value indicates no effect.
D) The p-value is close to the alpha level, so more data is needed.
A psychology student is analyzing data from an independent samples t-test comparing two groups. The SPSS output shows , , and . If the researcher initially hypothesized a directional difference (e.g., Group A scores higher than Group B), what would be the p-value for a one-tailed test, and what decision should be made if ?
A) p-value = ; Retain the null hypothesis.
B) p-value = ; Reject the null hypothesis.
C) p-value = ; Retain the null hypothesis.
D) p-value = ; Retain the null hypothesis.
In an experiment, Group 1 has participants and Group 2 has participants. Both groups are independent. What is the correct degrees of freedom (df) for an independent measures t-test?
A)
B)
C)
D)
A research team publishes findings stating they rejected the null hypothesis with an alpha level of . However, it turns out that there was no real effect in the population. Which type of error did the research team commit?
A) A Type II error.
B) A Beta error.
C) A Type I error.
D) A standard error.
A researcher wants to be very cautious about making a Type I error. Which of the following alpha levels would be most appropriate for their study?
A)
B)
C)
D)
Answer Key:
B
B
B
C
D