10/10/25: Chapter 10 Review
Overview of Linear Correlation and Regression
Review of linear correlation and regression concepts from Chapter 10.
Focus on the final exam structure.
Final exam scheduled for next Wednesday at 7:30 PM.
Review session planned for Monday on Chapter 8 problems.
Problem Example 1: Linear Correlation Assessment
Objective: Determine if there is sufficient evidence to support a linear correlation between time of ride and fare.
Data points given for analysis:
Time of ride (X): 3, 1, 2, 7, 11, 8, 8, 2
Fare (Y): 14.3, 31.75, 36.8, 9.8, 7.8, 7.8, 4.8
Introduction to Linear Correlation Coefficient (r)
The linear correlation coefficient, denoted as r, is calculated using the following formula:
r = \frac{n \sum xy - \sum x \sum y}{\sqrt{(n \sum x^2 - (\sum x)^2)(n \sum y^2 - (\sum y)^2)}}
Breakdown of terms in the formula:
n: Number of data points.
\sum xy: Sum of the product of each X and corresponding Y.
\sum x: Sum of all X values.
\sum y: Sum of all Y values.
\sum x^2: Sum of squares of X values.
\sum y^2: Sum of squares of Y values.
Data Organization (for Problem Example 1)
Create a table to organize data for computation:
Sum computations:
Total for X: 34
Total for Y: 23.5
Total for X^2: 252
Total for Y^2: 1864.76
Total for XY: 401.1
Calculation of r
Use above sums in the formula for r:
n = 8 (data points)
Calculation example: Step-by-step by plugging in values,
leading to the result of 0.953 for r.
Hypothesis Setup (for Problem Example 1)
Null Hypothesis (H₀): There is no linear correlation (r = 0).
Alternative Hypothesis (H₁): There is a linear correlation (r \neq 0).
Critical value determination:
Use Table A-6 with n = 8 and alpha level (\alpha) of 0.05:
Critical values found: \pm0.707.
Conclusion from r value
Resulting r value (0.953) exceeds the critical value,
suggesting a significant linear correlation between time and fare.
Interpretation: There is sufficient evidence that a linear correlation exists.
Alternate P-value Method
Compute t statistic:
t = \frac{r}{\sqrt{\frac{1 - r^2}{n - 1}}}
Substituting our r value:
t = 7.7.
Use t-table (A-3) with degrees of freedom (n-2 = 6):
t value > table maximum for df=6; thus p < 0.01.
Since p < \alpha (0.05), reject H₀, indicating a linear correlation.
Problem Example 2: Finding Regression Line Equation
Data Given:
X: 108, 139, 111, 464, 127, 5
Y: 7.466, 60.771, 20.747, 0.117, 0.818, 0.846, 0.085, 0.398, 0.156, 0.425, 0.7
Regression Coefficient Calculation
Method 1 and Method 2 to calculate coefficients b\u2081 and b\u2080:
Method 1: Using the defined regression equations:
Coefficients obtained, b\u2081 = 0.5, b\u2080 = 3.0.
Method 2: Using standard deviations to calculate coefficients:
Using values from the t-statistic.
Conclusion of Regression Equation
Resulting regression equation:
y = 3 + 0.5x
Interpretation: Predicts y based on any given x value in practical application.
Take Home Assignment Problems
Problem set related to Lecture Content, to be submitted by next Monday midnight:
Find regression line for provided data.
Test significance using the previously discussed methods.
Final Notes
Ensure comprehensive understanding of both correlation and regression methods.
Focus on problem-solving for final preparations, especially in Chapter 10 and associated calculations.
Opportunities for clarification and review offered in upcoming sessions.