Ch 3 and 12 Test Review: Regression and Inference

LSRL Calculations and Residual Diagnostics

  • Least-Squares Regression Line (LSRL) Equation: AR^=5.810.19(Hours)\hat{AR} = 5.81 - 0.19(\text{Hours}).

  • Residual Calculation: For the point (3,10)(3, 10), the predicted value is y^=5.810.19(3)=5.24\hat{y} = 5.81 - 0.19(3) = 5.24. The residual is observedpredicted=105.24=4.76\text{observed} - \text{predicted} = 10 - 5.24 = 4.76.

  • Impact of Removing Points:     * Large Residual Point: Removing a point with a large residual (e.g., (3,10)(3, 10)) results in the slope (bb) remaining about the same, the correlation (rr) increasing, and the typical residual size (ss) decreasing.     * Influential Outlier: Removing an outlier in the explanatory variable (e.g., (15,2)(15, 2)) causes the slope (bb) to change significantly and the correlation (rr) to increase.

Inference for Linear Regression: Sleep and GPA

  • LSRL Context: GPA^=2.6476+0.10176(SLEEP HOURS)\hat{GPA} = 2.6476 + 0.10176(\text{SLEEP HOURS}).

  • Slope Interpretation: For each additional hour of sleep, a student's GPA\text{GPA} is predicted to increase by 0.101760.10176 points.

  • Standard Deviation of Residuals (ss): The value s=0.1804s = 0.1804 represents the typical prediction error or the typical residual in GPA\text{GPA} points.

  • Hypothesis Test for Slope:     * Hypotheses: H0:β1=0H_0: \beta_1 = 0 vs. H_a: \beta_1 > 0.     * Degrees of Freedom: df=n2=142=12df = n - 2 = 14 - 2 = 12.     * Test Statistic: t(12)=0.1017600.04347=2.341t(12) = \frac{0.10176 - 0}{0.04347} = 2.341.

  •     * P-value: P(t(12)2.341)=0.0187P(t(12) \geq 2.341) = 0.0187.     * Conclusion: Since 0.0187 < 0.05, reject H0H_0. There is convincing evidence of a positive linear relationship between hours of sleep and GPA\text{GPA}.

  • 95% Confidence Interval for Slope:     * Formula: b1±t(12)××SEb1b_1 \pm t(12)^\times \times SE_{b_1}.     * Calculation: 0.10176±2.179×0.04347(0.007,0.196)0.10176 \pm 2.179 \times 0.04347 \rightarrow (0.007, 0.196).     * Interpretation: We are 95%95\% confident that the true slope of the LSRL for GPA\text{GPA} vs Hours of Sleep is between 0.0070.007 and 0.1960.196 points/hour.

Non-Linear Transformations and Model Selection

  • Linear Model: Area^=4.133+3.0333(seconds)\hat{\text{Area}} = -4.133 + 3.0333(\text{seconds}). Prediction at 5.5seconds5.5\,\text{seconds} is 12.55cm212.55\,cm^2.

  • Model Selection: The Exponential Model (Model 1) is superior because it has the highest r2r^2 and the residual plot lacks a clearly curved pattern.

  • Exponential Model Equation: log(Area^)=0.34057+0.115582(seconds)\log(\hat{\text{Area}}) = 0.34057 + 0.115582(\text{seconds}).

  • Transformed Prediction: To predict area at 5.5seconds5.5\,\text{seconds}, calculate log(Area^)=0.34057+0.115582(5.5)=0.976271\log(\hat{\text{Area}}) = 0.34057 + 0.115582(5.5) = 0.976271. Solving for Area\text{Area} gives Area=100.976271=9.468cm2\text{Area} = 10^{0.976271} = 9.468\,cm^2.