Statistics: Regression Analysis & Residuals
Residuals and Regression Analysis
Residuals are the differences between observed values and values predicted by a regression model.
Important to visualize residuals for analysis and regression appropriateness.
Scatter Plots and Data Analysis
When plotting data (e.g., hours and weight of dry ice), look for patterns:
As hours increase, dry ice weight decreases.
Use scatter plots to analyze direction, shape, and strength of relationships.
Example: a negative linear association with a strong correlation (e.g., $r = -0.9979$).
Least Squares Regression Line (LSRL)
Form of LSRL: ext{Ŷ} = a + bX
Example from transcription: ext{Ŷ} = -0.52 + 15.21X
Calculate using a statistical calculator: Store the regression equation in function (e.g., $Y_{1}$).
Analyzing Residuals
Check for outliers; plot residuals for randomness to ensure model appropriateness.
If residual plot shows a pattern (e.g., U-shaped), indicates that the linear model may not be suitable.
Requires consideration of nonlinear models (quadratic, exponential, etc.).
Model Appropriateness
A linear association can seem strong, yet residual analysis may suggest otherwise.
Confirm linear fit by ensuring residuals plot is randomly distributed.
If not random, alternative models may be a better fit.
Confounding Variables
Confounding variables can mislead conclusions from data analysis. They act as hidden causes in associations.
Example: Increased firefighters might indicate worse fires causing more damage, not the firefighters themselves causing damage.
Key Takeaways
Analyze both data scatter plots and their residual plots to see if a linear model is appropriate.
Always consider potential confounding variables before drawing conclusions from statistical analysis.