● Subsets: all data should come from the same population
● Extrapolation: doing unfounded assumptions
● Outliers, leverage, and influence: A point with high leverage and influence
changes our understanding of the data in the model, sometimes it is better to
omit these values to get a better understanding of data.
● Lurking variable: Is when a third variable is influencing both variables, correlation
does not equal causation.