Case Influence Statistics
- Leverage (measure of potential influence)
- DFBetas (measure of actual influence)
- Cook's Distance (measure of actual influence)
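For reference, the three statistics above are usually written as follows. This is a sketch in standard textbook notation, which these notes do not fix: X is the n x p design matrix, e_i the ordinary residual, \hat\sigma the residual standard deviation, and a subscript (i) means "fit with case i deleted".

```latex
% Leverage: diagonal of the hat matrix (depends on X only, not on y)
H = X (X^{T} X)^{-1} X^{T}, \qquad h_i = H_{ii}, \qquad \sum_{i=1}^{n} h_i = p

% Internally studentized residual
r_i = \frac{e_i}{\hat\sigma \sqrt{1 - h_i}}

% DFBETAS: standardized change in coefficient j when case i is deleted
\mathrm{DFBETAS}_{j,i} =
  \frac{\hat\beta_j - \hat\beta_{j(i)}}
       {\hat\sigma_{(i)} \sqrt{\left[(X^{T} X)^{-1}\right]_{jj}}}

% Cook's distance: overall change in the fitted coefficients when case i is deleted
D_i = \frac{(\hat\beta - \hat\beta_{(i)})^{T} X^{T} X \, (\hat\beta - \hat\beta_{(i)})}
           {p \, \hat\sigma^{2}}
    = \frac{r_i^{2}}{p} \cdot \frac{h_i}{1 - h_i}
```

Leverage involves only the explanatory variables, which is why it measures potential influence; DFBETAS and Cook's distance also involve the response, so they measure actual influence.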
Toy Example to Illustrate Leverage
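A minimal sketch of such a toy example, assuming simple linear regression with an intercept. The six x values are made up for illustration and are not from the notes; the helper name `leverages` and the 2p/n cutoff (a common rule of thumb) are likewise assumptions.

```python
import numpy as np

def leverages(x):
    """Return the hat-matrix diagonal for a simple linear regression on x."""
    x = np.asarray(x, dtype=float)
    # Design matrix with an intercept column and the single predictor
    X = np.column_stack([np.ones_like(x), x])
    # Hat matrix H = X (X'X)^{-1} X'; leverage of case i is H[i, i]
    H = X @ np.linalg.inv(X.T @ X) @ X.T
    return np.diag(H)

# Hypothetical toy data: five clustered x values and one far from the rest
x_toy = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 20.0])
h = leverages(x_toy)

n, p = len(x_toy), 2          # p counts the intercept and the slope
flagged = h > 2 * p / n       # rule-of-thumb cutoff: twice the average leverage

for xi, hi, flag in zip(x_toy, h, flagged):
    print(f"x = {xi:5.1f}   leverage = {hi:.3f}   {'HIGH' if flag else ''}")
```

No response values are needed: leverage is determined entirely by how far each x lies from the mean of the x's, so the isolated point is flagged before any y is observed.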
REAL EXAMPLE: PCBs and Estuaries Data
Let's revisit the issue of non-normal errors in the PCB data set
from the take-home exam. Recall we had 37 observations with PCB
measurements in ppb in 1984 and 1985. Our goal was to develop a
short-term prediction model using the past year's data. In your
analyses, you found two "outliers", Boston Harbor and Delaware Bay.
We will take a look at this again and use some more formal methods for
identifying outliers and "influential" cases.
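One way the case statistics could be computed by hand is sketched below, using brute-force leave-one-out refits. The data here are synthetic and purely illustrative (they are NOT the PCB measurements), and the function name `case_influence` is an assumption of this sketch.

```python
import numpy as np

def case_influence(x, y):
    """Leverage, Cook's distance, and DFBETAS for simple linear regression."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = len(x)
    X = np.column_stack([np.ones(n), x])      # intercept + slope
    p = X.shape[1]
    XtX_inv = np.linalg.inv(X.T @ X)
    beta = XtX_inv @ X.T @ y                  # full-data least squares fit
    resid = y - X @ beta
    # Leverage: diagonal of X (X'X)^{-1} X'
    h = np.einsum("ij,jk,ik->i", X, XtX_inv, X)
    s2 = resid @ resid / (n - p)              # residual variance estimate
    r = resid / np.sqrt(s2 * (1 - h))         # internally studentized residuals
    cooks = r**2 / p * h / (1 - h)
    # DFBETAS: refit without case i and standardize the coefficient change
    dfbetas = np.empty((n, p))
    for i in range(n):
        keep = np.arange(n) != i
        Xi, yi = X[keep], y[keep]
        beta_i = np.linalg.lstsq(Xi, yi, rcond=None)[0]
        res_i = yi - Xi @ beta_i
        s_i = np.sqrt(res_i @ res_i / (n - 1 - p))
        dfbetas[i] = (beta - beta_i) / (s_i * np.sqrt(np.diag(XtX_inv)))
    return h, cooks, dfbetas

# Synthetic data (assumption): ten well-behaved cases plus one unusual case
x = np.append(np.arange(1.0, 11.0), 30.0)
y = np.append(2.0 + 0.5 * np.arange(1.0, 11.0) + 0.1 * (-1.0) ** np.arange(10), 0.0)

h, cooks, dfbetas = case_influence(x, y)
for i in range(len(x)):
    print(f"case {i:2d}  h = {h[i]:.3f}  Cook's D = {cooks[i]:8.3f}  "
          f"DFBETAS(slope) = {dfbetas[i, 1]:8.3f}")
```

In a listing like this, the cases worth a closer look are those whose Cook's distance stands well apart from the rest (values near or above 1 are commonly treated as a warning sign) or whose DFBETAS are large in absolute value.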
Plots/output:
- Simple Linear Regression with all datapoints.
- Case Statistics (all datapoints)
- Case Statistics (boxplots)
- Simple Linear Regression (without Boston Harbor)
- Simple Linear Regression with all datapoints log-log transformation
- Case Statistics (all datapoints)
- Case Statistics (boxplots)
- Simple Linear Regression without Boston Harbor
- Case Statistics (boxplots)
- Case Statistics (without Boston Harbor)
- Simple Linear Regression without Boston Harbor and Delaware Bay
- Case Statistics (without Boston Harbor and Delaware Bay)
CONCLUSIONS