Multicollinearity
EXAMPLE: Sediment Yield in Mountain Basins
We have data for 19 mountain basins, mostly in the Southern Alps of New
Zealand (Hicks, McSaveney, and Chinn, 1990). The authors found that
sediment yield was determined largely by runoff and precipitation. To
linearize the relationships, we'll take log base 10 of all variables, as suggested by the authors (see page 139, RWG).
Variables:
- Mean Annual Stream Sediment Yield (tons/km^2)
- Mean Annual Runoff (mm)
- Mean Annual Precipitation (mm)
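
Why the log transform helps can be seen in a small sketch. If sediment yield followed a power law in runoff, y = a * x^b, then log10(y) = log10(a) + b * log10(x), which is a straight line. The numbers below are made up for illustration and are NOT the Hicks et al. data; `a`, `b`, the runoff values, and the helper `least_squares` are all assumptions.

```python
import math

# Hypothetical power-law relationship (NOT the basin data): y = a * x**b
a, b = 0.5, 2.0
runoff = [500.0, 1000.0, 2000.0, 4000.0, 8000.0]   # made-up values, mm
sediment = [a * x**b for x in runoff]              # made-up values

# Take log base 10 of both variables.
log_x = [math.log10(x) for x in runoff]
log_y = [math.log10(y) for y in sediment]

def least_squares(x, y):
    """Return (intercept, slope) of the simple least-squares line of y on x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    slope = sxy / sxx
    return my - slope * mx, slope

intercept, slope = least_squares(log_x, log_y)

# On the log-log scale the slope estimates b and the intercept estimates log10(a).
print(slope, intercept, math.log10(a))
```

Real data will scatter around the line rather than fall exactly on it; the point is only that a multiplicative relationship becomes additive after taking logs.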
Plots/output:
- Simple Linear Regression log(sediment) on log(runoff)
- Simple Linear Regression log(sediment) on log(precipitation)
- Multiple Regression log(sediment) on log(precipitation) and log(runoff)
- Added Variable Plots
- Scatter Plot Matrix and Correlation Matrix
- Case Statistics and False Discovery Rate Procedure
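
As a rough sketch of what an added variable plot shows: to examine runoff after accounting for precipitation, regress log(sediment) on log(precipitation) and keep the residuals, regress log(runoff) on log(precipitation) and keep the residuals, then plot the first set against the second. The slope of that plot equals the runoff coefficient from the multiple regression. The data below are made up (NOT the basin data), and the helper names `centered`, `dot`, and `residuals` are hypothetical.

```python
def centered(v):
    """Subtract the mean from every element of v."""
    m = sum(v) / len(v)
    return [vi - m for vi in v]

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def residuals(y, x):
    """Residuals from the simple least-squares regression of y on x (with intercept)."""
    yc, xc = centered(y), centered(x)
    slope = dot(xc, yc) / dot(xc, xc)
    return [yi - slope * xi for yi, xi in zip(yc, xc)]

# Made-up log10 values (NOT the basin data). Precipitation and runoff are
# deliberately close to each other to mimic collinear predictors.
log_precip = [3.20, 3.35, 3.48, 3.60, 3.71, 3.80, 3.92, 4.01]
log_runoff = [3.05, 3.24, 3.31, 3.52, 3.58, 3.74, 3.80, 3.95]
log_sed    = [1.90, 2.41, 2.50, 3.02, 3.10, 3.61, 3.66, 4.12]

# Multiple-regression slope for runoff, from the centered normal equations.
p, r, s = centered(log_precip), centered(log_runoff), centered(log_sed)
det = dot(p, p) * dot(r, r) - dot(p, r) ** 2
b_runoff = (dot(p, p) * dot(r, s) - dot(p, r) * dot(p, s)) / det

# Added variable plot for runoff: both axes are adjusted for precipitation.
e_sed = residuals(log_sed, log_precip)        # vertical axis
e_runoff = residuals(log_runoff, log_precip)  # horizontal axis
av_slope = dot(e_runoff, e_sed) / dot(e_runoff, e_runoff)

print(b_runoff, av_slope)  # the two slopes agree up to rounding error
```

When the predictors are highly correlated, the horizontal residuals have very little spread, which is why the slope in the added variable plot is poorly determined.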
Identifying Multicollinearity:
- Scatterplot and correlation matrices of the variables
- Correlation matrix of parameter estimates
- Tolerance
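
A minimal sketch of the last two diagnostics, assuming the usual definitions: the tolerance of a predictor is 1 - R^2 from regressing it on the other predictors, and its reciprocal is the variance inflation factor (VIF). With exactly two predictors and an intercept, R^2 is the squared correlation r^2 between them, and the correlation between the two slope estimates is -r. The data are made up (NOT the basin data), and the helper names are hypothetical.

```python
import math

def centered(v):
    """Subtract the mean from every element of v."""
    m = sum(v) / len(v)
    return [vi - m for vi in v]

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

# Made-up log10 values (NOT the basin data).
log_precip = [3.20, 3.35, 3.48, 3.60, 3.71, 3.80, 3.92, 4.01]
log_runoff = [3.05, 3.24, 3.31, 3.52, 3.58, 3.74, 3.80, 3.95]

pc, rc = centered(log_precip), centered(log_runoff)

# Correlation between the two predictors.
r = dot(pc, rc) / math.sqrt(dot(pc, pc) * dot(rc, rc))

# R^2 from regressing runoff on precipitation (the general definition).
slope = dot(pc, rc) / dot(pc, pc)
resid_ss = sum((ri - slope * pi) ** 2 for ri, pi in zip(rc, pc))
r_squared = 1.0 - resid_ss / dot(rc, rc)

tolerance = 1.0 - r_squared   # near 0 signals strong multicollinearity
vif = 1.0 / tolerance         # variance inflation factor

# Two-predictor case with an intercept: corr(b_precip, b_runoff) = -r.
corr_estimates = -r

print(r, tolerance, vif, corr_estimates)
```

A tolerance near 1 means the predictor carries information not already in the others; a tolerance near 0 means its coefficient is estimated with a greatly inflated variance.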