Sta 242 / Env 255
Homework 4
Due either Thursday, March 7 in class
or by 5pm on Friday, March 8 to 223C Old Chemistry
Last modified: Fri Mar 1 14:03:51 EST 2002
This homework corresponds to Chapter 9 of Sleuth.
To maximize your grade, read these homework guidelines.
Report all fitted regression lines in the format of the box above Section 7.4 (on page 180).
Some red spruce forests in the Appalachian Mountains show signs of decline, with many dead or dying trees. Environmental stress may contribute to this decline; deposition of airborne pollutants such as metals or acids tends to be heavier at higher elevations, where red spuce predominate. The dataset, spruce.txt, contains data on elevation and the percentage of dead or badly damaged trees, from 64 Appalachian sites (Johnson and Siccama, reported by the Committee on Monitoring and Assement of Trends in Acid Deposition, 1986). Eight of the sites are in southern states (West Virginia, Virginia and North Carolina); the remainder are northern (New Hampshire, Vermont and New York).
Dataset: "spruce.txt" describing elevation and percentage dead or damaged red spruce treesYou will write a 1-page summary of your analysis of this dataset, in the same format as Homework 2. Research questions: Use the data to describe the effect of elevation on percentage of damaged forest. What is the role of region (North or South) in this analysis? Is the relationship between elevation and percentage of damaged forest the same for the North and South, or does it change according to region? For whatever model you choose, give confidence intervals for the slope in both regions. Also give confidence intervals for the percent of damaged trees (or some transformation of percent damaged) at an elevation of 1200 m for the North and the South.
Project title and group members
Description of the data you will use. Where found? How collected? How does the data relate to the research question? What role will each variable play in exploring the general research question? Give the outcome (dependent, response, Y) and predictor (independent, X) variables you will use to answer the questions.
You'll submit a table summarizing the variables of interest. Column 1: Variable Name. Column 2: Indicate whether continuous, discrete, categorical. Column 3: units of the variable. Column 4: Number of observations for this variable.
The general questions you will answer, and hypothesized answers (i.e. what do you expect to see?). What results from these specific statistical methods are needed to support your hypothesized answer?
The statistical method(s) that you will use to help answer the question.
If the data were not provided by me, attach a copy of the dataset with labeled variables or you may provide the link to the web address where the data is located. If your dataset is large, handing in a subset of observations is acceptable.
The Duke Honor Code applies in our course and to these projects. It is assumed that you will not collaborate with students who may have worked with the same datasets in past regression courses. We will compare current posters to past posters if Honor Code violations are suspected.