STA 242/ENV
255
Optional Homework 5
Due Monday, April 22, 2002 by 5pm to 223 C Old Chemistry
Last modified: Thu Apr 18 11:29:42 EDT 2002
This homework is optional. Recall that homework is worth 10% of your final grade, and there are 500 homework points total. Each homework is worth 100 points, except HW1 which was a 200 point 2-week homework.
- Critique of 2 MP Presentations Follow the instructions given in class to submit a critique of 2 MP presentations. That is, for each project give:
- Name of presenter
- Title of project (rough title is fine)
- Research question
- What assumptions were necessary for the model chosen?
- Did the application of the model to these data raise any issues with assumptions?
- Measures of uncertainty for parameters critical to the research question: were they given? were they appropriate for the problem at hand?
- Model selection: Was any done? Was the process briefly described or justified?
- Model uncertainty: Did the author address the issue of model uncertainty either qualitatively or quantitatively?
Each critique should be about a paragraph in length.
What is the impact of the emergency locator transmitter (ELT) for aircraft?
70% of light aircraft that disappear in airspace are discovered. Of those that are discovered, 60% have an ELT. Of those that are not discovered, 90% do not have an ELT.
- What is the probability that an aircraft equipped with an ELT is not discovered?
- What is the probability that an aircraft without an ELT is discovered?
Further reading: p. 355-6 of Moore and McCabe
- In lab you performed a simulation exercise (#1,Lab 4/10/02). I also passed out Splus results from this exercise in class on April 11th.
- Sum the posterior probabilities for the top 5 models, and make a statement about the amount of model uncertainty in this problem. You may use language similar to that in the "Bayesian Model Averaging:A Tutorial" paper.
- Let's say you are using a variable selection technique when the number of explanatory variables is a substantial proportion of the sample size.
- What is the impact on the d.f. for model error?
- In this situation, compare the penalty terms for BIC and AIC for fixed MSE. Which criterion will favor a more parsimonious model?
- The following questions apply to the reading handed out entitled "Chapter 4: Criteria for Choice of Best Model" (from Myers, "Classical and Modern Regression with Applications"). Note that a 6-parameter model is fit to the data on page 168.
- Refer to the small table on the top of page 169. We will focus on site 17. What do the numbers -466 and -3675 represent for site 17?
- A model is used to compute each of the above quantities for site 17. Give the associated error degrees of freedom for each model.
- At site 17, does the ordinary residual overstate or understate the true prediction error?
- What does PRESS say about observation 17 with respect to its influence?
Exercise 11 on p. 592, based on
the space shuttle data. These are the original data; there are errors with the Sleuth data.
Updated 4/19
Turn in only these three parts:
- Part (a)
- Part (e)
- How do the estimated odds of failure change with each 5 degree increase in temperature?
It may be helpful for you to refer to the text on page 577 "Example of Confidence Interval for a Multiple .... ". For this homework, I am interested in your interpretations of estimates. Don't worry about confidence intervals for these exercises.
Splus hints:
Fit a logistic regression with failure
as the Response and temp as the explanatory variable. This is how:
Go to Statistics,
Regression and Generalized
Linear Models
Under Model Family, specify
binomial. The default link is logit; leave that alone for
now.
The model formula is specified the
same as in linear regression; in this case : failure ~ temp
click OK to run
For curiosity only: this is how you would test the significance of the "pressure" term in the presence of the "temperature" term in the logistic regression setting.
## read in the shuttle data, let's call it "shuttle"
reduced.mod_glm(failure~temp,family=binomial,data=shuttle)
full.mod_glm(failure~temp+pressure,family=binomial,data=shuttle)
summary.glm(reduced.mod)
summary.glm(full.mod)
anova.glm(reduced.mod,full.mod,test="Chisq")
Last modified: Thu Apr 18 11:40:09 EDT 2002