Most of the information that the text contains about multiple regression requires some knowledge of linear algebra. For that reason, you'll probably only want to use the text for reference/clarification. Refer mostly to the transparencies and your notes from lecture.

Reminder: This assignment will not be collected. Its purpose is to help you review the material we have covered in class and the text, but it is not to be viewed as an exhaustive list of topics/skills that you need to know.


Problem 1 (Wonnacott and Wonnacott, problem 13-5)

In the midterm election U.S. congressional elections (between Presidential elections), the party of the President usually loses seats in the House of Representatives. To measure this loss concretely, we take as our base the average congressional vote for the Presidents's party over the previous 8 elections; the amount that the congressional vote drops in a given midterm election, relative to this base, will be our standardized vote loss, Y.
Y depends on several factors, two of which seem important and easily measurable: X1 - Gallup poll rating of the President at the time of the midterm election (percent who approved of the way the President is handling his job) and X2 = change over the previous year in the real disposable annual income per capita.

Year   Y     X1     X2
1946  7.3%  32%   -$40
1950  2.0%  43%   $100
1954  2.3%  65%   -$10
1958  5.9%  56%   -$10
1962 -0.8%  67%    $60
1966  1.7%  48%   $100
From the above data (Tufte, 1974), the following multiple regression equation was computed: Yhat = 10.9 - 0.13*X1 - 0.034*X2


Problem 2 (Wonnacott and Wonnacott, problem 13-9)

In the previous problem, the congressional vote loss of the Presidents's party in midterm elections (Y) was related to the President's Gallup rating (X1) and change in real income over the previous year (X2). Specifically, the following regression was computed from n=6 points:

Yhat = 10.9 - 0.13*X1 - 0.034*X2
             (0.046)   (0.010)


Problem 3 (Wonnacott and Wonnacott, problem 13-11)

To determine the effect of the various influences on land value in Florida, the sale price of residential lots in the Kissimmee River Basin was regressed on several factors. With a data base of n=316 lots, the following multiple regression was calculated (Conner et al., 1973; via Anderson and Sclove, 1978):

Yhat = 10.3 - 1.5*X1 - 1.1*X2 - 1.34*X3 + ...
where
Y = price per front foot (in dollars)
X1 = year of sale (X1=1,...,5 for 1966,...,1970)
X2 = lot size (acres)
X3 = distance from the nearest paved road (miles)


Problem 4 (Wonnacott and Wonnacott, problem 14-1)

To help firms determine which of their executive salaries might be out of line, a management consultant fitted the following multiple regression equation from a data base of 270 executives under the age of 40:

SAL = 43.4 + 1.24*EXP + 3.60*EDUC + 0.74*MALE
             (0.30)     (1.20)      (1.10)
      residual standard deviation s=16.4
where
SAL = the executive's annual salary ($000)
EDUC = number of years of post-secondary education
EXP = number of years of experience
MALE = dummy variable, coded 1 for male, 0 for female

  1. From this regression a firm can calculate the fitted salary of each of its executives. If the actual salary is much lower or higher, it can be reviewed to see whether it is appropriate.
    Fred, who's 32, has been with the firm since he was 25; his annual salary is $126,000 annually. He got a 2-year MBA, following a 4-year undergraduate degree.
  2. In addition to identifying unusual slaries in specific firms, this regression can be used to answer questions about the economy-wide structure of executive salaries in all firms. For example,


Problem 5 (Wonnacott and Wonnacott, problem 14-18)

Suppose a study of quantity supplied as a function of price gave the following regression:

log(Q) = 5.2 + 1.3*log(P)
For this question, keep in mind that a change in log(X) is approximately equal to relative change in X itself.