STA 110B STA 110B Homework 12

11-1

  1. For the given data, [`X] = 3, [`Y] = 80, åxy = 70 and åx2 = 10, so that
    b =
    å
    xy

    å
    x2
    = 70
    10
    = 7
    a = _
    Y
     
    - b _
    X
     
    = 80-7·3 = 59 ,
    and the estimated regression line is
    ^
    Y
     
    = 59 + 7 X .
  2. Here is the graphic. The only points of interest here are the ones marked with a ``·'' .

    1. [^Y] = 59+7·3 = 80; this corresponds to the point marked with an ``o'' in the previous graph.
    2. [^Y] = 59+7·4 = 87; this corresponds to the point marked with an ``&'' in the previous graph.
    3. This corresponds to the estimated slope of the regression line, b = 7.

11-3

  1. Here, b = åxy /åx2 = 876/97 » 9.031 and a = [`Y]-b[`X] » 160 - 9.031·4.6 = 118.457, so that the estimated regression line is [^Y] = 118.457 + 9.031 X.
  2. For a radiation exposure of 5.0, we estimate the cancer mortality to be [^Y] = 118.457 + 9.031·5 = 163.612. For a radiation exposure of zero, the estimated cancer mortality is the estimated intercept, a = 118.457.
  3. The answer to b. with zero exposure is marked with an ``&'', the answer to part b. with exposure 5.0 with an ``o'' and the counties with a ``·''.

  4. Since the data arises from an uncontroled observational study, one can not conclude for any causal relationship between radiation exposure and cancer mortality. It can be the case that the observed positive relation between the two variables is due to the presence of some confounding variable.

11-5

a. mean; b. normal; c. easy; d. OLS, curve.

12-13

Since t = b/SE and SE = .27/t.025, we get t = 2.76. Also the hypotheses is H0: b £ 0, so 0.0025 < p-value < 0.005. Note that we used a normal standard table since the sample size is large: n = 1000.

Extra Exercise

  1. We have n = 32, so that df = 30 and t.025 = 2.05.
    b
    =
    b ±t.025 SE
    =
    102.289 ±2.05 ·24.23
    =
    102.289 ±49.43
    52.86 < b < 151.72
    a
    =
    a ±t.025 SE
    =
    -6553.57 ±2.04 ·1661.96
    =
    -6553.57 ±3390.40
    -9943.97 < a < -3163.17
  2. Yes, since the 95% confidence interval for b does not contain zero.
  3. The hypotheses to test is H0b = 100, so the t statistic is
    t = b-100
    SE
    = 102.289-100
    24.23
    = 0.094.
    We want to calculate a two-sided p-value, so, looking at the table, we recognize that p-value > 2·0.25, that is, the p-value is larger than 50%.
  4. Looking at the output tables, we see that s2 = 18761.24.


File translated from TEX by TTH, version 2.00.
On 24 Nov 1999, 11:11.