Practice
Final Exam
a) (6 pts) What would be the
percentile rank for a student with a GPA of 3.4?
z = (3.4 – 3.0)/.5 = .8, from Table A, percentile rank is 78.81
b) (3 pts) What can you say about the
shape of the distribution of GPA's
It is negatively skewed.
double-blind [placebo-controlled] simple random sample
random assignment to groups observational
Use normal approximation to binomial- mean =
np=200*.6=120, sd=sqrt {np(1-p)}=6.93.
i)
The
probability, if H0 is false, that you will reject H0
ii)
The
probability, computed assuming that H0
iii)
The
probability that H0
iv)
The
probability, computed assuming that H0 is true, that you will reject
H0
v)
The
probability that H0
vi)
The
probability, if H0 is false, that you will retain H0
a)
(2
pts) Which of the above corresponds to a
b)
(2
pts) Which corresponds to b
c)
d)
|
|
|
|
|
|
|
|
|
|
|
|
Is there a statistically
significant relationship between Tire Supplier and the Acceptable/Defective
variable? Use a
Calculated chi-square = 16.6, df=2, critical value
of chi-square is 5.99, so reject null, conclude there is a significant
relationship between Tire Supplier and the number of defective tires received.
T___ c) Assume the null
hypothesis is H0: m=0. If the null hypothesis is not
rejected, then the corresponding confidence interval (e.g., the 95% confidence
interval would correspond to an hypothesis test with a
F___ d)
8.
A
researcher is analyzing crime data from the 50 states. Below is JMP-In output comparing murder rates
across four U.S. geographical regions-- Midwest, Northeast, South, and
West.
Source DF Sum of Squares Mean Square F
Ratio
Model 3 _271.1157 90.3719 _8.70_
Error 46 _ 477.8285 10.3876 Prob>F
Total _49 748.94420
Alpha=
Abs(Dif)-LSD South West Northeast Midwest
South -3.03732 1.16772 1.57118 2.08391
West 1.16772 -3.36960 -2.95002 -2.44998
Northeast 1.57118 -2.95002 -4.04976 -3.57431
Midwest 2.08391 -2.44998 -3.57431
a)
b)
(5 pts) Interpret, in terms of the variables given in
the problem, the overall ANOVA results.
We can conclude that the true population means for murder rate among the four
regions are not equal.
c)
(5 pts) Interpret the results of the pairwise
comparisons.
The murder rate in the South is significantly
different from the murder rates in the other three regions. None of the other regions different
significantly from each other in murder rates.
P(at least 1) = 1 – P(none),
RSquare 0.108613
Root Mean Square Error 252.4688
Mean of Response 473.6
Observations (or Sum Wgts)
Term Estimate Std Error t Ratio Prob>|t|
Intercept 112.13722 153.6689 0.73 0.4691
Unemployment 56.780205 23.47839 2.42
a)
(3
pts) Interpret the slope of the regression equation.
For every percentage point of increase in unemployment, the auto theft rate
tends to increase up by 56 thefts per year per 100,000 autos.
b)
(3
pts) Interpret Root Mean Square Error
If we predict auto theft rate based on unemployment rate, our typical error
will be 252.5 thefts per year per 100,000 autos.
c)
(3
pts) Is unemployment a significant predictor of auto theft?
Yes, since beta1 is significant.
d)
(3
pts) What would you predict the auto theft rate would be for a state with an
unemployment rate of 7.5?
112.137 + 56.7802(7.5) = 538 thefts per year per 100,000 autos.
e)
(3
pts) Can we conclude that higher unemployment causes higher auto theft
rates? Why or why not?
No, because this is an observational study, and
observational studies do not allow the inference of causality. Many confounding variables could account for
the relationship between unemployment and auto theft rates.
11.
(6
pts) Suppose that 0.5% (.005) of all students seeking treatment at a school infirmary
are eventually diagnosed as having mononucleosis. Of those who do have mono, 90% complain of a sore throat. But 30% of those not having mono also have
sore throats. If a student comes to the
infirmary and says that he has a sore throat, what is the probability that he
has mono?
Bayes formula: Let A:Has mono, B:Has
sore throat, want P(A|B).
P(A|B) = (.90)(.005) / {(.90)(.005) + (.30)(.995)} = .0149
12.
A
researcher was interested in the influence of participation in school sports
and being employed on high school grades.
Mean GPA |
Employment Status |
|||
Not Employed |
Employed 0 to 10 hours/week |
Employed > 10 hours/week |
||
Varsity Sports? |
No |
3.0 |
3.3 |
3.2 |
Yes |
3.4 |
3.1 |
2.8 |
a)
(4
pts) Graph these data with employment status along the X-axis, and with one
line for the high school students who play varsity sports and another line for those
who don't.
3.4+
3.3+
3.2+
3.1+
3.0+
2.8+
0 0-10 >10
(Connect the x’s to get the line for the Varsity students, and the o’s to get
the line for the non-Varsity students.
|
|
|
|
|
Prob
> |
Varsity Status |
1 |
37.2 |
37.2 |
7.44 |
.040 |
Employment |
2 |
57.2 |
28.6 |
5.72 |
.171 |
Varsity * Employment |
1 |
94.3 |
94.3 |
18.86 |
.001 |
b) (5 pts) Interpret
the above table.
There is a main effect for Varsity Status on grades,
no main effect for Employment, and a significant interaction effect for Varsity
X Employment. Since the main effect for
Varsity Status is qualified by the significant interaction of Varsity Status
and Employment Status on GPA we will only interpret the interaction effect. In particular, we find that the effect of
playing sport on GPA depends on the employment status of the student.