STA 240 Take Home Midterm

Due: in class on 23 November.

Assignment:



1. Bug traps in open water

The data in file bugs.asc show the bug counts recorded from 30 sticky traps placed in the open water zone of a wetland in North Dakota. The traps are floating cylinders that are 25cm tall with a diameter of 6.25cm. The cylinder is coated with a sticky tape that traps any bugs that fly into it. The traps are set and then left for 24 hours. Two counts are recorded for each trap: the number of bugs caught on the lower 12.5cm of the cylinder; and the number of bugs caught on the upper 12.5cm of the cylinder. Hence the data file looks like
count height trap
286   low     1
113   high    1
225   low     2
108   high    2
195   low     3
 :     :      :
which means the count for the lower portion of trap 1 is 286 and the count for the upper portion of trap 1 is 113. Likewise the lower and upper portions of trap 2 have counts 225 and 108 respectively.

In open water, the researchers wish to know whether bugs are more likely to travel near the water surface, or a slight distance away from it.
  1. Give a statistic or plot that gives an idea how many bugs are typically trapped on a given day.
  2. Present a graphical display that sheds light on whether or not there is a difference between the number of bugs caught the high and low trap portions.
  3. Quantify the difference between the counts at the high and low portions. It may be most sensible to report a result that says: there tends to be X times as many bugs in the lower portion of the trap as compared to its upper portion.
  4. Give a plausible range of values for X (ie. a 95% CI).
  5. Comment on the scope of the inference carried out here. No more than 3 sentances please.


2. Ant foraging

The file lfarea.asc holds data on the size for pieces of leaf collected by atta ants upon returning from a foraging expedition. On a number of different nights, between 50 and 200 leaf pieces were collected from ants returning to their colony from a foraging expedition and the area of each piece was measured. This was done on repeated nights for various colonies on two different islands. The data here show the average leaf area over a given night for a given colony. Most colonies have data recorded on a number of different nights. Repeated samples for a given colony were taken at least two weeks apart. Each line of the data file gives: Note that colonies have been sampled multiple times. S1-S4 denote colonies from a small island where resources are minimal; M1-M5 denote colonies from a mid-sized island were the resources are less strained.

Researchers have two main questions of interest: Using one page or less, write a summary of your statistical analysis. Be sure to include:
  1. a brief statement of the problems of interst;
  2. a summary of your statistical findings;
  3. some discussion of the scope of inference.
You may include plots and Splus output which do not count against your one page limit; you can attach those to your write-up. But don't write more than a single page! I won't read beyond 1 page.

3. A strawberry trial

Trial was conducted to determine which variety of strawberries produces the most fruit. Plants were cultivated on a lattice of 4 x 8 plots (as shown below), and the pounds of fruit produced for each plot were recorded. A randomized complete block design was used with four 2 x 4 plot blocks as shown below. The data are available in the file berries.asc .

  1. Compute an F test to determine if there is a variety effect after adjusting for the block effect.
  2. Compute 95% confidence intervals for each of the variety effects.
  3. Determine which variety effects are significantly different from one another -- here it is sensible to adjust for the fact that multiple comparisons are being made.
  4. The yields from the rightmost column of plots seem rather low in comparison to the other plot yields. This is likely because the hedge at the end of the field was shading (or otherwise affecting) those plots. In light of this, how would you modify your estimates in 2.
  5. In light of all you've learned about this data, which of the varieties could acutally be the best yielding. What would you recommend to a planter who wanted to maximize his yield. You may assume that the conditions of this experiment are similar to the conditions the planter would grow his strawberries in.
G 5.8 V 6.3 Rl 4.9 F 6.5 Re 4.5 M 5.2 E 6.5 P 3.8 H
E 6.9 P 7.6 M 7.9 Re 5.6 G 7.0 V 5.5 F 4.0 Rl 2.7 E
D
V 7.6 F 6.4 Rl 5.0 G 6.9 P 7.4 E 5.3 Re 5.2 M 3.0 G
E 7.5 Re 7.0 M 6.1 P 7.2 G 6.5 F 5.6 V 5.8 Rl 1.4 E