[Return to Sta113 Home]

Graded Assignment


Assignment (due 3 November)

Complete the tasks described below and write a short (at most four pages, including graphs) report explaining what you've done and what it demonstrates.

The tasks

Throughout this lab assignment you will be simulating the heights of men randomly drawn from a large population of college-age men whose heights are normally distributed with mean 70 inches and standard deviation 2.5 inches.

  1. Simuate 100 different samples of 5 randomly selected men. For each of the 100 samples, determine the 95% confidence interval estimate of the population mean that it yields, using the "known" population standard deviation of 2.5 and the appropriate critical value determined from the standard normal distribution. Show a plot of the 100 confidence intervals as parallel line segments, indicating which ones did and which ones did not capture the population mean of 70 inches. (The "hints" section below contains a .m file that will plot this graph for you.) Determine what proportion of the intervals successfully captured the population mean. Repeat this with 1000 samples, but do not include the graph in your paper, only the proportion of intervals that captured the mean.

  2. In reality, the population standard deviation would not be known. Simulate 100 different samples of 5 randomly selected men once more. For each of the 100 samples, determine the 95% confidence interval estimate of the population mean that it yields, using the sample standard deviation s for each interval instead of sigma=2.5. For now, choose your critical value from a standard normal distribution as you did in step 1, even though that isn't the appropriate thing to do. Plot the intervals as you did in step 1 and report the proportion of them that capture the population mean. Repeat with 1000 random samples, but do not show the graph, just report the proportion of the intervals that caputure the mean.

  3. Repeat everything you did in step 2, only this time use the appropriate critical value from the t distribution with 4 degrees of freedom, instead of from the standard normal distribution. You should now be able to explain why the t-distribution, not the standard normal, is the appropriate one from which to choose your critical values when using the sample standard deviation.

  4. Repeat step 3 above but with a 99% confidence level instead of a 95% confidence level. (Do only 100 simulations, not 1000.) Show your confidence intervals. What do you find is different about the intervals? About the proportion capturing the mean?

  5. Repeat step 3 above (returning to a 95% confidence level) but with samples of size 20 instead of size 5. Note that your degrees of freedom is no longer 4. (Do only 100 simulations, not 1000.) Show your confidence intervals. What do you find is different about the intervals?

Helpful hints