[Return to Sta113 Home]
Graded Assignment
Assignment (due 22 October)
Complete the tasks described below and write a short (at most three pages, including graphs) report explaining what you've done and what it demonstrates.
- Your paper's goal should be to explain the Central Limit Theorem to your reader and to demonstrate it.
- Graphs will be an essential part of your paper, of course. These should be titled and clearly labeled.
- Graphs should be referred to at appropriate places in your paper and described clearly. They should be incorporated into the flow of your paper, not dumped in with the reader left to figure out what they mean.
- What a graph shows and what it means are not the same thing. For example, one might make a plot of number of home burglaries in a town versus the lunar phase. A description of what the graph shows would include statements of what the axes represent, the meaning of any odd color coding, etc. An interpretation of the meaning of the graph might include observations of trends, such as: "We note a clear association between number of burglaries and lunar phase that persists over many months. More burglaries tend to occur when the moon is close to new, suggesting that burglers favor dark conditions." The best way to present a graph in a scientific paper is to precede the graph with a description of what the graph shows, and to follow the graph with an interpretation of what the graph means.
- You may work with others while doing the MatLab simulations and you may discuss your results with others. However, you must write and understand your own MatLab code, generate your own graphs, and write your own paper.
- Don't forget to put your name on your paper.
- The paper is to be turned in at the beginning of the lab or recitation section on Friday, 22 October.
- Your paper will be graded in large part on how successfully it communicates and demonstrates the Central Limit Theorem using the example described in the tasks. Pay attention to how you word things, as details can matter. For example, "the average is..." is too vague. The average of what? In this lab, you'll be dealing with three different types of means: The mean of a population, the mean of a sample, and the mean of many sample means. Be clear which mean you mean.
The tasks
- Let X be the duration of a telephone call to the Durham Public Library. Suppose, as some studies have suggested is often the case, that X has an exponential distribution. Suppose further that X has a mean duration of two minutes.
- Simulate 5000 phone calls' durations and make a histogram of them. The MatLab command for simulating from an exponential distribution is exprnd.
- Suppose you were to take averages of X's in sets of four. That is, you take random samples of four independent phone calls and average their durations together. Simulate 5000 such sample averages and make a histogram of them.
- Repeat the above for samples of sizes 9 and size 100.
- For the histogram of sample means for sample size 100, superimpose an appropriate normal pdf on the graph.
- Make normal probability plots for each of the four simulation results above.
- For each of the four simulations, find the mean and standard deviation of the sample means. What appears to be the relationship between sample size and the mean of the sample means? Between sample size and the standard deviation of the sample means?