Lab 8 (Week of 10/25/99)

Step 0: Down-load this week's SAS/Insight program

Click here and this week's program will appear in your browser window. Click on "File>Save As..." in Netscape and choose "Save in: D: " then click "Save". The program should now be saved on the D drive of your computer. The file's name is "lab8.sas". Return to this page by choosing "GO>Back" from the Netscape menu bar. To get started, open SAS by clicking "Start>Programs>Statistics and Mathematics>SAS System v6.11". Once the SAS environment appears, click on "File>Open". In the "Open" window, type "D:\lab8.sas" in the "File name:" field and then click "Open". Next, find the button on the menu bar with a picture of a running person. Click on this button.

Step 1: Questions

This week's data set has 8 variables. Four correspond to sample means, the other 4 to sample medians, drawn from a population with mean 61.6 and variance 211.2. The data represent age at diagnosis of a particular disease; the population consists of women living in a large U.S. government study region. Mu4 is a column of 100 sample means calculated from random samples of size 4 drawn from this population; Mu16, Mu100, and Mu1600 are 100 sample means for samples of size 16, 100, and 1600. Columns Med4, Med16, Med100, and Med1600 are sample medians for samples of the indicated size.

Distribution of the Sample Mean. Before you begin this problem, open a new spreadsheet to record your findings. Calculate summaries of the distribution of each of the four columns Mu4, Mu16, Mu100, and Mu1600. Make note of the shape of each histogram and enter standard deviation of each column in column 2 of your empty spread sheet, enter the associated n (4, 16, 100, or 1600) in column 1 of the empty spread sheet. The central limit theorem says that (at least the later two of) these histograms should have approximately what distribution? Are the histograms consistent with this? The variance of the sample mean as a function of n is the population variance divided by n. For each n, calculate (can do in the second spread sheet) the theoretical variance of the sample means in each of the 4 cases given. Compare the theoretical to observed values.

Return to the Stat 110B lab page.


iversen@stat.duke.edu
last updated 26 October 1999