STA113: Probability and Statistics in Engineering

Data Sets

Text:Mendenhall & Sincich, Statistics for Engineering and the Sciences (4th edn)

113 Home Page Syllabus FAQ

Data sets: Some lecture examples will feature data from published research, books, or other sources; I'll try to make the data available here so you can try out the methods used in class or experiment with other methods. Also some homework problems will have data sets; I'll put those here too. Click on any item below and you will find either the raw data or a further page of explanation leading to the raw data. Use the Save As feature of your web browser to save the data to a file, which you can then enter into Minitab using the read command. To see a sample Minitab session illustrating this, click on Anscombe below.
NameDescription
M+S Exercises Data sets for most exercises in Mendenhall & Sincich 4/e
Anscombe A classic pedagogic data set illustrating the need to use graphical methods and residual analysis to assess regression fits.
Fish DDT Fish DDT data set from Appendix III of Mendenhall & Sincich, in character encoding; digital encoding also available, suitable for MatLab.
CPU CPU times of 1000 computer jobs from Appendix IV of M & S.
Iron Percentage iron contents for 390 ore samples from Appendix V of M & S.
Cigarette Cigarette data set from Appendix VI of M & S; digital encoding also available.
Fuel Automobile fuel data; subsets for Mazda and Chevrolet also available.
Andrews 71 data sets from Data: A Collection of Problems from Many Fields for the Student and Research Worker by D.F. Andrews and A.M. Herzberg (look at T00.0 for brief explanations, or at the book for full ones).
Hand et al. 510 data sets from Handbook of Small Data Sets by Hand, Daly, Lunn, McConway, and Ostrowski.
Duke Statistics Links to many other data archives, including
StatLib CMU's ``StatLib'' statistics archive which in turn includes their
DASL ``Data and Story'' (DASL, pronounced like ``dazzle'') archive.