Course Home
Page
Course Description
In this course, we learn approaches to analyzing multivariate data sets, emphasizing the analysis of variance, linear regression, and logistic regression. We learn techniques for checking the appropriateness of proposed models, such as residual analyses and case influence diagnostics, and techniques for selecting models. We emphasize data analysis over mathematical theory.
Course Objectives
Logistics
Prerequisites
The primary text is:
Ramsey, F. L. and Schafer, D. W. (2002), The Statistical Sleuth: Second Edition, Duxbury. Other resources are posted on the STA 210 course web site on Sakai.
Computing
We will use the statistical software package R for analyzing data.
It can be downloaded for free at
http://www.r-project.org/.
Alternatively, R is available on the public computers on campus.
Calculator
Students don't need a calculator for this course.
Schedule of Topics
We will cover the topics in the table below. We may spend
different amounts of time on each topic than shown, depending on the
interests of the
class participants.
Review of inference for one and two sample problems | Chapters 1, 2, 3 |
2 lectures |
One way analysis of variance
(ANOVA) |
Chapters 5, 6 |
2 lectures |
Simple linear regression model |
Chapter 7, 8 |
2 lectures |
Multiple linear regression model
and general ANOVA |
Chapter 9, 10, 11, 12, 13, 14 |
8 lectures |
Time series |
Chapter 15 |
1 lectures |
Logistic regression model |
Chapter 18, 19, 20, 21 |
3 lectures |
Generalized linear models |
Chapter 22 |
3 lectures |
Missing Data |
Supplemental notes |
3 lectures |
Algorithmic prediction methods |
Supplemental notes |
2 lectures |
Graded work
Graded work for the course will consist of two term exams, methods and data analysis assignments, writing assignments, and a final project. There is no final exam. Students' final grades will be determined as follows:
Methods and Data Analysis Assignments |
35% |
Research Project |
15% |
Exam 1 |
25% |
Exam 2 |
25% |
There are no make-ups for exams, assignments, or the final project except for medical or familial emergencies or for reasons approved by the instructor before the due date. See the instructor in advance of relevant due dates to discuss possible alternatives.
Descriptions of graded work
Methods and Data Analysis Assignments:
Methods and Data Analysis assignments are posted on the Statistics 210 course web
site on Sakai. Students turn in these assignments at the beginning
of class on the due date. Students are permitted to work with
others on the assignments, but each person must write up and turn in
her or his own answers. The Methods and Data Analysis assignments include questions on
the computational and the mathematical aspects of the methods that
underpin the statistical models we learn during the semester, and
questions that ask students to apply the modeling skills discussed
during the semester.
Research Project:
For the final project, students analyze a data-based
research question of their choosing, subject to the instructor's
approval. The data should comprise several variables amenable to
statistical analyses via regression modeling. Students can bring
in their own research data sets, or they can ask the instructor for
assistance with identifying appropriate data. Students present their results in a poster session.
Detailed instructions are available on the Sakai site.
Term Exams:
The exams will cover mathematical and conceptual aspects of building
regression models. One exam will be after the unit on
multiple regression, which is approximately half way through the
course. The other will be towards the end of the course.
Students are expected to abide by Duke's Community Standard for all
work
for this course. Violations of the Standard will result in a
zero grade for the relevant assignment and will be reported to the Dean of
Students for
adjudication. Additionally, there may be penalties to the final grade for the course. Ignorance of what constitutes academic dishonesty
is
not a justifiable excuse for violations.
For the exams, students are required to work alone. For the
Methods and Data Analysis assignments, students may work
with a study group
but each student must write up and submit his or her own answers. For the
final project, students are required to perform the data analysis
and make the poster individually (although students can consult with
the instructor and discuss ideas with other students in the class).