The following workshops will be held at Duke. Specific locations TBA. Sign-up links will be posted soon.

  • 03/27/2018 - Introduction to R (6-8pm) - John Little & John Herndon - Sign up

A gentle introduction to the basics of R using RStudio. Learn about managing your R projects, wrangling data, and writing clear code using the Tidyverse collection of R packages. If time allows, students will also get brief introductions to visualization with ggvis and mapping with leaflet. Attendees will have the opportunity to supplement the materials covered in this workshop with free academic access to interactive online training. Please note that laptops with RStudio pre-installed are required for this workshop.

  • 03/28/2018 - Data manipulation with dplyr & tidyr - Instructor TBA - Sign up TBA

  • 03/29/2018 - Working with large data (6-8pm) - Michael Akande - Sign up TBA

Tips and tricks for working with large datasets in R. Laptops with R and RStudio required.

  • 04/02/2018 - ggplot2 (6-8pm) - Angela Zoss - Sign up

In this workshop we will focus on ggplot2, a library for R that creates clear and well-designed visualizations and that plays well with other tidyverse packages. While prior experience with ggplot2 and with other tidyverse packages is not required, some basic familiarity with R is expected. Please consider attending (or viewing a recording of) our Introduction to R workshop before attending this workshop on ggplot2. We will use RStudio and R Markdown files for all exercises. Laptops are required. Please make sure you come with RStudio and the tidyverse package installed. You may also want to install the knitr package so that you can compile the entire R Markdown file.
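One possible way to check the package prerequisites above is sketched below. It assumes R is already installed and that a CRAN mirror is reachable from your laptop; the variable names are only illustrative. Run it in the RStudio console before the workshop.

```r
# Packages named in the workshop description
pkgs <- c("tidyverse", "knitr")

# requireNamespace() returns TRUE if a package is already installed
installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)

# Install only the packages that are missing
missing_pkgs <- pkgs[!installed]
if (length(missing_pkgs) > 0) {
  install.packages(missing_pkgs)
}

# Loading the tidyverse also attaches ggplot2
library(tidyverse)
```

If `library(tidyverse)` loads without an error, the setup should be sufficient for the exercises.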

  • 04/03/2018 - Introduction to Amazon Web Services (AWS) (6-8pm) - Brian Beach - Sign up TBA

Amazon Web Services (AWS) offers reliable, scalable, and inexpensive cloud computing. Its services cover every step of the analytics process chain: data warehousing, business intelligence, batch processing, stream processing, machine learning, and data workflow orchestration. This workshop will introduce you to the tools available to help you during DataFest. Note that AWS is offering credits to DataFest participants who want to complete their analysis in the cloud.

  • 04/04/2018 - Easy Interactive Charts and Maps with Tableau (6-8pm) - Eric Monson - Sign up

Tableau Public (available for both Windows and Mac) is free software that allows individuals to quickly and easily explore their data with a wide variety of visual representations, as well as create interactive web-based visualization dashboards. This workshop will focus on using Tableau Public to create data visualizations, starting with an overview of how the program thinks about data, common data manipulation and loading tasks, and the terminology used. Activities will include a sample data visualization and mapping project, which will give people hands-on experience using Tableau’s basic chart types and dashboard creation tools. We will also discuss publishing to the Tableau Public web server and related services and tools, like the full Tableau Desktop application (free for full-time students). Laptops with Tableau Public pre-installed are required for this workshop.

  • 04/05/2018 - Machine Learning and Data Mining (6-8pm) - Liz Lorenzi & Isaac Levine - Sign up TBA

Introduction to machine learning and data mining algorithms. Laptops with R and RStudio required.

  • TBA - Shiny