class: center, middle, inverse, title-slide

# Welcome to Data Science
🙌

### Dr. Çetinkaya-Rundel

---
layout: true

<div class="my-footer">
<span>
Dr. Mine Çetinkaya-Rundel -
<a href="http://www2.stat.duke.edu/courses/Fall18/sta112.01/schedule" target="_blank">stat.duke.edu/courses/Fall18/sta112.01</a>
</span>
</div>

---
class: center, middle

# Hello world!

---

## What is data science?

- <i class="fa fa-database fa"></i> + <i class="fa fa-flask fa"></i> = data science?

--

- <i class="fa fa-database fa"></i> + <i class="fa fa-code fa"></i> = data science?

--

- <i class="fa fa-database fa"></i> + <i class="fa fa-user fa"></i> + <i class="fa fa-code fa"></i> = data science?

--

- <i class="fa fa-database fa"></i> + <i class="fa fa-users fa"></i> + <i class="fa fa-code fa"></i> = data science?

--

<br>

Data science is an exciting discipline that allows you to turn raw data into understanding, insight, and knowledge. We're going to learn to do this in a `tidy` way -- more on that later!

---

# What is this course?

This course is an introduction to data science, with an emphasis on statistical thinking.

--

**Q - What data science background does this course assume?**
A - None.

--

**Q - Is this an intro stat course?**
A - While statistics `\(\ne\)` data science, they are very closely related and have a tremendous amount of overlap. Hence, this course is a great way to get started with statistics. However, this course is **not** your typical high school statistics course.

--

**Q - Will we be doing computing?**
A - Yes.

---

# What is this course?

**Q - Is this an intro CS course?**
A - No, but many themes are shared.

--

**Q - What computing language will we learn?**
A - R.

--

**Q - Why not language X?**
A - We can discuss that over ☕.

---

## Where is this course?
<br><br><br><br><br><br><br>

.large[
.center[
[bit.ly/sta112-f18](http://bit.ly/sta112-f18)
]
]

---
class: center, middle

# Data in the wild

---

# Data from wearables

**A year as told by fitbit**

.pull-left[
by Nick Strayer
http://livefreeordichotomize.com/2017/12/27/a-year-as-told-by-fitbit/
]

.pull-right[
]

---

# R-Ladies chapters

**R-Ladies global tour**

.pull-left[
by Maelle Salmon
http://www.masalmon.eu/2017/10/06/globalrladiestour/
]

.pull-right[
]

---

# Trump's tweets

**Text analysis of Trump's tweets confirms he writes only the (angrier) Android half**

.pull-left[
by David Robinson (Stack Overflow)
http://varianceexplained.org/r/trump-tweets/
]

.pull-right[
]

---

## Greatest Twitter scheme of all time

<img src="img/bohemian-rhapsody.png" width="381" style="display: block; margin: auto;" />

.small[
https://gist.github.com/mine-cetinkaya-rundel/03d7516dea1e5f2613a5d71c28edb08d
]

---
class: center, middle

# Your turn!

---

## Create a GitHub account

.instructions[
Go to [github.com](https://github.com/), and create an account (unless you already have one).
]

Tips for selecting a username:<sup>✦</sup>

.midi[
- Incorporate your actual name.
- Reuse your username from other contexts, e.g., Twitter or Slack.
- Pick a username you'll be comfortable revealing to your future boss.
- Shorter is better than longer.
- Be as unique as possible in as few characters as possible.
- Make it timeless. Don’t highlight your current university, employer, etc.
- Avoid words laden with special meaning in programming, like `NA`.
]

.instructions[
Once done, place a green sticky on your laptop. If you have questions, place a pink sticky.
]

.footnote[
<sup>✦</sup> Source: [Happy git with R](http://happygitwithr.com/github-acct.html#username-advice) by Jenny Bryan
]

---

## Join RStudio Cloud

.instructions[
Go to [http://bit.ly/sta112-rscloud-join](http://bit.ly/sta112-rscloud-join), and log in with your GitHub credentials.
]

.instructions[
Once done, place a green sticky on your laptop. If you have questions, place a pink sticky.
]

---

## Create your first data visualization

- Once you log on to RStudio Cloud, click on this course's workspace "STA 112FS - Fall 2018".
- Make a copy of the project called UN Votes, and launch it.
- In the Files pane in the bottom right corner, spot the file called `unvotes.Rmd`. Open it, and then click on the "Knit" button.
- Go back to the file, change the name at the top to yours (in the `yaml` -- we'll talk about what this means later), and knit again.
- Then change the country names to those you're interested in. Your spelling and capitalization should match how the countries appear in the data, so take a peek at the Appendix to see how the country names are spelled. Knit again. And voilà, your first data visualization!

.instructions[
Once done, place a green sticky on your laptop. If you have questions, place a pink sticky.
]

---

## Join Slack

- Go to http://bit.ly/sta112-slack-join and join the course Slack.
- Download the Slack app and keep it open.