The Data Scientist’s Toolbox
Offered By: Johns Hopkins University via Coursera
Course Description
Overview
In this course you will get an introduction to the main tools and ideas in the data scientist's toolbox. The course gives an overview of the data, questions, and tools that data analysts and data scientists work with. There are two components to this course. The first is a conceptual introduction to the ideas behind turning data into actionable knowledge. The second is a practical introduction to the tools that will be used in the program, such as version control, Markdown, Git, GitHub, R, and RStudio.
Syllabus
- Data Science Fundamentals
  - In this module, we'll introduce and define data science and data itself. We'll also go over some of the resources that data scientists use to get help when they're stuck.
- R and RStudio
  - In this module, we'll help you get up and running with both R and RStudio. Along the way, you'll learn some basics about both and why data scientists use them.
- Version Control and GitHub
  - During this module, you'll learn about version control and why it's so important to data scientists. You'll also learn how to use Git and GitHub to manage version control in data science projects.
- R Markdown, Scientific Thinking, and Big Data
  - During this final module, you'll learn to use R Markdown and get an introduction to three concepts that are incredibly important to every successful data scientist: asking good questions, experimental design, and big data.
Taught by
Jeff Leek
Related Courses
- A Beginner’s Guide to VCS and Git (Packt via FutureLearn)
- Preparing for the AZ-400 Azure DevOps Exam (A Cloud Guru)
- Source Control with Git (A Cloud Guru)
- Advanced Reproducibility in Cancer Informatics (Johns Hopkins University via Coursera)
- Advanced Application Management with Red Hat OpenShift (Red Hat via Coursera)