Data Management and Preparation Using R
Offered By: Pluralsight
Course Description
Overview
Data management is a key part of data analysis. Importing, selecting a class, cleaning, and filtering are the aspects of data management we will cover.
Have you ever encountered problems in data analysis just because the data was not clean, had a wrong format, or was simply messy? Data preparation is an immensely important yet overlooked field in data science. Most of the time of a data professional is not spent analyzing or visualizing, it is spent getting data ready as clean and well-structured as possible. R is a widely used open source tool with an active user community. This community created high quality add on packages for data preparation. In this course, Data Management and Preparation Using R, you will not only learn about data preparation in R Base, you will also learn about those add on packages that make R so powerful. First, you'll learn about data importing, cleaning, and structuring (selecting the right class). Next, you'll explore data querying. Finally, you will learn about dplyr, tidyr, reshape2 and data.table. At the end of this course, you will be able to select the right tools and efficiently perform data import, formatting, cleaning, and querying.
Have you ever encountered problems in data analysis just because the data was not clean, had a wrong format, or was simply messy? Data preparation is an immensely important yet overlooked field in data science. Most of the time of a data professional is not spent analyzing or visualizing, it is spent getting data ready as clean and well-structured as possible. R is a widely used open source tool with an active user community. This community created high quality add on packages for data preparation. In this course, Data Management and Preparation Using R, you will not only learn about data preparation in R Base, you will also learn about those add on packages that make R so powerful. First, you'll learn about data importing, cleaning, and structuring (selecting the right class). Next, you'll explore data querying. Finally, you will learn about dplyr, tidyr, reshape2 and data.table. At the end of this course, you will be able to select the right tools and efficiently perform data import, formatting, cleaning, and querying.
Syllabus
- Course Overview 1min
- Introduction 13mins
- Selecting Suitable Classes and Importing Data 26mins
- Cleaning Data with tidyr 26mins
- Data Filtering and Querying with dplyr and data.table 44mins
- Course Recap and Your Next Steps 6mins
Taught by
Martin Burger
Related Courses
Data Wrangling with MongoDBMongoDB via Udacity Getting and Cleaning Data
Johns Hopkins University via Coursera 软件包在流行病学研究中的应用 Using software apps in epidemiological research
Peking University via Coursera Creating an Analytical Dataset
Udacity Implementing ETL with SQL Server Integration Services
Microsoft via edX