Manage Invalid, Duplicate, and Missing Data in Python
Offered By: Pluralsight
Course Description
Overview
Cleaning data is one of those tasks that is not fancy, but key to any data application. This course will teach you the skills and knowledge of data cleaning in Pandas needed to convert your datasets from raw and useless to clean and useful.
Regardless of your line of work; data is everywhere. Today, we generate more data per second than ever before; however, this data is usually raw, dirty, and frequently unusable. In this course, Manage Invalid, Duplicate, and Missing Data in Python, you’ll gain the ability to clean your data to make it usable for any application you may need. First, you’ll explore how to handle missing values and how to fill NaN columns. Next, you’ll discover how to deal with duplicate rows on a subset of columns. Finally, you’ll learn how to cope with invalid values and how to fix or remove them. When you’re finished with this course, you’ll have the skills and knowledge of data cleaning in Pandas needed to convert your datasets from raw and useless to clean and useful.
Regardless of your line of work; data is everywhere. Today, we generate more data per second than ever before; however, this data is usually raw, dirty, and frequently unusable. In this course, Manage Invalid, Duplicate, and Missing Data in Python, you’ll gain the ability to clean your data to make it usable for any application you may need. First, you’ll explore how to handle missing values and how to fill NaN columns. Next, you’ll discover how to deal with duplicate rows on a subset of columns. Finally, you’ll learn how to cope with invalid values and how to fix or remove them. When you’re finished with this course, you’ll have the skills and knowledge of data cleaning in Pandas needed to convert your datasets from raw and useless to clean and useful.
Syllabus
- Course Overview 1min
- Why Clean Data? 6mins
- Handling Missing Values in Your Data 24mins
- Handling Duplicate Values in Your Data 9mins
- Handling Invalid Values in Your Data 14mins
Taught by
Axel Sirota
Related Courses
Data Wrangling with MongoDBMongoDB via Udacity Getting and Cleaning Data
Johns Hopkins University via Coursera 软件包在流行病学研究中的应用 Using software apps in epidemiological research
Peking University via Coursera Creating an Analytical Dataset
Udacity Implementing ETL with SQL Server Integration Services
Microsoft via edX