YoVDO

Data Wrangling with Python

Offered By: Pluralsight

Tags

Data Analysis Courses Machine Learning Courses Python Courses Data Transformation Courses Data Wrangling Courses Data Normalization Courses One Hot Encoding Courses

Course Description

Overview

In this course, Data Wrangling with Python, you'll learn about various functions and procedures that will help you get your data in order, providing a clean and well-constructed dataset for further data analysis and machine learning.

Machine Learning and Data analytics in general follows the garbage-in/garbage-out principle. If you want to learn from or predict based on your data, you need to make sure that data is well constructed and cleaned. This course, Data Wrangling with Python 3, is aimed at helping you do exactly that. First, you’ll see how to merge data from different sources using the methods concat, append, and merge. Next, you’ll discover how to combine data into groups. The primary function used here is groupby. In the next two sections, you’ll explore how to transform and normalize data. You’ll learn why these processes are necessary, and then proceed to see how they work in practice. Finally, you’ll examine important processes such as One Hot Encoding, which enables further processing during data analysis. When you’re finished with this course, you’ll have thorough knowledge of data wrangling which will help you immensely during your data analysis and machine learning projects.

Syllabus

  • Course Overview 1min
  • Concatenating and Merging Data from Different Sources 24mins
  • Combining Data into Groups 12mins
  • Normalizing Data with Pandas 11mins
  • Reshaping Data with Python 13mins
  • Data Encoding with Python 15mins

Taught by

Pratheerth Padman

Related Courses

Applied Data Science Capstone
IBM via Coursera
Data Analytics for Business with Tableau
Coventry University via FutureLearn
Data Analytics for Business with Tableau Training
Tableau via FutureLearn
Capstone: Data Science Problem in Linear Algebra Framework
Howard University via Coursera
Learn dplyr
Codecademy