YoVDO

Data Cleaning in Python Essential Training

Offered By: LinkedIn Learning

Tags

Data Cleaning Courses Python Courses Data Manipulation Courses Data Organization Courses Data Validation Courses

Course Description

Overview

Improve the overall analytic workflow of your organization by boosting your data cleaning skills in Python.

Syllabus

Introduction
  • Why is clean data important?
  • What you should know
  • Using GitHub Codespaces with this course
1. Bad Data
  • Types of errors
  • Missing values
  • Bad values
  • Duplicates
2. Causes of Errors
  • Human errors
  • Machine errors
  • Design errors
  • Challenge: UI design
  • Solution: UI design
3. Detecting Errors
  • Schemas
  • Validation
  • Finding missing data
  • Domain knowledge
  • Subgroups
  • Challenge: Find bad data
  • Solution: Find bad data
4. Preventing Errors
  • Serialization formats
  • Digital signature
  • Data pipelines and automation
  • Transactions
  • Data organization and tidy data
  • Process and data quality metrics
  • Challenge: ETL
  • Solution: ETL
5. Fixing Errors
  • Renaming fields
  • Fixing types
  • Joining and splitting data
  • Deleting bad data
  • Filling missing values
  • Reshaping data
  • Challenge: Workshop earnings
  • Solution: Workshop earnings
Conclusion
  • Next steps

Taught by

Miki Tebeka

Related Courses

Getting and Cleaning Data
Johns Hopkins University via Coursera
数据结构与算法第二部分 | Data Structures and Algorithms Part 2
Peking University via edX
社会调查与研究方法 (下)Methodologies in Social Research (Part 2)
Peking University via Coursera
統計学Ⅰ:データ分析の基礎 (ga014)
University of Tokyo via gacco
Fundamentos do Google para o Ensino
Fundação Lemann via Coursera