Data Cleaning in Python Essential Training
Offered By: LinkedIn Learning
Course Description
Overview
Improve the overall analytic workflow of your organization by boosting your data cleaning skills in Python.
Syllabus
Introduction
- Why is clean data important?
- What you should know
- Using GitHub Codespaces with this course
- Types of errors
- Missing values
- Bad values
- Duplicates
- Human errors
- Machine errors
- Design errors
- Challenge: UI design
- Solution: UI design
- Schemas
- Validation
- Finding missing data
- Domain knowledge
- Subgroups
- Challenge: Find bad data
- Solution: Find bad data
- Serialization formats
- Digital signature
- Data pipelines and automation
- Transactions
- Data organization and tidy data
- Process and data quality metrics
- Challenge: ETL
- Solution: ETL
- Renaming fields
- Fixing types
- Joining and splitting data
- Deleting bad data
- Filling missing values
- Reshaping data
- Challenge: Workshop earnings
- Solution: Workshop earnings
- Next steps
Taught by
Miki Tebeka
Related Courses
Getting and Cleaning DataJohns Hopkins University via Coursera 数据结构与算法第二部分 | Data Structures and Algorithms Part 2
Peking University via edX 社会调查与研究方法 (下)Methodologies in Social Research (Part 2)
Peking University via Coursera 統計学Ⅰ:データ分析の基礎 (ga014)
University of Tokyo via gacco Fundamentos do Google para o Ensino
Fundação Lemann via Coursera