Data Cleaning in Python Essential Training
Offered By: LinkedIn Learning
Course Description
Overview
Improve the overall analytic workflow of your organization by boosting your data cleaning skills in Python.
Syllabus
Introduction
- Why is clean data important?
- What you should know
- Using GitHub Codespaces with this course
- Types of errors
- Missing values
- Bad values
- Duplicates
- Human errors
- Machine errors
- Design errors
- Challenge: UI design
- Solution: UI design
- Schemas
- Validation
- Finding missing data
- Domain knowledge
- Subgroups
- Challenge: Find bad data
- Solution: Find bad data
- Serialization formats
- Digital signature
- Data pipelines and automation
- Transactions
- Data organization and tidy data
- Process and data quality metrics
- Challenge: ETL
- Solution: ETL
- Renaming fields
- Fixing types
- Joining and splitting data
- Deleting bad data
- Filling missing values
- Reshaping data
- Challenge: Workshop earnings
- Solution: Workshop earnings
- Next steps
Taught by
Miki Tebeka
Related Courses
Data AnalysisJohns Hopkins University via Coursera Computing for Data Analysis
Johns Hopkins University via Coursera Scientific Computing
University of Washington via Coursera Introduction to Data Science
University of Washington via Coursera Web Intelligence and Big Data
Indian Institute of Technology Delhi via Coursera