Data Cleaning in Python Essential Training
Offered By: LinkedIn Learning
Course Description
Overview
Improve the overall analytic workflow of your organization by boosting your data cleaning skills in Python.
Syllabus
Introduction
- Why is clean data important?
- What you should know
- Using GitHub Codespaces with this course
- Types of errors
- Missing values
- Bad values
- Duplicates
- Human errors
- Machine errors
- Design errors
- Challenge: UI design
- Solution: UI design
- Schemas
- Validation
- Finding missing data
- Domain knowledge
- Subgroups
- Challenge: Find bad data
- Solution: Find bad data
- Serialization formats
- Digital signature
- Data pipelines and automation
- Transactions
- Data organization and tidy data
- Process and data quality metrics
- Challenge: ETL
- Solution: ETL
- Renaming fields
- Fixing types
- Joining and splitting data
- Deleting bad data
- Filling missing values
- Reshaping data
- Challenge: Workshop earnings
- Solution: Workshop earnings
- Next steps
Taught by
Miki Tebeka
Related Courses
Rails with Active Record and Action PackJohns Hopkins University via Coursera Excel Skills for Business: Intermediate II
Macquarie University via Coursera Programming 103: Saving and Structuring Data
Raspberry Pi Foundation via FutureLearn Everyday Excel, Part 1
University of Colorado Boulder via Coursera Creating Dashboards in Google Spreadsheets
Coursera Project Network via Coursera