Between the Spreadsheets - Classifying and Fixing Dirty Data for Data Science
Offered By: Data Science Dojo via YouTube
Course Description
Overview
Explore real-world examples of dirty data and its impact on decision-making, reporting, analytics, AI, and machine learning in this 58-minute video presentation by Susan Walsh. Learn quick and accurate methods for checking and modifying data in Excel, regardless of experience level, while understanding the importance of data accuracy and maintenance. Discover best practices for identifying anomalies and implementing effective data cleanup processes to transform and elevate your data analysis skills. The presentation covers topics such as defining dirty data, its consequences, ensuring data accuracy, maintaining and spot-checking data, exploring other tools, understanding the dirty data maturity model, and concludes with a summary and Q&A session.
Syllabus
Introduction
What is dirty data
The consequences of dirty data
Ensuring data accuracy
Maintain and spot-check your data
Other tools
The dirty data maturity
Summary
QnA
Taught by
Data Science Dojo
Related Courses
Interprofessional Healthcare InformaticsUniversity of Minnesota via Coursera Data Science at Scale - Capstone Project
University of Washington via Coursera Implementing ETL with SQL Server Integration Services
Microsoft via edX Introduzione a R
University of Modena and Reggio Emilia via EduOpen Практики работы с данными средствами Power Query и Power Pivot
Saint Petersburg State University via Coursera