Overview and Importance of Data Quality
Offered By: Association for Computing Machinery (ACM) via YouTube
Course Description
Overview
Syllabus
Overview and Importance of Data Quality for Machine Learning Tasks
Acknowledgements
Data Preparation in Machine Learning
Challenges with Data Preparation
Data Quality Analysis can help..
Different personas in enterprise setting..
To put it all together
To summarize
Data Quality Metrics
Common Data Cleaning Techniques
Is data cleaning always helpful for ML pipeline?
Insights: Impact of different cleaning techniques
In conclusion
Why it happens?
Why Imbalanced Classification is Hard?
Evaluation Metrics for Imbalanced Datasets Accuracy Paradox
Factors affecting class imbalance
Affecting Factor: Imbalance Ratio
Affecting Factor: Overlap
Affecting Factor: Smaller sub-concepts
Affecting Factor: Dataset Size
Affecting Factor: Combined Effect
Modelling Strategies: Types
Resampling Techniques
Bayes Impact index
Taught by
Association for Computing Machinery (ACM)
Related Courses
Data Wrangling with MongoDBMongoDB via Udacity Getting and Cleaning Data
Johns Hopkins University via Coursera 软件包在流行病学研究中的应用 Using software apps in epidemiological research
Peking University via Coursera Creating an Analytical Dataset
Udacity Implementing ETL with SQL Server Integration Services
Microsoft via edX