Machine Learning on Non-Curated Data
Offered By: EuroPython Conference via YouTube
Course Description
Overview
Explore machine learning techniques for handling non-curated data in this 43-minute EuroPython Conference talk. Delve into practical solutions for two common dirty-data problems: missing values and non-normalized entries. Learn how to apply standard machine learning tools such as scikit-learn to data with these errors. Discover why imputation combined with missingness indicators is an effective way to handle missing values, and understand how to build vector representations for non-normalized categories. Gain insights from theoretical analyses and recent machine learning publications to make your data science workflow more efficient when working with imperfect datasets.
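As a rough illustration of the two ideas named above, here is a minimal sketch using scikit-learn. It is not taken from the talk: the toy arrays and category strings are made up, and character n-gram counts are used only as one simple stand-in for a vector representation of non-normalized categories (the talk may present different encoders).

```python
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.impute import SimpleImputer
from sklearn.metrics.pairwise import cosine_similarity

# --- Missing values: imputation plus missingness indicators ---
# Toy numeric data (hypothetical), with one gap in each column.
X = np.array([[1.0, 10.0],
              [np.nan, 20.0],
              [3.0, np.nan]])

# Fill gaps with the column mean and append one binary column per
# feature that contained missing values during fit.
imputer = SimpleImputer(strategy="mean", add_indicator=True)
X_imputed = imputer.fit_transform(X)  # 2 imputed columns + 2 indicator columns

# --- Non-normalized categories: a vector representation ---
# Hypothetical dirty entries: case variants, typos, suffixes.
categories = ["police officer", "Police Officer II",
              "polce officer", "fire fighter"]

# Character 3-gram counts (an assumption for this sketch): entries that
# share most of their substrings end up with similar vectors, so no
# manual cleaning into canonical categories is needed.
vectorizer = CountVectorizer(analyzer="char_wb", ngram_range=(3, 3))
C = vectorizer.fit_transform(categories)

# Pairwise similarity between the encoded entries.
sim = cosine_similarity(C)
```

With this encoding, the misspelled "polce officer" stays closer to "police officer" than "fire fighter" does, which is the property a downstream model can exploit. The indicator columns let the model learn from the fact that a value was missing, not only from the imputed value.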
Syllabus
Gael Varoquaux - Machine learning on non curated data
Taught by
EuroPython Conference
Related Courses
A Brief History of Data Storage
EuroPython Conference via YouTube
Breaking the Stereotype - Evolution & Persistence of Gender Bias in Tech
EuroPython Conference via YouTube
We Can Get More from Spatial, GIS, and Public Domain Datasets
EuroPython Conference via YouTube
Using NLP to Detect Knots in Protein Structures
EuroPython Conference via YouTube
The Challenges of Doing Infra-As-Code Without "The Cloud"
EuroPython Conference via YouTube