Machine Learning on Non-Curated Data
Offered By: EuroPython Conference via YouTube
Course Description
Overview
Explore machine learning techniques for handling non-curated data in this 43-minute EuroPython Conference talk. Delve into practical solutions for two common dirty-data problems: missing values and non-normalized entries. Learn how to implement standard machine learning tools like scikit-learn when dealing with these data errors. Discover the importance of imputation and adding missingness indicators for handling missing values, and understand how to create vectorial representations for non-normalized categories. Gain insights from theoretical analyses and recent machine learning publications to improve your data science workflow and efficiency when working with imperfect datasets.
Syllabus
Gael Varoquaux - Machine learning on non curated data
Taught by
EuroPython Conference
Related Courses
Introduction to Artificial IntelligenceStanford University via Udacity Natural Language Processing
Columbia University via Coursera Probabilistic Graphical Models 1: Representation
Stanford University via Coursera Computer Vision: The Fundamentals
University of California, Berkeley via Coursera Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent