Getting ML Right in a Complex Data World
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore the intricacies of machine learning workflows in a complex data environment through this informative conference talk. Delve into the iterative and repetitive nature of ML experimentation, focusing on data labeling, cleaning, preprocessing, and feature selection methods. Learn why quality ML at scale requires reproducibility of specific experiment iterations and the crucial role of data versioning. Discover how open-source tools enable efficient versioning of ML experiments without duplicating code, data, and models, potentially reducing storage costs. Through a live code demonstration, gain practical insights on creating a basic ML experimentation framework, reproducing ML components from specific iterations, and building intuitive, zero-maintenance experiment infrastructure using open-source tooling.
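The idea of versioning experiments without duplicating data can be sketched in a few lines. This is only an illustrative toy, not the tooling demonstrated in the talk: the class `VersionedStore`, its methods, and the file names are hypothetical. It assumes a content-addressed design in which each experiment iteration records hashes of its inputs, so unchanged data is stored once and any iteration can be reproduced later.

```python
import hashlib


class VersionedStore:
    """Toy content-addressed store (hypothetical, for illustration only)."""

    def __init__(self):
        self.objects = {}  # content hash -> bytes, stored once per distinct content
        self.commits = {}  # commit id -> {file name: content hash}

    def commit(self, commit_id, files):
        """Record a snapshot of `files` (name -> bytes) under `commit_id`."""
        snapshot = {}
        for name, content in files.items():
            digest = hashlib.sha256(content).hexdigest()
            # Identical content is not stored a second time.
            self.objects.setdefault(digest, content)
            snapshot[name] = digest
        self.commits[commit_id] = snapshot

    def checkout(self, commit_id):
        """Return the exact files (name -> bytes) recorded for `commit_id`."""
        snapshot = self.commits[commit_id]
        return {name: self.objects[digest] for name, digest in snapshot.items()}


store = VersionedStore()
raw = b"id,label\n1,cat\n2,dog\n"

# Two experiment iterations share the same training data but differ in parameters.
store.commit("exp-1", {"train.csv": raw, "params.json": b'{"lr": 0.1}'})
store.commit("exp-2", {"train.csv": raw, "params.json": b'{"lr": 0.01}'})

stored_objects = len(store.objects)   # the shared training file is counted once
reproduced = store.checkout("exp-1")  # inputs of the first iteration, byte for byte
```

Here the second iteration adds only its changed parameter file to storage, which is the property that lets versioning avoid full copies of data and models.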
Syllabus
Getting ML Right in a Complex Data World - Vinodhini Duraisamy, Treeverse
Taught by
Linux Foundation
Related Courses
Data Wrangling with MongoDB - MongoDB via Udacity
Getting and Cleaning Data - Johns Hopkins University via Coursera
软件包在流行病学研究中的应用 Using software apps in epidemiological research - Peking University via Coursera
Creating an Analytical Dataset - Udacity
Implementing ETL with SQL Server Integration Services - Microsoft via edX