How ML-Powered Data Cleaning Streamlines the AI Pipeline

Offered By: Snorkel AI via YouTube

Tags

Machine Learning Courses
Data Science Courses
Data Cleaning Courses
Differential Privacy Courses
Snorkel AI Courses

Course Description

Overview

Explore how machine learning-powered data cleaning streamlines the AI pipeline in this 30-minute video from Snorkel AI. Learn about the challenges data scientists face when preparing, cleaning, and transforming raw data before model training. Discover automated approaches to data cleaning, including record linkage pipelines, probabilistic cleaning models, and imputation techniques. Gain insights into differential privacy synthesis with structure, and into methodologies for automating data cleaning infrastructure. Understand how ML-driven data cleaning can reduce the labor-intensive manual work that slows end-to-end AI pipelines and so accelerate the data science workflow.

Syllabus

Intro
Data Prep is the Impediment for AI
Record Linkage Pipeline
Dirty Data Beyond Integration
Automating Cleaning with ML
A Probabilistic Cleaning Model
Use Case: Imputation
ML for Cleaning Automation
Differential Privacy Synthesis with Structure
Methodology Overview
Automating Data Cleaning Infrastructure


Taught by

Snorkel AI

Related Courses

Introduction to Artificial Intelligence
Stanford University via Udacity
Natural Language Processing
Columbia University via Coursera
Probabilistic Graphical Models 1: Representation
Stanford University via Coursera
Computer Vision: The Fundamentals
University of California, Berkeley via Coursera
Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent