YoVDO

SQL for Efficient Data Organization in Machine Learning

Offered By: Snorkel AI via YouTube

Tags

SQL Courses
Machine Learning Courses
Data Organization Courses
Random Forests Courses
Scalability Courses
Gradient Boosting Courses

Course Description

Overview

Explore how SQL can improve data organization for machine learning in this 11-minute video presentation by Columbia PhD student Zachary Huang. Learn about JoinBoost, a lightweight Python library that rewrites tree training algorithms over normalized databases as pure SQL queries. Discover how this approach addresses the mismatch between the way ML expects data to be organized and the way traditional databases structure it, offering a simplified, all-in-one data stack. Gain insight into JoinBoost's compatibility with various DBMSs and data stacks, and into how it outperforms specialized ML libraries such as LightGBM in speed and scalability for random forests and gradient boosting.
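To make the idea of "tree training as SQL" concrete, here is a minimal sketch of one step of regression-tree training, choosing a split threshold, written as a single SQL query over two normalized tables. It is not JoinBoost's API or its actual queries. The tables (items, sales), the columns, and the helper names build_demo_db and best_split are all hypothetical, and SQLite stands in for whichever DBMS is used. The query scores each candidate threshold with the standard variance-reduction criterion, which needs only counts and sums of the target on each side of the split.

```python
import sqlite3


def build_demo_db():
    """Create a hypothetical normalized toy database (not from the talk)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE items (item_id INTEGER PRIMARY KEY, price INTEGER);
        CREATE TABLE sales (item_id INTEGER REFERENCES items(item_id),
                            units INTEGER);
        INSERT INTO items VALUES (1, 10), (2, 20), (3, 30), (4, 40);
        INSERT INTO sales VALUES (1, 1), (1, 2), (2, 1),
                                 (3, 9), (3, 10), (4, 11);
    """)
    return conn


# Feature: items.price. Target: sales.units.
# For a split "price <= v", minimizing squared error is equivalent to
# maximizing sum_left^2 / cnt_left + sum_right^2 / cnt_right.
SPLIT_QUERY = """
WITH agg AS (
    -- Count and sum of the target per distinct feature value, via the join.
    SELECT d.price AS v, COUNT(*) AS cnt, SUM(f.units) AS sum_y
    FROM sales AS f JOIN items AS d USING (item_id)
    GROUP BY d.price
),
total AS (
    SELECT SUM(cnt) AS cnt, SUM(sum_y) AS sum_y FROM agg
),
prefix AS (
    -- Cumulative count and sum for the left side of each candidate split.
    SELECT a.v AS v, SUM(b.cnt) AS cnt_l, SUM(b.sum_y) AS sum_l
    FROM agg AS a JOIN agg AS b ON b.v <= a.v
    GROUP BY a.v
)
SELECT p.v AS threshold,
       p.sum_l * p.sum_l * 1.0 / p.cnt_l
       + (t.sum_y - p.sum_l) * (t.sum_y - p.sum_l) * 1.0 / (t.cnt - p.cnt_l)
       AS score
FROM prefix AS p CROSS JOIN total AS t
WHERE p.cnt_l < t.cnt          -- skip the split that leaves the right side empty
ORDER BY score DESC, p.v
LIMIT 1
"""


def best_split(conn):
    """Return (threshold, score) for the best split on items.price."""
    return conn.execute(SPLIT_QUERY).fetchone()
```

On this toy data, best_split(build_demo_db()) selects the threshold 20, separating the low-selling items (prices 10 and 20) from the high-selling ones. The work happens inside the database on the normalized tables, and Python only issues the query and reads back one row.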

Syllabus

Introduction
Background
Example
Problem Statement


Taught by

Snorkel AI

Related Courses

Introduction to Artificial Intelligence
Stanford University via Udacity
Natural Language Processing
Columbia University via Coursera
Probabilistic Graphical Models 1: Representation
Stanford University via Coursera
Computer Vision: The Fundamentals
University of California, Berkeley via Coursera
Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent