YoVDO

Transform Your Machine Learning Pipelines with Apache Hudi

Offered By: Linux Foundation via YouTube

Tags

Machine Learning Courses Data Lakes Courses Real-Time Data Processing Courses Data Pipelines Courses Apache Hudi Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Discover how to revolutionize machine learning pipelines integrated with data lakes in this 25-minute conference talk by Nadine Farah from Onehouse. Learn about the challenges of maintaining fresh, accurate, and near real-time data for ML models in traditional data lakes. Explore how Apache Hudi addresses these issues with features like upserts, incremental processing, and near real-time access. Gain insights into building efficient ML pipelines using Hudi's capabilities, including time-travel querying and incremental data pulls. Understand how to overcome data latency, implement incremental updates, and ensure timely data availability for ML models. By the end of this talk, acquire knowledge on transforming your ML pipelines to harness the full potential of data lakes using Apache Hudi.

Syllabus

Unveil the Magic Without Hoodini: Transform Your Machine Learning Pipelines with Apa... Nadine Farah


Taught by

Linux Foundation

Tags

Related Courses

Hands-On with Dataflow
A Cloud Guru
Azure Data Engineer con Databricks y Azure Data Factory
Coursera Project Network via Coursera
Data Integration with Microsoft Azure Data Factory
Microsoft via Coursera
Azure Data Factory : Implement SCD Type 1
Coursera Project Network via Coursera
MLOps1 (Azure): Deploying AI & ML Models in Production using Microsoft Azure Machine Learning
statistics.com via edX