YoVDO

MATS Stack for Cross-System Orchestration of Machine Learning Pipelines

Offered By: Databricks via YouTube

Tags

Machine Learning Pipelines Courses TensorFlow Courses Apache Spark Courses Apache Airflow Courses Model Deployment Courses MLFlow Courses Phishing Detection Courses

Course Description

Overview

Explore the MATS stack for cross-system orchestration of machine learning pipelines in this 22-minute conference talk from Databricks. Learn how Avast integrates model tracking, storage, orchestration, and deployments to handle over 17 million daily phishing detections. Discover how MLFlow, Airflow, Tensorflow, and Spark combine to create a standardized, well-integrated toolset for data scientists. Follow the journey of Angler, an internal project for detecting phishing URLs, through all pipeline stages including data transformations, model training, experiment tracking, and serving. Gain insights into fast, reproducible experiments and seamless progression from research to production. Understand the challenges and successes of implementing this modern ML pipeline approach, which can be integrated into existing ecosystems without disruption.

Syllabus

Introduction
Project Life Cycle
MATS Stack
Airflow
Tensorflow
Challenges
Successes


Taught by

Databricks

Related Courses

CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX
Big Data Analytics
University of Adelaide via edX
Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera
Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera
Introduction to Apache Spark and AWS
University of London International Programmes via Coursera