PandasUDFs - Scaling Ensembles for Improved Predictions
Offered By: Databricks via YouTube
Course Description
Overview
This 38-minute Databricks talk shows how to use PandasUDFs to scale ensemble models. It walks through turning development code into a scalable solution for category-specific predictions, reducing runtime from hours to minutes. Topics include general usage, the types of PandasUDFs, strategies for overcoming data limits, and equivalent approaches in R and Koalas. The talk also covers how to scale from a single model to an entire ensemble to improve prediction accuracy across diverse categories.
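To make the idea of category-specific predictions concrete, here is a minimal sketch of the grouped pattern, not the code from the talk. It assumes a hypothetical dataset with columns `category`, `x`, and `y`, and uses a straight-line fit as a stand-in for whatever model each category would really get. The function names are illustrative. The Spark step uses `applyInPandas`, the Spark 3.0+ form of a grouped-map PandasUDF, and a local pandas loop is included so the per-category function can be checked without a cluster.

```python
import numpy as np
import pandas as pd


def fit_predict_category(pdf: pd.DataFrame) -> pd.DataFrame:
    """Fit one model on a single category's rows and return its predictions.

    Assumption: a least-squares line y ~ x stands in for the real model.
    """
    slope, intercept = np.polyfit(pdf["x"], pdf["y"], deg=1)
    return pd.DataFrame({
        "category": pdf["category"],
        "x": pdf["x"],
        "prediction": slope * pdf["x"] + intercept,
    })


def predict_all_locally(df: pd.DataFrame) -> pd.DataFrame:
    """Development-style version: loop over categories on a single machine."""
    parts = [fit_predict_category(group) for _, group in df.groupby("category")]
    return pd.concat(parts, ignore_index=True)


def predict_all_with_spark(sdf):
    """Scaled version: Spark runs fit_predict_category once per category.

    Assumptions: `sdf` is a Spark DataFrame whose `x` and `y` columns are
    doubles, and each category's rows fit in one executor's memory.
    """
    schema = "category string, x double, prediction double"
    return sdf.groupBy("category").applyInPandas(fit_predict_category, schema=schema)
```

Because the same `fit_predict_category` function is used in both paths, the per-category logic can be developed and tested in plain pandas first, then handed to Spark unchanged.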
Syllabus
Introduction
The Problem
PandasUDFs
Use Cases
Data Limits
Other Frameworks
Taught by
Databricks
Related Courses
Data Processing with Azure - LearnQuest via Coursera
Mejores prácticas para el procesamiento de datos en Big Data - Coursera Project Network via Coursera
Data Science with Databricks for Data Analysts - Databricks via Coursera
Azure Data Engineer con Databricks y Azure Data Factory - Coursera Project Network via Coursera
Curso Completo de Spark con Databricks (Big Data) - Coursera Project Network via Coursera