YoVDO

Uber's Batch Analytics Evolution from Hive to Spark

Offered By: Databricks via YouTube

Tags

Apache Spark Courses Data Migration Courses ETL Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore Uber's strategic migration from Hive to SparkSQL in this 28-minute conference talk. Discover how Uber tackled the challenge of optimizing their batch analytics processes, which previously accounted for 40% of their multimillion-dollar ETL expenses. Learn about the development of automation features, including query transpilation, parallel execution, and a validation framework for data correctness and performance. Delve into the architecture of Uber's auto-migration framework, understand the challenges faced during the migration process, and gain insights into the solutions implemented. Senior Software Engineers Akshayaprakash Sharma and Kumudini Kakwani from Uber share their experiences and reveal the overall efficiency gains achieved through this large-scale migration effort.

Syllabus

Uber's Batch Analytics Evolution from Hive to Spark


Taught by

Databricks

Related Courses

iOS Persistence and Core Data
Udacity
Data Migration to SAP S/4HANA
SAP Learning
Deep Dive into Amazon Glacier
Amazon via Independent
Upgrade2Success – Making SAP ERP HCM Migration Easier
SAP Learning
Migrating Your Business Data to SAP S/4HANA – New Implementation Scenario
SAP Learning