YoVDO

Efficient Scheduling of High Performance Batch Computing for Analytics Workloads with Volcano

Offered By: CNCF [Cloud Native Computing Foundation] via YouTube

Tags

Kubernetes Courses
Apache Spark Courses
Jupyter Notebooks Courses
Data Analytics Courses
Volcano Courses

Course Description

Overview

Explore how the ING Wholesale Banking Advanced Analytics team implemented efficient scheduling of high-performance batch computing for analytics workloads using Volcano. Follow the journey of building a centralized platform for internal data sources and large-scale computing that gives more than 300 internal projects and 2,000 users access to advanced analytics capabilities. Learn how the team adopted Volcano, a specialized cloud-native Kubernetes scheduler, to optimize resource usage and keep core services stable. Gain insight into the custom extension developed for the Apache Spark binaries, which enables dynamic allocation and hierarchical dominant resource fairness (HDRF) in multi-tenant environments. See how this solution lets users combine Volcano with Spark interactive mode in Jupyter notebooks and visualize scheduling metrics in a way similar to the YARN UI.

Syllabus

Efficient Scheduling Of High Performance Batch Computing For... Krzysztof Adamski & Tinco Boekestijn


Taught by

CNCF [Cloud Native Computing Foundation]

Related Courses

Understanding China, 1700-2000: A Data Analytic Approach, Part 1
The Hong Kong University of Science and Technology via Coursera
The Analytics Edge
Massachusetts Institute of Technology via edX
大数据与信息传播 Big Data and Information Dissemination
Fudan University via Coursera
The Future of Fashion
Marist College via Independent
The Mobile Consumer
Marist College via Independent