Dynamic Large Scale Spark on Kubernetes: Empowering the Community with Argo Workflows and Argo Events
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
Discover how to build and manage large-scale Spark clusters on Kubernetes for powerful data processing in this informative conference talk. Learn best practices for constructing scalable Spark clusters on Kubernetes, with a focus on leveraging Argo Workflows and Argo Events. Explore the challenges of configuring storage, compute, networking, and optimizing job scheduling, whether starting from scratch or migrating Spark workloads from existing Hadoop clusters. Gain insights into harnessing the potential of Argo Workflows and Argo Events for event-driven job scheduling, enabling efficient resource utilization and seamless scalability. Understand how integrating these powerful open-source tools can provide better control and flexibility for executing Spark jobs on Kubernetes, empowering you to build highly scalable and efficient data processing environments.
Syllabus
Dynamic Large Scale Spark on Kubernetes: Empowering the Community wi... Ovidiu Valeanu & Vara Bonthu
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera