Automating Cloud-native Spark Jobs with Argo Workflows
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore how to orchestrate Apache Spark jobs using Argo Workflows in a cloud-native environment through this informative conference talk. Discover the challenges of managing dependencies in large computational workloads and learn how Kubernetes and Argo Workflows provide solutions for distributed environments. Gain insights into the architecture, resource management, and workflow definitions necessary for running Spark jobs on Kubernetes. Witness demonstrations of provisioning Spark and Argo Workflows, and understand the scaling and stability advantages they offer over traditional local or cloud environments. Evaluate the pros and cons of this approach to help determine if it's suitable for your data processing needs.
Syllabus
Automating Cloud-native Spark Jobs with Argo Workflows - Caelan Urquhart & Darko Janjić, Pipekit
Taught by
Linux Foundation
Tags
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera