Building Reproducible Distributed Applications at Scale
Offered By: EuroPython Conference via YouTube
Course Description
Overview
Explore the challenges and solutions for packaging Python code in distributed computing environments through this conference talk. Dive into various methods for deploying Python code to compute clusters, examining the role of Python's pickling feature and self-contained executables. Learn about the complexities of shipping code to large-scale clusters with thousands of nodes running jobs like TensorFlow or Spark. Discover how to execute a PySpark job on S3 storage using PEX as a self-contained executable artifact. Gain insights into generalizing these concepts for different job types, virtual environments, and distributed storage systems. Walk away with an understanding of Python packaging challenges for distributed applications and practical code samples applicable to your own projects.
Syllabus
Fabian Höring - Building reproducible distributed applications at scale
Taught by
EuroPython Conference
Related Courses
Cloud Computing Concepts, Part 1University of Illinois at Urbana-Champaign via Coursera Cloud Computing Concepts: Part 2
University of Illinois at Urbana-Champaign via Coursera Reliable Distributed Algorithms - Part 1
KTH Royal Institute of Technology via edX Introduction to Apache Spark and AWS
University of London International Programmes via Coursera Réalisez des calculs distribués sur des données massives
CentraleSupélec via OpenClassrooms