Deploying Python at Scale with Dask
Offered By: PyCon US via YouTube
Course Description
Overview
Explore the challenges and solutions for scaling Python with Dask on distributed hardware in this PyCon US talk. Dive into deployment strategies for Dask on cluster resource managers like Kubernetes, Yarn, and cloud platforms. Learn how the Dask library extends popular Python data science tools to handle 100+TB datasets across multi-core workstations and distributed clusters. Discover approaches to balance load, share resources, control access, and ensure security when deploying Dask within organizations. Examine real-world examples showcasing Dask's positive social impact in large-scale data processing. Gain insights into uniform software environments, resource sharing, credentials management, and cost optimization for IT professionals. Understand the landscape of friendly resource managers, managed solutions, and opinionated approaches for efficient Python deployment at scale.
Syllabus
Introduction
Why this talk
Data Science Libraries
Task
Desk
Environment Management
Uniform software environments
Data science vs IT
Resource sharing
Access
IT Professional
Credentials
Security
Costs
Avoid Track Optimize
Cost
Conclusion
Friendly Resource Managers
Managed Solutions
opinionated solutions
Coyle Computing
Managed Services
Summary
Taught by
PyCon US
Related Courses
Cybersecurity and Its Ten DomainsUniversity System of Georgia via Coursera Bases de données relationnelles : Comprendre pour maîtriser
Inria (French Institute for Research in Computer Science and Automation) via France Université Numerique Desarrollo de Aplicaciones Web: Seguridad
University of New Mexico via Coursera Web Application Development: Security
University of New Mexico via Coursera Computing, Storage and Security with Google Cloud Platform
Google via Coursera