Scaling Monitoring at Databricks - From Prometheus to M3
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
Explore how Databricks scaled its monitoring infrastructure by migrating from Prometheus to M3. Learn about the initial architecture, the M3 setup process, migration challenges, and lessons learned. Gain insights into memory usage optimization, internal dashboard development, and strategies for handling upgrades, updates, and metric spikes. Discover capacity planning techniques and future plans for monitoring at Databricks. The talk offers practical guidance for teams facing similar challenges in large-scale monitoring systems.
Syllabus
Intro
Monitoring before M3
Initial Architecture
M3 Setup
Migration
Lessons Learned
Memory Usage
Internal Dashboard
Upgrades and Updates
Metric Spikes
Capacity
Future Plans
Conclusion
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
Software as a Service - University of California, Berkeley via Coursera
Software Defined Networking - Georgia Institute of Technology via Coursera
Pattern-Oriented Software Architectures: Programming Mobile Services for Android Handheld Systems - Vanderbilt University via Coursera
Web-Technologien - openHPI
Données et services numériques, dans le nuage et ailleurs - Certificat informatique et internet via France Université Numerique