YoVDO

Ensuring Business Continuity with Embedded SRE Tools and Incident Management

Offered By: Platform Engineering via YouTube

Tags

Business Continuity Courses Auditing Courses Incident Management Courses Load Testing Courses Platform Engineering Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a 15-minute conference talk from PlatformCon 2023 focused on ensuring business continuity through embedded Site Reliability Engineering (SRE) tools and incident management. Discover how MercadoLibre integrates SRE practices into their platform, including uptime modeling, architectural reviews, reliability checks, and critical application processes. Learn about the implementation of load testing tools, centralized alert management, runbooks, and comprehensive event auditing for effective troubleshooting. Gain insights into how these embedded elements contribute to a robust reliability engineering approach, reducing cognitive load and simplifying delivery. Understand the importance of high availability in platform engineering and its impact on both developers and end-users. Join Oscar Mullin, Sr. Director of Platform Core Services and SRE at MercadoLibre, as he shares valuable insights on promoting a culture of reliability, reducing downtime, and enhancing developer and end-user satisfaction through integrated SRE practices and tools.

Syllabus

Ensuring business continuity with embedded SRE tools and incident management | PlatformCon 2023


Taught by

Platform Engineering

Related Courses

Emergency Management: Risk, Incidents and Leadership
Coventry University via FutureLearn
Security Operations
Coventry University via FutureLearn
Planificación y Coordinación en Logística Humanitaria
Acción contra el Hambre via Miríadax
Preparing for Google Cloud Certification: Cloud DevOps Engineer
Google Cloud via Coursera
Managing Cybersecurity
University System of Georgia via Coursera