Running AI Inference on Google Kubernetes Engine - Anthropic's Approach with Claude
Offered By: Google Cloud Tech via YouTube
Course Description
Overview
Discover how Anthropic uses Google Kubernetes Engine (GKE) to run inference for Claude cost-efficiently and at high performance on Google Tensor Processing Units (TPUs) and NVIDIA graphics processing units (GPUs). Learn about Anthropic's improved price-performance on TPU v5e, the GKE management capabilities that simplify Day-2 maintenance, and the support provided by Google Cloud. This 29-minute conference talk from Google Cloud Next 2024 covers customer-triggered maintenance, Cube for Claude, Google TPU, Kubernetes orchestration, cost-efficient inference, GPU recommendations, and GKE features.
Syllabus
Introduction
About Anthropic
Customer-triggered maintenance
Cube for Claude
Google TPU
Kubernetes Orchestration
Cost Efficient Inference
GPU Recommendations
GKE
Taught by
Google Cloud Tech