YoVDO

Running AI Inference on Google Kubernetes Engine - Anthropic's Approach with Claude

Offered By: Google Cloud Tech via YouTube

Tags

Artificial Intelligence Courses
Machine Learning Courses
Cloud Computing Courses
Kubernetes Courses
Inference Courses
Claude Courses

Course Description

Overview

Discover how Anthropic uses Google Kubernetes Engine (GKE) to run inference for Claude cost-efficiently and at high performance on Google Tensor Processing Units (TPUs) and NVIDIA graphics processing units (GPUs). Learn about Anthropic's improved price-performance on TPU v5e, the GKE management capabilities that simplify Day-2 maintenance, and the support provided by Google Cloud. This 29-minute conference talk from Google Cloud Next 2024 covers customer-triggered maintenance, Cube for Claude, Google TPU, Kubernetes orchestration, cost-efficient inference, GPU recommendations, and GKE features.

Syllabus

Introduction
About Anthropic
Customer-triggered maintenance
Cube for Claude
Google TPU
Kubernetes Orchestration
Cost Efficient Inference
GPU Recommendations
GKE


Taught by

Google Cloud Tech

Related Courses

Career Hacking: The Ultimate Job Search Course (Now w/ AI!)
Udemy
Insane AI News Happening That No One is Noticing - Weekly Roundup
Matt Wolfe via YouTube
Live Coding an LLM Battle - GPT-4 vs. Claude - 20 Questions Game
Rob Mulla via YouTube
Complete Tutorial of Top Generative AI Tools - ChatGPT, GitHub Copilot, Claude, and Google Gemini
Great Learning via YouTube
Preparing Data with Generative AI
Pluralsight