Distributed Multi-worker TensorFlow Training on Kubernetes
Offered By: Google via Google Cloud Skills Boost
Course Description
Overview
In this hands-on lab you will explore using Google Cloud Kubernetes Engine and Kubeflow TFJob to scale out TensorFlow distributed training.
Syllabus
- GSP775
- Overview
- Setup and requirements
- Lab tasks
- Task 1. Creating a GKE cluster
- Task 2. Deploying
- Task 3. Creating a Cloud Storage bucket
- Task 4. Preparing TFJob
- Task 5. Submitting the TFJob
- Task 6. Monitoring the TFJob
- Congratulations
Tags
Related Courses
Building End-to-end Machine Learning Workflows with KubeflowPluralsight Smart Analytics, Machine Learning, and AI on GCP
Pluralsight Leveraging Cloud-Based Machine Learning on Google Cloud Platform: Real World Applications
LinkedIn Learning Distributed TensorFlow - TensorFlow at O'Reilly AI Conference, San Francisco '18
TensorFlow via YouTube KFServing - Model Monitoring with Apache Spark and Feature Store
Databricks via YouTube