Building and Operating an Open Source Data Science Platform
Offered By: Toronto Machine Learning Series (TMLS) via YouTube
Course Description
Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the intricacies of building and operating an open-source data science platform in this comprehensive workshop led by Jörg Schad, Head of Machine Learning at ArangoDB. Delve into the entire deep learning pipeline, from exploratory analysis to model deployment and monitoring. Learn how to enable data scientists to develop models exploratively, automate distributed training and serving using CI/CD, deploy frameworks on various infrastructures, manage multiple deep learning frameworks on a single cluster, store and serve models at scale, track essential metadata, and monitor pipeline performance. Gain hands-on experience constructing an end-to-end data analytics pipeline, incorporating tools such as TFX, Kubeflow, Airflow, Apache Spark, Jupyter Notebooks, TensorFlow, Jenkins, Argo, and more. Acquire valuable insights into pipeline orchestration, data preparation, distributed training, automation, model storage, serving, and monitoring throughout this intensive 2-hour and 57-minute session.
Syllabus
Jörg Schad - Workshop: Building and Operating an Open Source Data Science Platform
Taught by
Toronto Machine Learning Series (TMLS)
Related Courses
Big Data EssentialsA Cloud Guru Big Data
University of Adelaide via edX Advanced Data Science with IBM
IBM via Coursera Amazon EMR Getting Started (Indonesian)
Amazon Web Services via AWS Skill Builder Analisar e preparar dados com o Amazon SageMaker Data Wrangler e o Amazon EMR (Português (Brasil)) | Lab - Analyze and Prepare Data with Amazon SageMaker Data Wrangler and Amazon EMR (Portuguese (Brazil))
Amazon Web Services via AWS Skill Builder