YoVDO

ML Data Version Control and Reproducibility at Scale

Offered By: Linux Foundation via YouTube

Tags

Machine Learning Courses Cloud Computing Courses TensorFlow Courses Keras Courses PyTorch Courses LangChain Courses Data Management Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore data version control and reproducibility techniques for large-scale machine learning in this 38-minute talk by Einat Orr from Treeverse. Learn how to overcome challenges in ML data management, including reproducibility constraints and inefficient data transfer. Discover open-source tools for versioning data locally and best practices for working with data in the cloud without copying it. Gain insights into training models at scale using an OSS stack including Langchain, TensorFlow, PyTorch, and Keras. Acquire practical methods to enhance data management for developing and iterating on ML models, specifically tailored for modern computer vision research.

Syllabus

ML Data Version Control and Reproducibility at Scale - Einat Orr, Treeverse


Taught by

Linux Foundation

Tags

Related Courses

Software as a Service
University of California, Berkeley via Coursera
Software Defined Networking
Georgia Institute of Technology via Coursera
Pattern-Oriented Software Architectures: Programming Mobile Services for Android Handheld Systems
Vanderbilt University via Coursera
Web-Technologien
openHPI
Données et services numériques, dans le nuage et ailleurs
Certificat informatique et internet via France Université Numerique