YoVDO

Data Management - Full Stack Deep Learning - March 2019

Offered By: The Full Stack via YouTube

Tags

Deep Learning Courses Data Lakes Courses Data Storage Courses Data Processing Courses Data Management Courses Data Labeling Courses Makefiles Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore data management essentials for deep learning projects in this comprehensive lecture. Delve into data labeling, storage, versioning, and processing techniques. Learn about the data flywheel concept, annotator training, labor sources, and service comparisons. Discover database scalability, data lake organization, and versioning strategies. Examine task dependencies and workflow management tools like Luigi and Airflow. Gain practical insights for implementing efficient data pipelines in machine learning projects.

Syllabus

Introduction
Data Flywheel: an initial manually labeled dataset enables self- improvement with user data
Roadmap
Training the annotators is crucial
Sources of Labor
Service Companies
Software
Data Storage
Database scalable storage and retrieval of structured data
Data Lake
What goes where
Data Versioning
Level 2
Motivational Example We have to train a photo popularity predictor every night.
Task Dependencies
Makefile limitations
Luigi and Airflow


Taught by

The Full Stack

Related Courses

Coding the Matrix: Linear Algebra through Computer Science Applications
Brown University via Coursera
كيف تفكر الآلات - مقدمة في تقنيات الحوسبة
King Fahd University of Petroleum and Minerals via Rwaq (رواق)
Datascience et Analyse situationnelle : dans les coulisses du Big Data
IONIS via IONIS
Data Lakes for Big Data
EdCast
統計学Ⅰ:データ分析の基礎 (ga014)
University of Tokyo via gacco