GenAI and Datacomp: Creating the Largest Public Multimodal Dataset in Academia

Offered By: Data Council via YouTube

Tags

Generative AI Courses
Machine Learning Courses
Data Cleaning Courses
Open Source Courses
Synthetic Data Courses
Data-Centric AI Courses

Course Description

Overview

Explore the vital role of universities and the open-source community in the Generative AI ecosystem through this 17-minute talk, focusing on large-scale dataset management. Examine the NeurIPS'23 Datacomp paper, which details the creation of academia's largest multimodal dataset to date. Discover four emerging trends reshaping AI data management: AI-powered data cleaning, data-centric AI approaches, legal and privacy challenges in data sharing, and the potential of synthetic dataset expansion. Gain insights into how academia continues to innovate in the field of Generative AI, presented by Professor Alex Dimakis from the University of Texas at Austin.

Syllabus

GenAI and Datacomp: Creating the Largest Public Multimodal Dataset in Academia


Taught by

Data Council

Related Courses

Data Wrangling with MongoDB
MongoDB via Udacity
Getting and Cleaning Data
Johns Hopkins University via Coursera
软件包在流行病学研究中的应用 Using software apps in epidemiological research
Peking University via Coursera
Creating an Analytical Dataset
Udacity
Implementing ETL with SQL Server Integration Services
Microsoft via edX