GenAI and Datacomp: Creating the Largest Public Multimodal Dataset in Academia
Offered By: Data Council via YouTube
Course Description
Overview
Explore the vital role of universities and the open-source community in the Generative AI ecosystem through this 17-minute talk, focusing on large-scale dataset management. Examine the NeurIPS'23 Datacomp paper, which details the creation of academia's largest multimodal dataset to date. Discover four emerging trends reshaping AI data management: AI-powered data cleaning, data-centric AI approaches, legal and privacy challenges in data sharing, and the potential of synthetic dataset expansion. Gain insights into how academia continues to innovate in the field of Generative AI, presented by Professor Alex Dimakis from the University of Texas at Austin.
Syllabus
GenAI and Datacomp: Creating the Largest Public Multimodal Dataset in Academia
Taught by
Data Council
Related Courses
Data Wrangling with MongoDBMongoDB via Udacity Getting and Cleaning Data
Johns Hopkins University via Coursera 软件包在流行病学研究中的应用 Using software apps in epidemiological research
Peking University via Coursera Creating an Analytical Dataset
Udacity Implementing ETL with SQL Server Integration Services
Microsoft via edX