Data Curation for Open Source LLM Fine-Tuning
Offered By: Data Science Festival via YouTube
Course Description
Overview
Explore data curation techniques for open source LLM fine-tuning in this 19-minute conference talk by Clemens Schroeer from Lemon AI, presented at the Data Science Festival MayDay event 2024. Delve into the challenges of acquiring high-quality data for fine-tuning models like Mistral 7B, and learn strategies for iterating towards effective datasets. Gain insights into the complexities of understanding and leveraging company data for optimal fine-tuning results. Designed for technical practitioners, this talk offers valuable experience-based knowledge on dataset curation and considerations for successful open source LLM fine-tuning.
Syllabus
Data Curation for Open Source LLM Fine Tuning - Data Science Festival
Taught by
Data Science Festival
Related Courses
Data AnalysisJohns Hopkins University via Coursera Computing for Data Analysis
Johns Hopkins University via Coursera Scientific Computing
University of Washington via Coursera Introduction to Data Science
University of Washington via Coursera Web Intelligence and Big Data
Indian Institute of Technology Delhi via Coursera