YoVDO

RLHF Data Collection in Practice - Part 2

Offered By: MLOps.community via YouTube

Tags

Machine Learning Courses, MLOps Courses, LLM (Large Language Model) Courses, Data Collection Courses, OpenAI Courses, RLHF Courses

Course Description

Overview

Explore practical strategies for collecting high-quality RLHF (Reinforcement Learning from Human Feedback) data in this 12-minute conference talk by Andrew Mauboussin of Surge AI. Learn about the risks of low-quality RLHF data collection and the techniques implemented in Surge AI's full-stack RLHF data collection product. Mauboussin draws on his experience in ML engineering, including work in the finance industry at Kensho and on election safeguarding at Twitter. The talk also covers how Surge AI supports ML teams at companies such as Anthropic and OpenAI through its approach to collecting human feedback data at scale.

Syllabus

RLHF Data Collection in Practice // Andrew Mauboussin // LLMs in Prod Conference Part 2


Taught by

MLOps.community

Related Courses

Google BARD and ChatGPT AI for Increased Productivity
Udemy
Bringing LLM to the Enterprise - Training From Scratch or Just Fine-Tune With Cerebras-GPT
Prodramp via YouTube
Generative AI and Long-Term Memory for LLMs
James Briggs via YouTube
Extractive Q&A With Haystack and FastAPI in Python
James Briggs via YouTube
OpenAssistant First Models Are Here! - Open-Source ChatGPT
Yannic Kilcher via YouTube