RLHF Data Collection in Practice - Part 2
Offered By: MLOps.community via YouTube
Course Description
Overview
Explore practical strategies for collecting high-quality RLHF (Reinforcement Learning from Human Feedback) data in this 12-minute conference talk by Andrew Mauboussin from Surge AI. Learn about the risks associated with low-quality data collection for RLHF and discover effective techniques implemented in Surge's full-stack RLHF data collection product. Gain insights from Mauboussin's extensive experience in ML engineering, including his work in the finance industry at Kensho and election safeguarding at Twitter. Understand how Surge AI has been powering ML teams at leading companies like Anthropic and OpenAI through their innovative approach to human feedback data collection at scale.
Syllabus
RLHF Data Collection in Practice // Andrew Mauboussin // LLMs in Prod Conference Part 2
Taught by
MLOps.community
Related Courses
Observing and Analysing Performance in SportOpenLearning Statistics: Making Sense of Data
University of Toronto via Coursera Financial Planning
TAFE NSW via Open2Study Mobiles for Development
Indian Institute of Technology Kanpur via Independent Valoración de futbolistas
Universitat Politècnica de València via UPV [X]