How to Use Self-Play for Language Models to Improve at Solving Programming Puzzles
Offered By: Simons Institute via YouTube
Course Description
Overview
Explore a groundbreaking lecture on enhancing language models' programming capabilities through self-play techniques. Discover how large language models can generate their own programming puzzles and solutions, leading to significant improvements in code generation performance. Learn about the innovative approach of using a Python interpreter to verify the correctness of synthesized problems and solutions. Examine the experimental results showing how publicly-available language models more than doubled their test accuracy after fine-tuning on self-generated problems. Gain insights into the potential of code language models to create instructive content and enhance their own performance when paired with an interpreter.
Syllabus
How to Use Self-Play for Language Models to Improve at Solving Programming Puzzles
Taught by
Simons Institute
Related Courses
Introduction to Artificial IntelligenceStanford University via Udacity Natural Language Processing
Columbia University via Coursera Probabilistic Graphical Models 1: Representation
Stanford University via Coursera Computer Vision: The Fundamentals
University of California, Berkeley via Coursera Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent