Harnessing Black-Box Control to Boost Commonsense in Language Models' Generation
Offered By: USC Information Sciences Institute via YouTube
Course Description
Overview
Explore a resource-efficient framework for enhancing commonsense in large language models during a 55-minute talk presented by Yufei Tian from UCLA at the USC Information Sciences Institute. Discover the BOOST method, which steers frozen Pre-Trained Language Models towards more reasonable outputs without expensive fine-tuning. Learn about the creation of an interpretable, reference-free evaluator that assigns commonsensical scores to sentences based on a dynamic knowledge base. Examine how this evaluator guides the NADO controllable generation method to train an auxiliary head, improving output quality. Review test results on various language models, including GPT-2, Flan-T5, and Alpaca-based models, and compare BOOST-generated content with ChatGPT outputs through human evaluation. Gain insights into creative and controllable text generation, machine reasoning, and evaluation metrics for open-ended NLG tasks from Yufei Tian, a CS PhD student at UCLA supported by the UCLA-Amazon fellowship program.
Syllabus
Harnessing Black-Box Control to Boost Commonsense in LM’s Generation
Taught by
USC Information Sciences Institute
Related Courses
How to Build Codex SolutionsMicrosoft via YouTube Unlocking the Power of OpenAI for Startups - Microsoft for Startups
Microsoft via YouTube Building Intelligent Applications with World-Class AI
Microsoft via YouTube Stanford Seminar - Transformers in Language: The Development of GPT Models Including GPT-3
Stanford University via YouTube ChatGPT: GPT-3, GPT-4 Turbo: Unleash the Power of LLM's
Udemy