YoVDO

Harnessing Black-Box Control to Boost Commonsense in Language Models' Generation

Offered By: USC Information Sciences Institute via YouTube

Tags

GPT-3 Courses GPT-2 Courses Flan-T5 Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a resource-efficient framework for enhancing commonsense in large language models during a 55-minute talk presented by Yufei Tian from UCLA at the USC Information Sciences Institute. Discover the BOOST method, which steers frozen Pre-Trained Language Models towards more reasonable outputs without expensive fine-tuning. Learn about the creation of an interpretable, reference-free evaluator that assigns commonsensical scores to sentences based on a dynamic knowledge base. Examine how this evaluator guides the NADO controllable generation method to train an auxiliary head, improving output quality. Review test results on various language models, including GPT-2, Flan-T5, and Alpaca-based models, and compare BOOST-generated content with ChatGPT outputs through human evaluation. Gain insights into creative and controllable text generation, machine reasoning, and evaluation metrics for open-ended NLG tasks from Yufei Tian, a CS PhD student at UCLA supported by the UCLA-Amazon fellowship program.

Syllabus

Harnessing Black-Box Control to Boost Commonsense in LM’s Generation


Taught by

USC Information Sciences Institute

Related Courses

Generating New Recipes using GPT-2
Coursera Project Network via Coursera
Deep Learning NLP: Training GPT-2 from scratch
Coursera Project Network via Coursera
Artificial Creativity
Parsons School of Design via Coursera
Coding Train Late Night - GPT-2, Hue Lights, Discord Bot
Coding Train via YouTube
Coding Train Late Night - Fetch, GPT-2 and RunwayML
Coding Train via YouTube