ChatGPT - This AI Has a Jailbreak! - Unbelievable AI Progress
Offered By: Yannic Kilcher via YouTube
Course Description
Overview
Explore the groundbreaking capabilities and potential implications of ChatGPT, OpenAI's latest GPT-3 variant fine-tuned using Reinforcement Learning from Human Feedback. Delve into the model's inner workings, origins, and OpenAI's iterative refinement strategy. Witness astonishing demonstrations, including building a virtual machine within ChatGPT's imagination. Examine the controversial topic of "jailbreaks" that circumvent safety mechanisms, and gain insights into OpenAI's vision for the future of AI. Discover how this revolutionary language model is reshaping the landscape of artificial intelligence and its potential impact on various fields.
Syllabus
- Intro
- Sponsor: Weights & Biases
- ChatGPT: How does it work?
- Reinforcement Learning from Human Feedback
- ChatGPT Origins: The GPT-3.5 Series
- OpenAI's strategy: Iterative Refinement
- ChatGPT's amazing capabilities
- Internals: What we know so far
- Building a virtual machine in ChatGPT's imagination insane
- Jailbreaks: Circumventing the safety mechanisms
- How OpenAI sees the future
Taught by
Yannic Kilcher
Related Courses
Introduction to Artificial IntelligenceStanford University via Udacity Probabilistic Graphical Models 1: Representation
Stanford University via Coursera Artificial Intelligence for Robotics
Stanford University via Udacity Computer Vision: The Fundamentals
University of California, Berkeley via Coursera Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent