Spinning Language Models: Risks of Propaganda-as-a-Service and Countermeasures
Offered By: IEEE via YouTube
Course Description
Overview
Explore the risks of using language models for propaganda, and the countermeasures against them, in this 17-minute IEEE conference talk. Learn what Propaganda-as-a-Service means and how classification models differ from sequence-to-sequence models. Follow the intuition behind spinning language models, see how input is created for meta-task models, examine the resulting change in output distribution, and review defensive strategies that mitigate these risks. The talk is presented by Eugene Bagdasaryan and Vitaly Shmatikov from Cornell Tech.
Syllabus
Intro
What is Propaganda?
Classification vs Sequence-to-sequence
Spin Intuition in Language Models
Create Input for Meta-task Model
Change in output distribution
Defense
Taught by
IEEE Symposium on Security and Privacy
Related Courses
Computer Security (Stanford University via Coursera)
Cryptography II (Stanford University via Coursera)
Malicious Software and its Underground Economy: Two Sides to Every Story (University of London International Programmes via Coursera)
Building an Information Risk Management Toolkit (University of Washington via Coursera)
Introduction to Cybersecurity (National Cybersecurity Institute at Excelsior College via Canvas Network)