Spinning Language Models: Risks of Propaganda-as-a-Service and Countermeasures
Offered By: IEEE via YouTube
Course Description
Overview
Explore the potential risks and countermeasures associated with using language models for propaganda in this 17-minute IEEE conference talk. Delve into the concept of Propaganda-as-a-Service, examining the differences between classification and sequence-to-sequence models. Gain insights into the intuition behind spinning language models and learn about the creation of input for meta-task models. Analyze the changes in output distribution and discover defensive strategies to mitigate these risks. Presented by Eugene Bagdasaryan and Vitaly Shmatikov from Cornell Tech, this talk offers a comprehensive overview of the challenges posed by manipulated language models in the context of propaganda dissemination.
Syllabus
Intro
What is Propaganda?
Classification vs Sequence-to-sequence
Spin Intuition in Language Models
Create Input for Meta-task Model
Change in output distribution
Defense
Taught by
IEEE Symposium on Security and Privacy
Tags
Related Courses
Security Principles(ISC)² via Coursera A Strategic Approach to Cybersecurity
University of Maryland, College Park via Coursera FinTech for Finance and Business Leaders
ACCA via edX Access Control Concepts
(ISC)² via Coursera Access Controls
(ISC)² via Coursera