YoVDO

State of the Industry - Short-Circuiting Reward Functions

Offered By: David Shapiro ~ AI via YouTube

Tags

- Artificial Intelligence Courses
- Machine Learning Courses
- AI Ethics Courses

Course Description

Overview

This 26-minute video lecture surveys key concerns in AI safety and alignment. It covers the risks of learned optimization in advanced machine learning systems, the Inner Alignment Problem, and the dangers of self-optimizing machines. It then examines the limitations of AI as presented in "The AI Alignment Problem" and how uncertainty about the world affects AI development. The lecture discusses the importance of learning in AI systems, drawing a parallel with the dangers of hyperpalatable foods for humans, and the cost of disentangling proximal from distal goals. It closes with the possible goals of a superintelligent AI, the risks it poses (including the risk that it intervenes in goal information), and the flaws in current reasoning about superintelligence containment.

Syllabus

- The risks of learned optimization in advanced machine learning systems
- The Inner Alignment Problem
- The dangers of self-optimizing machines
- The limitations of AI according to "The AI Alignment Problem"
- The uncertainty of the world
- The importance of learning
- The dangers of hyperpalatable foods
- The cost of disentangling proximal from distal goals
- The goals of a superintelligence
- The possibility of an agent with a rich enough action space
- The dangers of a superintelligence
- The risks of a superintelligence intervening in goal information
- Flaws in the reasoning of superintelligence containment


Taught by

David Shapiro ~ AI

Related Courses

- Introduction to Artificial Intelligence (Stanford University via Udacity)
- Natural Language Processing (Columbia University via Coursera)
- Probabilistic Graphical Models 1: Representation (Stanford University via Coursera)
- Computer Vision: The Fundamentals (University of California, Berkeley via Coursera)
- Learning from Data (Introductory Machine Learning course) (California Institute of Technology via Independent)