YoVDO

Actor Critic

Offered By: Pascal Poupart via YouTube

Tags

Reinforcement Learning Courses
Deep Reinforcement Learning Courses

Course Description

Overview

This 35-minute lecture by Pascal Poupart covers the foundations of actor-critic methods in reinforcement learning. It starts with the Stochastic Gradient Policy Theorem and the REINFORCE algorithm with a baseline, followed by a performance comparison. It then introduces temporal difference updates, the actor-critic algorithm, the advantage update, and Advantage Actor Critic (A2C), and closes with continuous actions and Deterministic Policy Gradient (DPG).

Syllabus

Intro
Outline
Stochastic Gradient Policy Theorem
REINFORCE Algorithm with a baseline
Performance Comparison
Temporal difference update
Actor Critic Algorithm
Advantage update
Advantage Actor Critic (A2C)
Continuous Actions
Deterministic Policy Gradient (DPG)


Taught by

Pascal Poupart

Related Courses

6.S094: Deep Learning for Self-Driving Cars (Massachusetts Institute of Technology via Independent)
Natural Language Processing (NLP) (Microsoft via edX)
Deep Reinforcement Learning (Nvidia Deep Learning Institute via Udacity)
Advanced AI: Deep Reinforcement Learning in Python (Udemy)
Self-driving go-kart with Unity-ML (Udemy)