YoVDO

Reinforcement Learning

Offered By: Edureka

Tags

  • Reinforcement Learning Courses
  • Machine Learning Courses
  • Markov Decision Processes Courses
  • Dynamic Programming Courses
  • Bandit Algorithms Courses
  • Policy Gradient Methods Courses
  • Bellman Equations Courses
  • Deep Q-Learning Courses

Course Description

Overview

Edureka offers an online Reinforcement Learning course. Learn the basics of reinforcement learning, including bandit algorithms (UCB, PAC, Median Elimination, Policy Gradient), dynamic programming, value functions, the Bellman equation, value iteration, and policy gradient methods, from ML and AI industry experts. The course covers:

  • Introduction to Reinforcement Learning
  • Bandit Algorithms and Markov Decision Processes
  • Dynamic Programming & Temporal Difference Methods
  • Deep Q-Learning
  • In-class Project

Related Courses

  • Reinforcement Learning (Indian Institute of Technology Madras via Swayam)
  • Bandit Algorithm (Online Machine Learning) (Indian Institute of Technology Bombay via Swayam)
  • Bandits - Kevin Jamieson - University of Washington (Paul G. Allen School via YouTube)
  • Tracking Significant Changes in Bandit - IFDS 2022 (Paul G. Allen School via YouTube)
  • Bandits - Lecture 5 (Paul G. Allen School via YouTube)