YoVDO

Planning and Learning in Risk-Aware Restless Multi-Arm Bandit

Offered By: GERAD Research Center via YouTube

Tags

Markov Decision Processes Courses Reinforcement Learning Courses Probability Theory Courses Thompson Sampling Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the intricacies of risk-aware restless multi-arm bandits in this 48-minute seminar presented by Nima Akbarzadeh from HEC Montréal at the GERAD Research Center. Delve into the generalization of traditional restless multi-arm bandit problems by incorporating risk-awareness into the objective function. Discover the established indexability conditions for risk-aware objectives and learn about a proposed Thompson sampling approach for addressing learning challenges when true transition probabilities are unknown. Gain insights into how this method achieves bounded regret that scales sublinearly with episodes and quadratically with arms. Examine numerical experiments demonstrating the effectiveness of this approach in reducing risk exposure in restless multi-arm bandits.

Syllabus

Planning and Learning in Risk-Aware Restless Multi-Arm Bandit, Nima Akbarzadeh


Taught by

GERAD Research Center

Related Courses

Computational Neuroscience
University of Washington via Coursera
Reinforcement Learning
Brown University via Udacity
Reinforcement Learning
Indian Institute of Technology Madras via Swayam
FA17: Machine Learning
Georgia Institute of Technology via edX
Introduction to Reinforcement Learning
Higher School of Economics via Coursera