Start date cannot be after end date.
On The Convergence Rate Of Entropy-regularized Natural Policy Gradient With Linear Function
Presenter
- R. Srikant
August 3, 2021
ICERM
A Lyapunov approach for finite-sample convergence bounds with off-policy RL
Presenter
- Sanjay Shakkottai
August 3, 2021
ICERM
Towards a Theory of Representation Learning for Reinforcement Learning
Presenter
- Alekh Agarwal
August 2, 2021
ICERM
Reinforcement Learning in High Dimensional Systems (and why "reward" is not enough...)
Presenter
- Sham Kakade
August 2, 2021
ICERM