Videos
1. Introduction and Mini-Batches in On- and Off-Policy Deep Reinforcement Learning (18 min)
2A. Proximal Policy Optimization for Continuous Control (25 min)
2B. Deep Deterministic Policy Gradient for Continuous Control (10 min)
3A. Background Planning and Variational State Tabulation (16 min)
3B. Monte Carlo Tree Search and AlphaZero (25 min)
3C. MuZero (14 min)
© 2023 EPFL, all rights reserved