Videos 11
1. Introduction and Mini-Batches in On- and Off-Policy Deep Reinforcement Learning (18 min)
2A. Proximal Policy Optimization for Continuous Control (25 min)
2B. Deep Deterministic Policy Gradient for Continuous Control (10 min)
3A. Background Planning and Variational State Tabulation (16 min)
3B. Monte Carlo Tree Search and Alpha Zero (25 min)
3C. MuZero (14 min)
- Contact
- EPFL CH-1015 Lausanne
- +41 21 693 11 11
Suivre les pulsations de l'EPFL sur les réseaux sociaux
© 2023 EPFL, tous droits réservés