Videos 11 | Moodle

1. Introduction and Mini-Batches in On- and Off-Policy Deep Reinforcement Learning (18 min)

2A. Proximal Policy Optimization for Continuous Control (25 min)

2B. Deep Deterministic Policy Gradient for Continuous Control (10 min)

3A. Background Planning and Variational State Tabulation (16 min)

3B. Monte Carlo Tree Search and AlphaZero (25 min)

3C. MuZero (14 min)



© 2023 EPFL, all rights reserved