Reading materials:
[9] Schulman et al., Trust Region Policy Optimization, ICML, 2015.
[10] Schulman et al., Proximal Policy Optimization Algorithms, arXiv, 2017.