Lecture 2: Dynamic Programming 1
MDPs; value and Q functions; value iteration, policy iteration; operator perspectives
Click lecture 2 (2022).pdf link to view the file.
MDPs; value and Q functions; value iteration, policy iteration; operator perspectives
Follow the pulses of EPFL on social networks
© 2023 EPFL, all rights reserved