MP2, Standard Deviation in Task 4.1

Re: MP2, Standard Deviation in Task 4.1

by Skander Moalla
Yes, the implementation details don't matter, as long as the mean is state-dependent, the std is state-independent, and both are learned.
Typically you would put both of them in the policy class, yes.
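To make this concrete, here is a minimal sketch of such a policy class in plain Python. It is only an illustration, not the reference solution for the task: the class name `GaussianPolicy`, the linear mean, and the log-std parameterization are assumptions on my side. In a deep learning framework the mean would usually be a small network and the log-std a free learnable parameter (for instance an `nn.Parameter` in PyTorch).

```python
import math
import random


class GaussianPolicy:
    """Diagonal Gaussian policy: the mean depends on the state, the std does not."""

    def __init__(self, state_dim, action_dim, init_log_std=0.0, seed=0):
        self._rng = random.Random(seed)
        # Learned parameters of the state-dependent mean.
        # A linear map is an assumption made to keep the sketch short.
        self.weights = [
            [self._rng.gauss(0.0, 0.1) for _ in range(state_dim)]
            for _ in range(action_dim)
        ]
        self.bias = [0.0] * action_dim
        # Learned, state-independent log-std: one free value per action dimension.
        self.log_std = [init_log_std] * action_dim

    def mean(self, state):
        # The mean is a function of the state.
        return [
            sum(w * s for w, s in zip(row, state)) + b
            for row, b in zip(self.weights, self.bias)
        ]

    def std(self):
        # No state argument: the std is the same in every state.
        # Exponentiating the log-std keeps it strictly positive.
        return [math.exp(ls) for ls in self.log_std]

    def sample(self, state):
        return [self._rng.gauss(m, s) for m, s in zip(self.mean(state), self.std())]

    def log_prob(self, state, action):
        # Sum of independent 1-D Gaussian log-densities.
        total = 0.0
        for m, ls, a in zip(self.mean(state), self.log_std, action):
            z = (a - m) / math.exp(ls)
            total += -0.5 * z * z - ls - 0.5 * math.log(2.0 * math.pi)
        return total

    def parameters(self):
        # Both the mean parameters and the log-std are exposed for learning.
        return {"weights": self.weights, "bias": self.bias, "log_std": self.log_std}
```

The point of the sketch is only the structure: `mean` takes the state, `std` does not, and `parameters` returns both sets of values so that the optimizer updates them together.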