Yes yes, implementation doesn't matter. As long as the mean is state-dependent and the std state-independent and both are learned.
Typically you would put both of them in the policy class yes.
Typically you would put both of them in the policy class yes.