CS-456: DQN Project: heat-map for factorized agent

Hi!

Just wanted to clarify something about the question 5c: for factorized agent do we have to plot the heat-map with 8 values that we get from the output layer of our network or with 16 Q-values which we then compute for actions that can actually be taken?

Re: DQN Project: heat-map for factorized agent

par Ariane Delrocq, samedi, 3 juin 2023, 09:18

Hi,

You can plot just the 8 Q-values of the output of the network, the Q-values for the full actions can be easily readout from these.