A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
I am sorry if this is a stupid question. I have been getting negative losses for the actors. Is this normal? If not, how should I interpret it? Thanks!
This issue appears to be discussing a feature request or bug report related to the repository. Based on the content, it seems to be resolved. The issue was opened by richielo and has received 3 comments.