A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
Hi, May I ask how do you define more than one policy and reward function concurrently in a multi-agent setting? Thank you.
This issue appears to be discussing a feature request or bug report related to the repository. Based on the content, it seems to be resolved. The issue was opened by zyzhang1130 and has received 1 comments.