A PyTorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
When the blue ball hits the green ball, it does not get a positive reward, and the green ball does not disappear. Is that correct?
The issue was opened by Kyle1993 and has received 9 comments.