A PyTorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
Thanks for the great work. Do you think training could be sped up by using multiple GPUs, or some other form of parallel training, in the policy update step instead of looping through the agents one by one? If so, what approach would you suggest? Thanks again.
The issue was opened by richielo, has received 3 comments, and appears to be resolved.