On-Policy
Visit Toolon-policy is an AI Agents & Automation tool that implements Multi-Agent PPO (MAPPO) for cooperative multi-agent games. It supports environments like StarCraftII, Hanabi, and Google Research Football.
At a glance
Trending
on-policy is an AI Agents & Automation tool that implements Multi-Agent PPO (MAPPO) for cooperative multi-agent games. It supports environments like StarCraftII, Hanabi, and Google Research Football.
Trending
About
on-policy is the official implementation of Multi-Agent PPO (MAPPO), a multi-agent variant of Proximal Policy Optimization. This open-source tool is heavily based on an existing PyTorch A2C-PPO-ACKTR-GAIL implementation and is used in the paper "The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games." It supports various environments, including StarCraftII (SMAC and SMAC v2), Hanabi, Multiagent Particle-World Environments (MPEs), and Google Research Football (GRF). The repository provides core code for algorithms, environment wrappers, training rollouts, and policy updates, with default hyperparameters available for replication.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending