Asynchronous, Option-Based Multi-Agent Policy Gradient: A Conditional Reasoning Approach

Xubo Lyu,Amin Banitalebi-Dehkordi,Mo Chen,Yong Zhang,Xubo Lyu,Amin Banitalebi-Dehkordi,Mo Chen,Yong Zhang

Cooperative multi-agent problems often require coordination between agents, which can be achieved through a centralized policy that considers the global state. Multi-agent policy gradient (MAPG) methods are commonly used to learn such policies, but they are often limited to problems with low-level action spaces. In complex problems with large state and action spaces, it is advantageous to extend M...