Policy Optimization for Markov Games: Unified Framework and Faster Convergence

Runyu Zhang, Qinghua Liu, Huan Wang, Caiming Xiong, Na Li, Yu Bai