ICLR2026

Solving Football by Exploiting Equilibrium Structure of 2p0s Differential Games with One-Sided Information

Mukesh Ghimire, Lei Zhang, Zhe Xu, Yi Ren

1 citation

Abstract

For a two-player imperfect-information extensive-form game (IIEFG) with KK time steps and a player action space of size UU, the game tree complexity is U2KU^{2K}, causing existing IIEFG solvers to struggle with large or infinite (U,K)(U,K), e.g., differential games with continuous action spaces. To partially address this scalability challenge, we focus on an important class of 2p0s games where the informed player (P1) knows the payoff while the uninformed player (P2) only has a belief over the set of II possible payoffs. Such games encompass a wide range of scenarios in sports, defense, cybersecurity, and finance. We prove that under mild conditions, P1's (resp. P2's) equilibrium strategy at any infostate concentrates on at most II (resp. I+1I+1) action prototypes. When IUI\ll U, this equilibrium structure causes the game tree complexity to collapse to IKI^K for P1 when P2 plays best responses, and (I+1)K(I+1)^K for P2 in a dual game where P1 plays best responses. We then show that exploiting this structure in model-free multiagent reinforcement learning and model predictive control leads to significant improvements in learning accuracy and efficiency from SOTA IIEFG solvers. Our demonstration solves a 22-player football game with continuous action spaces and K=10K=10 time steps, where the offense team needs to strategically conceal their play until a critical moment in order to exploit information advantage. Code is available here.