NeurIPS2024

Efficient Φ\Phi-Regret Minimization with Low-Degree Swap Deviations in Extensive-Form Games

Brian Hu Zhang, Ioannis Anagnostides, Gabriele Farina, Tuomas Sandholm

Abstract

Recent breakthrough results by Dagan, Daskalakis, Fishelson and Golowich [2023] and Peng and Rubinstein [2023] established an efficient algorithm attaining at most ϵ\epsilon swap regret over extensive-form strategy spaces of dimension NN in NO~(1/ϵ)N^{\tilde O(1/\epsilon)} rounds. On the other extreme, Farina and Pipis [2023] developed an efficient algorithm for minimizing the weaker notion of linear-swap regret in poly(N)/ϵ2\mathsf{poly}(N)/\epsilon^2 rounds. In this paper, we develop efficient parameterized algorithms for regimes between these two extremes. We introduce the set of kk-mediator deviations, which generalize the untimed communication deviations recently introduced by Zhang, Farina and Sandholm [2024] to the case of having multiple mediators, and we develop algorithms for minimizing the regret with respect to this set of deviations in NO(k)/ϵ2N^{O(k)}/\epsilon^2 rounds. Moreover, by relating kk-mediator deviations to low-degree polynomials, we show that regret minimization against degree-kk polynomial swap deviations is achievable in NO(kd)3/ϵ2N^{O(kd)^3}/\epsilon^2 rounds, where dd is the depth of the game, assuming a constant branching factor. For a fixed degree kk, this is polynomial for Bayesian games and quasipolynomial more broadly when d=polylogNd = \mathsf{polylog} N -- the usual balancedness assumption on the game tree.