NeurIPS2021

Neural Auto-Curricula in Two-Player Zero-Sum Games

Xidong Feng, Oliver Slumbers, Ziyu Wan, Bo Liu, Stephen McAleer, Ying Wen, Jun Wang, Yaodong Yang

40 citations

Abstract

In this section, we recap the meta-solver properties that we need and illustrate how we designed models to achieve them. There exist two properties the model should have. • The model should handle a variable-length matrix input. • The model should be subject to row-permutation equivariance and column-permutation invariance. Three different techniques can be utilised to achieve the first property, which also corresponds to the three different models we propose: MLP based, Conv1d based and GRU based model. If not specifically mentioned, we utilise ReLU as the activation function for all MLP used in our meta-solver.