NeurIPS2025

Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing

Davin Choo, Yuqi Pan, Tonghan Wang, Milind Tambe, Alastair van Heerden, Cheryl Johnson

4 citations

Abstract

We study a sequential decision-making problem on a nn-node graph G\mathcal{G} where each node has an unknown label from a finite set Ω\mathbf{\Omega}, drawn from a joint distribution P\mathcal{P} that is Markov with respect to G\mathcal{G}. At each step, selecting a node reveals its label and yields a label-dependent reward. The goal is to adaptively choose nodes to maximize expected accumulated discounted rewards. We impose a frontier exploration constraint, where actions are limited to neighbors of previously selected nodes, reflecting practical constraints in settings such as contact tracing and robotic exploration. We design a Gittins index-based policy that applies to general graphs and is provably optimal when G\mathcal{G} is a forest. Our implementation runs in O(n2Ω2)\mathcal{O}(n^2 \cdot |\mathbf{\Omega}|^2) time while using O(nΩ2)\mathcal{O}(n \cdot |\mathbf{\Omega}|^2) oracle calls to P\mathcal{P} and O(n2Ω)\mathcal{O}(n^2 \cdot |\mathbf{\Omega}|) space. Experiments on synthetic and real-world graphs show that our method consistently outperforms natural baselines, including in non-tree, budget-limited, and undiscounted settings. For example, in HIV testing simulations on real-world sexual interaction networks, our policy detects nearly all positive cases with only half the population tested, substantially outperforming other baselines.