ICLR2026

Sublinear Spectral Clustering Oracle with Little Memory

Ranran Shen, Xiaoyi Zhu, Pan Peng, Zengfeng Huang

Abstract

We study the problem of designing sublinear spectral clustering oracles for well-clusterable graphs. Such an oracle is an algorithm that, given query access to the adjacency list of a graph GG, first constructs a compact data structure D\mathcal{D} that captures the clustering structure of GG. Once built, D\mathcal{D} enables sublinear time responses to WhichCluster(G,x)(G,x) queries for any vertex xx. A major limitation of existing oracles is that constructing D\mathcal{D} requires Ω(n)\Omega(\sqrt{n}) memory, which becomes a bottleneck for massive graphs and memory-limited settings. In this paper, we break this barrier and establish a memory-time trade-off for sublinear spectral clustering oracles. Specifically, for well-clusterable graphs, we present oracles that construct D\mathcal{D} using much smaller than O(n)O(\sqrt{n}) memory (e.g., O(n0.01)O(n^{0.01})) while still answering membership queries in sublinear time. We also characterize the trade-off frontier between memory usage SS and query time TT, showing, for example, that ST=O~(n)S\cdot T=\widetilde{O}(n) for clusterable graphs with a logarithmic conductance gap, and we show that this trade-off is nearly optimal (up to logarithmic factors) for a natural class of approaches. Finally, to complement our theory, we validate the performance of our oracles through experiments on synthetic networks.