NeurIPS2024
Clustering with Non-adaptive Subset Queries
Hadley Black, Euiwoong Lee, Arya Mazumdar, Barna Saha
Abstract
Recovering the underlying clustering of a set of points by asking pair-wise same-cluster queries has garnered significant interest in the last decade. Given a query , , the oracle returns yes if the points are in the same cluster and no otherwise. For adaptive algorithms with pair-wise queries, the number of required queries is known to be , where is the number of clusters. However, non-adaptive schemes require queries, which matches the trivial upper bound attained by querying every pair of points. To break the quadratic barrier for non-adaptive queries, we study a generalization of this problem to subset queries for , where the oracle returns the number of clusters intersecting . Allowing for subset queries of unbounded size, queries is possible with an adaptive scheme (Chakrabarty-Liao, 2024). However, the realm of non-adaptive algorithms is completely unknown. In this paper, we give the first non-adaptive algorithms for clustering with subset queries. Our main result is a non-adaptive algorithm making queries, which improves to when is a constant. We also consider algorithms with a restricted query size of at most . In this setting we prove that queries are necessary and obtain algorithms making queries for any and queries for any . We also consider the natural special case when the clusters are balanced, obtaining non-adaptive algorithms which make and queries. Finally, allowing two rounds of adaptivity, we give an algorithm making queries in the general case and queries when the clusters are balanced.