ICLR2026
Differentially Private Domain Discovery
Vinod Raman, Travis Dick, Matthew Joseph
Abstract
We study several problems in differentially private domain discovery, where each user holds a subset of items from a shared but unknown domain, and the goal is to output an informative subset of items. For set union, we show that the simple baseline Weighted Gaussian Mechanism (WGM) has a near-optimal missing mass guarantee on Zipfian data as well as a distribution-free missing mass guarantee. We then apply the WGM as a domain-discovery precursor for existing known-domain algorithms for private top- and -hitting set and obtain new utility guarantees for their unknown domain variants. Finally, experiments demonstrate that all of our WGM-based methods are competitive with or outperform existing baselines for all three problems.