ICLR2026

Differentially Private Domain Discovery

Vinod Raman, Travis Dick, Matthew Joseph

Abstract

We study several problems in differentially private domain discovery, where each user holds a subset of items from a shared but unknown domain, and the goal is to output an informative subset of items. For set union, we show that the simple baseline Weighted Gaussian Mechanism (WGM) has a near-optimal 1\ell_1 missing mass guarantee on Zipfian data as well as a distribution-free \ell_\infty missing mass guarantee. We then apply the WGM as a domain-discovery precursor for existing known-domain algorithms for private top-kk and kk-hitting set and obtain new utility guarantees for their unknown domain variants. Finally, experiments demonstrate that all of our WGM-based methods are competitive with or outperform existing baselines for all three problems.