NeurIPS2024

Faster Differentially Private Top-k Selection: A Joint Exponential Mechanism with Pruning

Hao Wu, Hanwen Zhang

Abstract

We study the differentially private top-kk selection problem, aiming to identify a sequence of kk items with approximately the highest scores from dd items. Recent work by Gillenwater et al. (ICML'22) employs a direct sampling approach from the vast collection of dΘ(k)d^{\,\Theta(k)} possible length-kk sequences, showing superior empirical accuracy compared to previous pure or approximate differentially private methods. Their algorithm has a time and space complexity of O~(dk)\tilde{O}(dk). In this paper, we present an improved algorithm with time and space complexity O(d+k2/ϵlnd)O(d + k^2 / \epsilon \cdot \ln d), where ϵ\epsilon denotes the privacy parameter. Experimental results show that our algorithm runs orders of magnitude faster than their approach, while achieving similar empirical accuracy.