ICML2021

Differentially Private Quantiles

Jennifer Gillenwater, Matthew Joseph, Alex Kulesza

被引用 2 次

摘要

In the approximate quantiles problem, the goal is to output mm quantile estimates, the ranks of which are as close as possible to mm given quantiles 0q1qm10 \leq q_1 \leq\dots \leq q_m \leq 1. We present a mechanism for approximate quantiles that satisfies ε\varepsilon-differential privacy for a dataset of nn real numbers where the ratio between the distance between the closest pair of points and the size of the domain is bounded by ψ\psi. As long as the minimum gap between quantiles is sufficiently large, qiqi1Ω(mlog(m)log(ψ)nε)|q_i-q_{i-1}|\geq \Omega\left(\frac{m\log(m)\log(\psi)}{n\varepsilon}\right) for all ii, the maximum rank error of our mechanism is O(log(ψ)+log2(m)ε)O\left(\frac{\log(\psi) + \log^2(m)}{\varepsilon}\right) with high probability. Previously, the best known algorithm under pure DP was due to Kaplan, Schnapp, and Stemmer (ICML'22), who achieved a bound of O(log(ψ)log2(m)+log3(m)ε)O\left(\frac{\log(\psi)\log^2(m) + \log^3(m)}{\varepsilon}\right). Our improvement stems from the use of continual counting techniques which allows the quantiles to be randomized in a correlated manner. We also present an (ε,δ)(\varepsilon,\delta)-differentially private mechanism that relaxes the gap assumption without affecting the error bound, improving on existing methods when δ\delta is sufficiently close to zero. We provide experimental evaluation which confirms that our mechanism performs favorably compared to prior work in practice, in particular when the number of quantiles mm is large.