SIGMOD2023

A Neural Approach to Spatio-Temporal Data Release with User-Level Differential Privacy

Ritesh Ahuja, Sepanta Zeighami, Gabriel Ghinita, Cyrus Shahabi

被引用 13 次

摘要

Several companies (e.g., Meta, Google) have initiated "data-forgood" projects where aggregate location data are first sanitized and released publicly, which is useful to many applications in transportation, public health (e.g., COVID-19 spread) and urban planning. Differential privacy (DP) is the protection model of choice to ensure the privacy of the indivduals who generated the raw location data. However, current solutions fail to preserve data utility when each individual contributes multiple location reports (i.e., under userlevel privacy). To offset this limitation, public releases by Meta and Google use high privacy budgets (e.g., 𝜀 = 10-100), resulting in poor privacy. We propose a novel approach to release spatio-temporal data privately and accurately. We employ the pattern recognition power of neural networks, specifically variational auto-encoders (VAE), to reduce the noise introduced by DP mechanisms such that accuracy is increased, while the privacy requirement is still satisfied. Our extensive experimental evaluation on real datasets shows the clear superiority of our approach compared to benchmarks.