NeurIPS2023

Scaling Up Differentially Private LASSO Regularized Logistic Regression via Faster Frank-Wolfe Iterations

Edward Raff, Amol Khanna, Fred Lu

被引用 10 次

摘要

To the best of our knowledge, there are no methods today for training differentially private regression models on sparse input data. To remedy this, we adapt the Frank-Wolfe algorithm for L1L_1 penalized linear regression to be aware of sparse inputs and to use them effectively. In doing so, we reduce the training time of the algorithm from O(TDS+TNS)\mathcal{O}( T D S + T N S) to O(NS+TDlogD+TS2)\mathcal{O}(N S + T \sqrt{D} \log{D} + T S^2), where TT is the number of iterations and a sparsity rate SS of a dataset with NN rows and DD features. Our results demonstrate that this procedure can reduce runtime by a factor of up to 2,200×2,200\times, depending on the value of the privacy parameter ϵ\epsilon and the sparsity of the dataset.