ICML2023

The Fast Johnson-Lindenstrauss Transform Is Even Faster

Ora Nova Fandina, Mikael Møller Høgsgaard, Kasper Green Larsen

被引用 7 次

摘要

The seminal Fast Johnson-Lindenstrauss (Fast JL) transform by Ailon and Chazelle (SICOMP'09) embeds a set of nn points in dd-dimensional Euclidean space into optimal k=O(ε2lnn)k=O(\varepsilon^{-2} \ln n) dimensions, while preserving all pairwise distances to within a factor (1±ε)(1 \pm \varepsilon). The Fast JL transform supports computing the embedding of a data point in O(dlnd+kln2n)O(d \ln d +k \ln^2 n) time, where the dlndd \ln d term comes from multiplication with a d×dd \times d Hadamard matrix and the kln2nk \ln^2 n term comes from multiplication with a sparse k×dk \times d matrix. Despite the Fast JL transform being more than a decade old, it is one of the fastest dimensionality reduction techniques for many tradeoffs between ε,d\varepsilon, d and nn. In this work, we give a surprising new analysis of the Fast JL transform, showing that the kln2nk \ln^2 n term in the embedding time can be improved to (kln2n)/α(k \ln^2 n)/\alpha for an α=Ω(min{ε1ln(1/ε),lnn})\alpha = \Omega(\min\{\varepsilon^{-1}\ln(1/\varepsilon), \ln n\}). The improvement follows by using an even sparser matrix. We also complement our improved analysis with a lower bound showing that our new analysis is in fact tight.