NeurIPS2024
Approximating the Top Eigenvector in Random Order Streams
Praneeth Kacham, David P. Woodruff
Abstract
When rows of an matrix are given in a stream, we study algorithms for approximating the top eigenvector of the matrix (equivalently, the top right singular vector of ). We consider worst case inputs but assume that the rows are presented to the streaming algorithm in a uniformly random order. We show that when the gap parameter , then there is a randomized algorithm that uses bits of space and outputs a unit vector that has a correlation with the top eigenvector . Here denotes the number of heavy rows in the matrix, defined as the rows with Euclidean norm at least . We also provide a lower bound showing that any algorithm using bits of space can obtain at most correlation with the top eigenvector. Thus, parameterizing the space complexity in terms of the number of heavy rows is necessary for high accuracy solutions. Our results improve upon the requirement in a recent work of Price and Xun (FOCS 2024). We note that the algorithm of Price and Xun works for arbitrary order streams whereas our algorithm requires a stronger assumption that the rows are presented in a uniformly random order. We additionally show that the gap requirements in their analysis can be brought down to for arbitrary order streams and for random order streams. The requirement of for random order streams is nearly tight for their analysis as we obtain a simple instance with for which their algorithm, with any fixed learning rate, cannot output a vector approximating the top eigenvector .