NeurIPS2022
Fast Distance Oracles for Any Symmetric Norm
Yichuan Deng, Zhao Song, Omri Weinstein, Ruizhe Zhang
10 citations
Abstract
In the Distance Oracle problem, the goal is to preprocess vectors in a -dimensional metric space into a cheap data structure, so that given a query vector and a subset of the input data points, all distances for can be quickly approximated (faster than the trivial query time). This primitive is a basic subroutine in machine learning, data mining and similarity search applications. In the case of norms, the problem is well understood, and optimal data structures are known for most values of . Our main contribution is a fast distance oracle for any symmetric norm . This class includes norms and Orlicz norms as special cases, as well as other norms used in practice, e.g. top- norms, max-mixture and sum-mixture of norms, small-support norms and the box-norm. We propose a novel data structure with preprocessing time and space, and query time, for computing distances to a subset of data points, where is a complexity-measure (concentration modulus) of the symmetric norm. When , this runtime matches the aforementioned state-of-art oracles.