ICML2023
A new near-linear time algorithm for k-nearest neighbor search using a compressed cover tree
Yury Elkin, Vitaliy Kurlin
18 citations
Abstract
Given a reference set of points and a query set of points in a metric space, this paper studies an important problem of finding -nearest neighbors of every point in the set in a near-linear time. In the paper at ICML 2006, Beygelzimer, Kakade, and Langford introduced a cover tree on and attempted to prove that this tree can be built in time while the nearest neighbor search can be done in time with a hidden dimensionality factor. This paper fills a substantial gap in the past proofs of time complexity by defining a simpler compressed cover tree on the reference set . The first new algorithm constructs a compressed cover tree in time. The second new algorithm finds all -nearest neighbors of all points from using a compressed cover tree in time with a hidden dimensionality factor depending on point distributions of the given sets but not on their sizes.