ICLR2026

A Scalable Constant-Factor Approximation Algorithm for WpW_p Optimal Transport

Pankaj K Agarwal, Oliver Chubet, Sharath Raghvendra, Keegan Yao

摘要

Let (X,d)(X,d) be a metric space and let μ,ν\mu,\nu be discrete probability distributions supported on finite point sets A,BXA,B \subseteq X. For any p[1,]p \in [1,\infty], the WpW_p-distance between μ\mu and ν\nu, Wp(μ,ν)W_p(\mu, \nu), is defined as the pp-th root of the minimum cost of transporting all the probability mass from μ\mu to ν\nu, where moving a probability mass of δ\delta from aAa \in A to bBb \in B incurs a cost of δd(a,b)p\delta d(a,b)^p. We give a (Las Vegas) randomized algorithm that computes a (4+ε)(4+\varepsilon)-approximate WpW_p optimal-transport (OT) plan in O(n2+(n3/2ε1lognlogΔ)1+o(1)logU)O(n^2 + (n^{3/2}\varepsilon^{-1}\log n\log\Delta)^{1+o(1)}\log U) time with probability at least 11/n1-1/n, for all p[1,]p \in [1,\infty], where ε>0\varepsilon > 0 is an arbitrarily small constant and Δ\Delta is the ratio between the largest and smallest interpoint distances in ABA\cup B. The previous best result achieved an O(logn)O(\log n)-approximation in O(pn2)O(pn^2) time, for constant values of pp. Our algorithm significantly improves the approximation factor and, importantly, is the first quadratic-time method that extends to the WW_\infty-distance. In contrast, additive approximation methods such as Sinkhorn are efficient only for constant pp and fail to handle p=p=\infty. Our algorithm also extends to a query model where, for any integer k>1k > 1, we give an algorithm that preprocesses XX into clusters in O(n2+kn1+1/klognlogΔ)O(n^2+kn^{1+1/k}\log n\log\Delta) time, after which a O(k)O(k)-approximate WpW_p distance between any two distributions μ\mu and ν\nu with XX as support can be computed in (n1+1/klognlogΔ)1+o(1)(n^{1+1/k}\log n\log\Delta)^{1+o(1)} time with probability at most 11/n1-1/n. Finally, for p=p=\infty, we show that obtaining a relative approximation factor better than 22 in O(n2)O(n^2) time would resolve the long-standing open problem of computing a perfect matching in an arbitrary bipartite graph in quadratic time.