NeurIPS2023

Efficient Sampling of Stochastic Differential Equations with Positive Semi-Definite Models

Anant Raj, Umut Simsekli, Alessandro Rudi

3 citations

Abstract

This paper deals with the problem of efficient sampling from a stochastic differential equation, given the drift function and the diffusion matrix. The proposed approach leverages a recent model for probabilities (the positive semi-definite -- PSD model) from which it is possible to obtain independent and identically distributed (i.i.d.) samples at precision ε\varepsilon with a cost that is m2dlog(1/ε)m^2 d \log(1/\varepsilon) where mm is the dimension of the model, dd the dimension of the space. The proposed approach consists in: first, computing the PSD model that satisfies the Fokker-Planck equation (or its fractional variant) associated with the SDE, up to error ε\varepsilon, and then sampling from the resulting PSD model. Assuming some regularity of the Fokker-Planck solution (i.e. β\beta-times differentiability plus some geometric condition on its zeros) We obtain an algorithm that: (a) in the preparatory phase obtains a PSD model with L2 distance ε\varepsilon from the solution of the equation, with a model of dimension m=ε(d+1)/(β2s)(log(1/ε))d+1m = \varepsilon^{-(d+1)/(\beta-2s)} (\log(1/\varepsilon))^{d+1} where 1/2s11/2\leq s\leq1 is the fractional power to the Laplacian, and total computational complexity of O(m3.5log(1/ε))O(m^{3.5} \log(1/\varepsilon)) and then (b) for Fokker-Planck equation, it is able to produce i.i.d. samples with error ε\varepsilon in Wasserstein-1 distance, with a cost that is O(dε2(d+1)/β2log(1/ε)2d+3)O(d \varepsilon^{-2(d+1)/\beta-2} \log(1/\varepsilon)^{2d+3}) per sample. This means that, if the probability associated with the SDE is somewhat regular, i.e. β4d+2\beta \geq 4d+2, then the algorithm requires O(ε0.88log(1/ε)4.5d)O(\varepsilon^{-0.88} \log(1/\varepsilon)^{4.5d}) in the preparatory phase, and O(ε1/2log(1/ε)2d+2)O(\varepsilon^{-1/2}\log(1/\varepsilon)^{2d+2}) for each sample. Our results suggest that as the true solution gets smoother, we can circumvent the curse of dimensionality without requiring any sort of convexity.