NeurIPS2024

Non-asymptotic Global Convergence Analysis of BFGS with the Armijo-Wolfe Line Search

Qiujiang Jin, Ruichen Jiang, Aryan Mokhtari

Abstract

In this paper, we present the first explicit and non-asymptotic global convergence rates of the BFGS method when implemented with an inexact line search scheme satisfying the Armijo-Wolfe conditions. We show that BFGS achieves a global linear convergence rate of (11κ)t(1 - \frac{1}{\kappa})^t for μ\mu-strongly convex functions with LL-Lipschitz gradients, where κ=Lμ\kappa = \frac{L}{\mu} represents the condition number. Additionally, if the objective function's Hessian is Lipschitz, BFGS with the Armijo-Wolfe line search achieves a linear convergence rate that depends solely on the line search parameters, independent of the condition number. We also establish a global superlinear convergence rate of O((1t)t)\mathcal{O}((\frac{1}{t})^t). These global bounds are all valid for any starting point x0x_0 and any symmetric positive definite initial Hessian approximation matrix B0B_0, though the choice of B0B_0 impacts the number of iterations needed to achieve these rates. By synthesizing these results, we outline the first global complexity characterization of BFGS with the Armijo-Wolfe line search. Additionally, we clearly define a mechanism for selecting the step size to satisfy the Armijo-Wolfe conditions and characterize its overall complexity.