NeurIPS2024

Fully Unconstrained Online Learning

Ashok Cutkosky, Zakaria Mhammedi

Abstract

We provide an online learning algorithm that obtains regret GwTlog(wGT)+w2+G2G\|w_\star\|\sqrt{T\log(\|w_\star\|G\sqrt{T})} + \|w_\star\|^2 + G^2 on GG-Lipschitz convex losses for any comparison point ww_\star without knowing either GG or w\|w_\star\|. Importantly, this matches the optimal bound GwTG\|w_\star\|\sqrt{T} available with such knowledge (up to logarithmic factors), unless either w\|w_\star\| or GG is so large that even GwTG\|w_\star\|\sqrt{T} is roughly linear in TT. Thus, it matches the optimal bound in all cases in which one can achieve sublinear regret, which arguably most"interesting"scenarios.