NeurIPS 2023

Nearly Optimal Bounds for Cyclic Forgetting

William Swartworth, Deanna Needell, Rachel A. Ward, Mark Kong, Halyun Jeong

Cited 12 times

Abstract

We provide theoretical bounds on the forgetting quantity in the continual learning setting for linear tasks, where each round of learning corresponds to projecting onto a linear subspace. For a cyclic task ordering on $T$ tasks repeated $m$ times each, we prove the best known upper bound of $O(T^2/m)$ on the forgetting. Notably, our bound holds uniformly over all choices of tasks and is independent of the ambient dimension. Our main technical contribution is a characterization of the union of all numerical ranges of products of $T$ (real or complex) projections as a sinusoidal spiral, which may be of independent interest.
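The setting described in the abstract can be illustrated with a small numerical sketch. This is not the authors' code, and the definition of forgetting used here is an assumption made for illustration: the error vector is passed through $T$ orthogonal projections in cyclic order for $m$ cycles, and forgetting is taken to be the average squared residual of the final error with respect to each task's subspace. The paper's exact definition and normalization may differ. The helper names `null_space_projector` and `cyclic_forgetting`, the random tasks, and the dimensions are all hypothetical.

```python
import numpy as np


def null_space_projector(X):
    """Orthogonal projector onto the null space of X, computed as I - X^+ X."""
    d = X.shape[1]
    return np.eye(d) - np.linalg.pinv(X) @ X


def cyclic_forgetting(projectors, e0, m):
    """Apply the projectors in cyclic order for m cycles, then return the
    average squared residual of the final error over all tasks.

    Assumption: this average residual stands in for the "forgetting"
    quantity; it is an illustrative choice, not the paper's definition.
    """
    e = e0.copy()
    for _ in range(m):
        for P in projectors:
            e = P @ e  # learning one task = one projection of the error
    T = len(projectors)
    return sum(np.linalg.norm(e - P @ e) ** 2 for P in projectors) / T


# Hypothetical example: T random tasks in dimension d, each with `rank` samples.
rng = np.random.default_rng(0)
d, T, rank = 20, 3, 5
projectors = [
    null_space_projector(rng.standard_normal((rank, d))) for _ in range(T)
]

e0 = rng.standard_normal(d)
e0 /= np.linalg.norm(e0)  # unit-norm initial error

cycles = (1, 10, 100)
values = [cyclic_forgetting(projectors, e0, m) for m in cycles]
for m, v in zip(cycles, values):
    print(f"m = {m:4d}   forgetting = {v:.3e}   T^2/m = {T**2 / m:.3e}")
```

Printing the measured value next to $T^2/m$ only gives a rough sense of scale, since the constant hidden in the $O(\cdot)$ is not specified in the abstract. Random tasks like these are also not worst-case instances, so the observed decay may be much faster than the bound suggests.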