ICLR2026

The Power of Small Initialization in Noisy Low-Tubal-Rank Tensor Recovery

Zhiyu Liu, Haobo Geng, Xudong Wang, Yandong Tang, Zhi Han, Yao Wang

1 citation

Abstract

We study the problem of recovering a low-tubal-rank tensor X_Rn×n×k\mathcal{X}\_\star\in \mathbb{R}^{n \times n \times k} from noisy linear measurements under the t-product framework. A widely adopted strategy involves factorizing the optimization variable as UU\mathcal{U} * \mathcal{U}^\top, where URn×R×k\mathcal{U} \in \mathbb{R}^{n \times R \times k}, followed by applying factorized gradient descent (FGD) to solve the resulting optimization problem. Since the tubal-rank rr of the underlying tensor X\mathcal{X}_\star is typically unknown, this method often assumes r<Rnr < R \le n, a regime known as over-parameterization. However, when the measurements are corrupted by some dense noise (e.g., sub-Gaussian noise), FGD with the commonly used spectral initialization yields a recovery error that grows linearly with the over-estimated tubal-rank RR. To address this issue, we show that using a small initialization enables FGD to achieve a nearly minimax optimal recovery error, even when the tubal-rank RR is significantly overestimated. Using a four-stage analytic framework, we analyze this phenomenon and establish the sharpest known error bound to date, which is independent of the overestimated tubal-rank RR. Furthermore, we provide a theoretical guarantee showing that an easy-to-use early stopping strategy can achieve the best known result in practice. All these theoretical findings are validated through a series of simulations and real-data experiments.