WWW2026

Shift-Resilient Diffusive Imputation for Variable Subset Forecasting

Haihua Xu, Qi Hao, He Zhang, Jianpeng Zhao, Ziyue Qiao, Lu Jiang, Pengfei Wang, Yingjie Zhou, Pengyang Wang

摘要

As an important task in modern web technologies, multivariate time series forecasting drives many core functionalities. However, in realistic web-scale environments, sensor failures, privacy filtering, sampling, and instrumentation churn frequently yield missing variables, making training sets complete while test sets contain only a small subset of variables. The challenge lies in utilizing incomplete data for forecasting, which is known as Variable Subset Forecasting (VSF). Distribution shift is inherent to time series and remains in VSF, including inter-series shift as changes in cross-series correlations and intra-series shift as substantial distribution differences within the same series across different time windows. Existing VSF approaches typically impute missing variables and then forecast on the completed series, yet they overlook these shifts and thus underperform in realistic web-scale scenarios. To address these challenges, we propose Shift Resilient Diffusive Imputation (SRDI), a framework tailored to VSF and robust to distribution shift. Specifically, SRDI integrates a divide-conquer strategy with the denoising process, which decomposes the input into invariant patterns and variant patterns, representing the temporally stable parts of inter-series correlation and the highly fluctuating parts, respectively. By extracting spatiotemporal features from each part separately and then appropriately combining them, inter-series shift can be effectively mitigated. Then, we innovatively organize SRDI and the forecasting model into a meta-learning paradigm tailored for VSF scenarios. We address the intra-series shift by treating time windows as tasks during training and employing an adaptation process before testing, which naturally supports robust online forecasting in dynamic web environments. Extensive experiments on four datasets have demonstrated our superior performance compared with state-of-the-art methods. Our code is available at https://github.com/xhhmacau/SRDI.