KDD2025

Generative Imputation with Multi-level Causal Consistency for Variable Subset Forecasting

Qi Hao, Yue Gao, Runchang Liang, Yunhe Zhang, Pengyang Wang

被引用 1 次

摘要

Variable Subset Forecasting (VSF) poses critical challenges in time series analysis when entire variables become unavailable during inference. Existing imputation methods relying on inter-variable correlations fail catastrophically in VSF due to two inherent limitations: (1) Missing variable collapse, where the complete absence of certain variables invalidates correlation-based dependency learning, and (2) Temporal covariate shift, where time-evolving data distributions destabilize correlation patterns learned from training data. To address these fundamental issues, we propose Generative Imputation with Multi-level Causal Consistency (GIMCC ), establishing causality-driven imputation as the first principled solution for VSF. Our key innovation lies in enforcing causal invariance through dual consistency constraints: global causal isomorphism ensures the imputed variables preserve the ground-truth causal graph structure of the complete system, while local causal subgraph alignment maintains consistency between observed variables and their causal neighborhood dependencies. By decoupling causality from spurious correlations, GIMCC provides time-invariant imputation signals robust to distribution shifts, which explicitly preserves causal relationships via multivariate spectral convolutions. Extensive experiments across five real-world domains demonstrate that GIMCC achieves average improvements of 20-60% in MAE/RMSE over correlation-based imputation baselines, remarkably outperforming full-variable training ( Oracle ) in temporal covariate shift scenarios. Our work bridges the critical gap between causal analysis and practical forecasting systems under variable absence, offering theoretically grounded guarantees for real-world deployment.