WWW2026

Invariant Learning on Heterogeneous Graphs via Subgraph Environment Inference

Yanghui Fu, Yunfei Wang, Hao Zou, Yue He, Haotian Wang, Qing Cheng, Guangquan Cheng, Shixuan Liu

摘要

The out-of-distribution (OOD) generalization of graph neural networks poses significant challenges in Web applications, where data resides in complex heterogeneous information networks. Such networks exhibit not only structural heterogeneity but also distribution shifts arising from evolving user behaviors and data collection biases. Conventional GNNs often struggle to identify stable patterns in such heterogeneous graphs, particularly in the absence of explicit environment labels. The core issue is that latent environments can create spurious correlations between node features, local topology, and labels. Models may then rely on these environment-specific shortcuts for predictions, failing to learn the invariant mechanisms that generalize under distribution shifts. To address these limitations, we propose InvHG (Invariant Learning on Heterogeneous Graphs via Subgraph Environment Inference), a causality-inspired framework that infers latent environments at the subgraph level, disentangles type-specific confounding effects, and leverages regularized expert fusion to learn invariant representations. Extensive experiments on heterogeneous graph OOD benchmarks demonstrate that InvHG consistently outperforms state-of-the-art methods, offering a robust solution for complex Web graph learning. The source code is available at https://github.com/mok630/InvHG.