AAAI2026
Scalable Semi-supervised Community Search via Graph Transformer on Attributed Heterogeneous Information Networks
Linlin Ding, Zhaosong Zhao, Mo Li, Yishan Pan, Xin Wang, Renata Borovica-Gajic
摘要
Attributed heterogeneous information networks (AHINs) encode rich semantics through diverse node and edge types. Recent learning-based community search methods on AHINs have shown promising performance but face two major limitations: (i) difficulty scaling to large graphs due to memoryintensive neighbor-based propagation (e.g., GNNs and nodelevel attention), and (ii) reliance on explicit communitylevel labels, which are often unavailable or costly to obtain. To address these issues, we propose a scalable Semisupervised Community Search framework on AHINs (SC-SAH), enabling scalability and efficiency, while eliminating the need for community-level labels by leveraging readily available node classification labels. Specifically, we devise MvSF2Token to extract Multi-view Semantic Features (MvSFs) as compact subgraph-level tokens before training, significantly reducing model propagation complexity. We then design a View-Aware Semantic Graph Transformer (VASGhormer) to effectively encode MvSFs by capturing cross-view dependencies and fusing semantic features. The combination of MvSF2Token and VASGhormer ensures scalability, efficiency, and robust performance. Furthermore, we propose a novel View-Aware Contrastive Learner to train VASGhormer without requiring community-level supervision. Extensive experiments on five real-world datasets show that SCSAH significantly outperforms state-of-the-art methods, achieving 18.06% higher performance and 10.43× faster training. Our code is included in the supplementary material.