WWW2026

Relation-Aware Multimodal Analogical Reasoning with Modality Fingerprints and Adaptive Gating

Ruofan Wang, Zijian Huang, Qiqi Wang, Yuchen Su, Robert Amor, Kaiqi Zhao, Meng-Fen Chiang

Abstract

Analogical reasoning over Multimodal Knowledge Graphs (MMKGs) couples abductive relation induction with inductive tail completion. However, existing approaches rely on static fusion mechanisms that overlook the inherent asymmetry of modal relevance: while visual cues elucidate concrete entities, they are often noisy or irrelevant for abstract concepts, where text and graph structure provide decisive signals. Furthermore, prior methods fail to enforce consistency between induced relations and the modality patterns implied by the analogical context. To bridge this gap, we introduce RMAR, a Relation-aware Multimodal Analogical Reasoning framework with two complementary paths. An explicit path estimates modality fingerprints to score compatibility during relation induction and guide fusion during tail completion. An implicit path employs adaptive gating to blend structural, textual, and visual signals conditioned on the specific query context. To address the limitations of current benchmarks, which overrepresent concrete entities, we release MCNetAnalogy, and its companion graph, MCNetKG, a rigorous dataset enriched with abstract concepts and actions. RMAR is backbone-agnostic and works with multimodal knowledge graph embedding (MKGE) and transformer-based (MPT) pipelines. Extensive experiments demonstrate that RMAR delivers consistent gains across both embedding-based and transformer-based backbones, achieving a 29% relative improvement on MCNetAnalogy. Ablation studies confirm that RMAR's relation-aware modulation is particularly effective when modal evidence is weak or ambiguous.