WWW2026

CEAT: Context-Emotion Adversarial Training Framework for Robust Emotion-Driven Fraud Detection

Chaoqun Li, Si Wu, Yijun Lu, Yuyin Ma, Jinyao Liu, Dingyi Jia, Mingda Han, Feng Li, Pengfei Hu

Abstract

The rapid proliferation of emotion-aware web services has necessitated the analysis of multimodal user interactions. However, this introduces new vulnerabilities where adversaries exploit emotional signals to circumvent fraud detection systems. Despite its improved utility, the robustness of multimodal fraud detection against emotion-driven adversarial manipulation remains significantly underexplored. Existing paradigms often treat emotional cues as static features, overlooking the adversary's capability to strategically modulate multimodal signals (e.g., facial micro-expressions, vocal intonation, and textual styles) to mimic genuine behavior. Furthermore, prevalent evaluations are typically confined to unimodal perturbations and fail to account for context-consistent, cross-modal attacks, thereby compromising system reliability in real-world deployments. To bridge this gap, we propose Context-Emotion Adversarial Training (CEAT), a robust framework designed to fortify multimodal fraud detection against emotion-based attacks. CEAT leverages a Transformer-based architecture to synergistically model emotional features (e.g., visual dynamics and acoustic prosody) alongside semantic context derived from text, yielding a unified representation. Crucially, CEAT introduces a context-aware perturbation mechanism that injects noise into the emotional latent space during training. This process preserves semantic consistency while encouraging the learning of emotion-invariant and discriminative representations. Additionally, a contrastive learning objective is integrated to maximize the distributional divergence between genuine and adversarial samples within the latent manifold. Extensive experiments on multimodal benchmarks demonstrate that CEAT significantly outperforms state-of-the-art baselines, exhibiting superior robustness under simulated emotion-driven attack scenarios.