WWW2026

Collusion-Aware Set-level Learning Framework for Detecting Spamming Activities

Yuli Liu

Abstract

Spamming activities (e.g., fake reviews, click farming, and deceptive content promotion) are increasingly conducted through collusive groups that exploit collective dynamics to manipulate platform metrics and mislead users, posing serious threats to the fairness, credibility, and functionality of online systems. To counteract these harmful behaviors, the task of spam detection has emerged as a critical area of research. However, existing detection methods generally remain limited in three key aspects: (i) They treat detection as a standard classification task, where representation learning and optimization are loosely coupled and suboptimal for capturing complex behaviors; (ii) They rely primarily on individual-level representation modeling, making it difficult to detect collective cheating strategies; (iii) They lack dedicated objective functions explicitly designed to characterize group-level spamming activities. To overcome these limitations, we introduce a collusion-aware Set-level learning framework (SetDet) that redefines the spam Detection task as a unified setwise optimization problem. Our approach offers three core advantages: (i) It enables end-to-end optimization by jointly learning representations and performing detection in a single, integrated process; (ii) It incorporates a model-level design for collusion representation, effectively capturing the temporal and relational patterns of coordinated spam; (iii) It pioneers a dedicated set-level optimization criterion that aligns closely with the structural characteristics of group-based cheating behaviors and accounts for class imbalance in real-world scenarios. Extensive experiments confirm the generalizability and superior performance of our framework across diverse spam scenarios and collusion strategies.