ICLR2026
Near Optimal Robust Federated Learning Against Data Poisoning Attack
Jingfan Yu, Zhixuan Fang
Abstract
We revisit data poisoning attacks in the federated learning system. There will be worker nodes (each has training data samples) cooperatively training one model for a machine-learning task, and a fraction (i.e., ) of the workers may suffer from the data poisoning attack. We mainly focus on the challenging and practical case where is small and is large, such that each worker does not have enough statistical information to identify the poisoned data by itself, while in total they have enough data to learn the task if the poisoned data are detected. Therefore, we propose a mechanism for workers to cooperatively detect workers with poisoned data. In terms of attack loss, our mechanism achieves in IID setting and in non-IID setting, where is the VC-dimension of the learning model and is a concentration parameter characterizing the non-IIDness. Alongside attack loss, our mechanism limits the adversary’s free-ride gain even when it cannot be directly quantified by the attack loss. We also propose the lower bound of the attack loss, and our proposed algorithm matches the lower bound when both in IID setting and non-IID setting.