WWW2026

Prompt-Induced Linguistic Fingerprints for LLM-Generated Fake News Detection

Chi Wang, Min Gao, Zongwei Wang, Junwei Yin, Kai Shu, Chenghua Lin

被引用 3 次

摘要

With the rapid advancement of large language models (LLMs), producing realistic fake news has become increasingly effortless, challenging existing detection methods that rely on lexical and syntactic patterns. To address this, we shift our focus to the generation process and analyze how malicious prompts manipulate model outputs. We construct pairs of LLM-generated real and fake news and apply malicious prompts to reconstruct them as fake. By comparing the original-token generation probabilities recorded during reconstruction, we observe a consistent statistical divergence: tokens from real news tend to have lower reconstruction likelihoods than those from fake news. We define this distributional divergence as linguistic fingerprint. Building on this insight, we propose LIFE (Linguistic Fingerprints Extraction), a novel detection framework that reconstructs token-level probability distributions guided by malicious prompts to capture these discriminative linguistic patterns. To fully exploit the extracted fingerprints, LIFE further introduces a key-fragment amplification module that adaptively identifies and accentuates the most distinctive linguistic fragments, thereby enhancing detection reliability across diverse prompting scenarios. Extensive experiments demonstrate that LIFE achieves state-ofthe-art performance in detecting LLM-generated fake news while maintaining strong generalization to human-LLM mixed cases. The code is available 1 . CCS Concepts • Computing methodologies → Natural language processing.