EMNLP2025

DiNaM: Disinformation Narrative Mining with Large Language Models

Witold Sosnowski, Arkadiusz Modzelewski, Kinga Skorupska, Adam Wierzbicki

Abstract

Disinformation poses a significant threat to democratic societies, public health, and national security. To address this challenge, factchecking experts analyze and track disinformation narratives. However, the process of manually identifying these narratives is highly time-consuming and resource-intensive. In this article, we introduce DiNaM, the first algorithm and structured framework specifically designed for mining disinformation narratives. DiNaM uses a multi-step approach to uncover disinformation narratives. It first leverages Large Language Models (LLMs) to detect false information, then applies clustering techniques to identify underlying disinformation narratives. We evaluated DiNaM's performance using groundtruth disinformation narratives from the EUD-isinfoTest dataset. The evaluation employed the Weighted Chamfer Distance (WCD), which measures the similarity between two sets of embeddings: the ground truth and the predicted disinformation narratives. DiNaM achieved a state-of-the-art WCD score of 0.73, outperforming general-purpose narrative mining methods by a notable margin of 16.4-24.7%. We are releasing DiNaM's codebase and the dataset to the public.