EMNLP2025

REALM: Recursive Relevance Modeling for LLM-based Document Re-Ranking

Pinhuan Wang, Zhiqiu Xia, Chunhua Liao, Feiyi Wang, Hang Liu

Abstract

Large Language Models (LLMs) have shown strong capabilities in document re-ranking, a key component in modern Information Retrieval (IR) systems. However, existing LLMbased approaches face notable limitations, including ranking uncertainty, unstable top-k recovery, and high token cost due to tokenintensive prompting. To effectively address these limitations, we propose REALM, an uncertainty-aware re-ranking framework that models LLM-derived relevance as Gaussian distributions and refines them through recursive Bayesian updates. By explicitly capturing uncertainty and minimizing redundant queries, REALM achieves better rankings more efficiently. Experimental results demonstrate that our REALM surpasses state-of-the-art rerankers while significantly reducing token usage and latency, improving NDCG@10 by 0.7 -11.9 and simultaneously reducing the number of LLM inferences by 23.4 -84.4%, promoting it as the next-generation re-ranker for modern IR systems.