WWW2026
Beyond Factual Queries: A Novel Predictive Retrieval-Augmented Generation
Debo Cheng, Jianfeng Deng, Qingfeng Chen, Jinyi Jie, Jiangzhang Gan
Abstract
Retrieval-augmented generation (RAG) has proven effective at mitigating limitations of large language models (LLMs), including outdated knowledge, semantic gaps, and hallucinations. However, existing RAG techniques are primarily optimised for factual question answering—where answers can be directly retrieved—rather than for predictive tasks that require inferring unknown outcomes, such as user preferences in recommendations. To address this gap, we propose PRAG, a RAG framework tailored for predictive settings. PRAG computes the semantic similarity between a user's historical interactions and target information, and adaptively integrates this similarity as prompt weights, thereby enhancing the LLM's ability to model personalised, semantics-informed preferences. PRAG integrates seamlessly into LLM-based applications; instantiated in recommender systems, it explicitly captures user-specific preference signals via semantic similarity modelling. Experiments using two LLM backbones and four real-world datasets show that PRAG significantly improves predictive recommendation performance.