WWW2026

From Prediction to Understanding: Leveraging Reasoning in Large Language Model-based Recommendations

Zhi-Yuan Chen, Siyu Lu, Qiang Liu, Xingxing Wang, Yankai Lin

摘要

Recently, large language models (LLMs) have shown great promise in sequential recommendation. Existing methods typically transform users' historical interactions into textual sequences and then feed these sequences into LLMs to generate recommended items. However, since the output of LLMs is limited to recommendations, the models must analyze user interactions and infer user preferences implicitly. This implicit approach constrains the expressive reasoning abilities of LLMs. Moreover, as the output of LLMs consists solely of recommended items, the resulting recommendations lack explainability. To address these issues, we propose RE2, which enables LLMs to explicitly generate reasoning content before providing recommendations. Making the reasoning process explicit helps elicit the reasoning abilities of LLMs and simultaneously enhances the explainability of recommendation results. RE2 consists of three steps: (1) Reasoning Collection, which collects reasoning data by guiding LLMs to generate both reasoning content and recommended items for each interaction sequence. (2) Pattern Imitation, which leverages the collected data to train LLMs via supervised fine-tuning to imitate the pattern of first generating reasoning content and then providing recommendations. (3) Pattern Internalization, which further internalizes this reasoning-and-recommendation pattern and enhances both recommendation performance and the rationality of the reasoning content through reinforcement learning. RE2 can be implemented under both the self-distillation and teacher-distillation frameworks, without requiring external user metadata such as user reviews. Experimental results demonstrate the effectiveness of RE2 in improving recommendation performance and in generating high-quality reasoning content. Furthermore, we show that RE2 can mitigate popularity bias while maintaining recommendation accuracy to some extent. Our data and code are available at https://github.com/zhiyuanc2001/RE2.