ICML2025
GSM-∞: How Do your LLMs Behave over Infinitely Increasing Reasoning Complexity and Context Length?
Yang Zhou, Hongyi Liu, Zhuoming Chen, Yuandong Tian, Beidi Chen
摘要
Motivation Overview RAG Results Computation Graph GSM-Infinite Experiments 3 Motivation Overview RAG Results Computation Graph GSM-Infinite Experiments 4 Motivation Overview RAG Results Computation Graph GSM-Infinite Experiments Long-context LLMs are Getting Amazingly Strong Gemini 1.5 Pro is getting almost Perfect Score on 10M Context Retrieval