ICML2025

GSM-∞: How Do your LLMs Behave over Infinitely Increasing Reasoning Complexity and Context Length?

Yang Zhou, Hongyi Liu, Zhuoming Chen, Yuandong Tian, Beidi Chen

摘要

Motivation Overview RAG Results Computation Graph GSM-Infinite Experiments 3 Motivation Overview RAG Results Computation Graph GSM-Infinite Experiments 4 Motivation Overview RAG Results Computation Graph GSM-Infinite Experiments Long-context LLMs are Getting Amazingly Strong Gemini 1.5 Pro is getting almost Perfect Score on 10M Context Retrieval