ICML2025
AuPair: Golden Example Pairs for Code Repair
Aditi Mavalankar, Hassan Mansoor, Zita Marinho, Mariia Samsikova, Tom Schaul
摘要
Scaling up inference-time compute has proven to be a valuable strategy in improving the performance of Large Language Models (LLMs) without fine-tuning. An important task that can benefit from additional inference-time compute is selfrepair; given an initial flawed response or guess, the LLM corrects its own mistake and produces an improved response or fix. We leverage the in-