ICLR2025

Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks

Rushang Karia, Daniel Bramblett, Daksh Dobhal, Siddharth Srivastava

摘要

Collaboration with AI requires clear, effective communication. Promising approach: having the AI between the AI and the human. • natural language (e.g., describing code) • formal language (e.g., code, system specifications) This approach is already being used in vibe coding. • Generates code from a prompt • Explains the generated code This requires the AI to be semantically accurate doing this translation.