AAAI2026

AgentGraph: Trace-to-Graph Platform for Interactive Analysis and Robustness Testing in Agentic AI Systems

Zekun Wu, Seonglae Cho, Cristian E. Muñoz Villalobos, Theo King, Umar Mohammed, Emre Kazim, María Pérez-Ortiz, Sahan Bulathwela, Adriano S. Koshiyama

Abstract

Modern Agentic AI systems plan, reason, and act across multiple steps, creating execution patterns that are difficult to interpret. Existing observability platforms track prompt I/O and operational metrics but require manual inspection of traces to reconstruct structure and reasoning. We present Agent-Graph, which converts execution logs into interactive knowledge graphs and actionable insights. Nodes represent agents, tasks, tools, data inputs/outputs, and humans, while typed edges capture relations such as inputs consumed, tasks delegated or sequenced, tools required or used, outputs produced and delivered, and interventions from agents or humans. Each graph element links to its exact trace span, ensuring verifiability. Building on this representation, AgentGraph enables two analyses: qualitative trace-grounded failure detection and optimisation recommendations, and quantitative robustness evaluation via perturbation testing and causal attribution. Live Demo: huggingface.co/spaces/holistic-ai/AgentGraph Demo Video: youtu.be/btrS9pfDYJY?si=dDX4tIs-oS2O2d2p