VLDB2025
Alphabets, Grammars, Calculators, and the End of Hand-Crafted Systems
Stratos Idreos
被引用 1 次
摘要
The AI revolution is transforming every scientific field and business sector, driving an unprecedented demand for data-centric computation. As new data types, hardware platforms, and workloads appear faster than ever before, the backbone systems that power this revolution must evolve just as quickly. Yet a single system architecture, whether tuned for computing analytics, generative AI, or machine learning, faces a design space larger than 10 100 alternatives, and we still cling to a handful of "good" templates that each require years of manual design and implementation tuning. It is time to abandon this artisanal practice and embrace self-designing systems: systems that can reason about and refactor their own architecture. We show that by modeling the design space of systems as an alphabet of low-level design primitives and whole architectures as sentences in a grammar over that alphabet, "systems calculators" can now synthesize fresh systems blueprints on demand. The Data Calculator explores trillions of previously unknown data-structure variants to pick an optimal layout; Cosine and Limousine generate novel NoSQL stores that run up to three orders of magnitude faster than today's best deployments; the Image Calculator co-designs entirely new storage formats and neural networks to speed vision pipelines by 10; and LegoAI and TorchTitan invent novel distributed training algorithms for large AI models that extract every flop and byte from modern accelerators. These results signal a future in which systems research increasingly focuses on crafting richer alphabets and grammars while machines write the sentences, freeing designers and researchers to pursue more profound questions and enabling practitioners to dial in cost, latency, and accuracy with surgical precision.