ACL 2025

FlashBack: Efficient Retrieval-Augmented Language Modeling for Fast Inference

Runheng Liu, Xingchen Xiao, Heyan Huang, Zewen Chi, Zhijing Wu

Abstract
