ACL 2025
FlashBack: Efficient Retrieval-Augmented Language Modeling for Fast Inference
Runheng Liu, Xingchen Xiao, Heyan Huang, Zewen Chi, Zhijing Wu