ACL2024

Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts

Zhuo Chen, Xinyu Wang, Yong Jiang, Pengjun Xie, Fei Huang, Kewei Tu

被引用 4 次

摘要

In the era of large language models, applying techniques such as Retrieval Augmented Generation can better address Open-Domain Question-Answering problems. Due to constraints including model sizes and computing resources, the length of context is often limited, and it becomes challenging to empower the model to cover overlong contexts while answering questions from open domains. This paper proposes a general and convenient method to cover longer contexts in Open-Domain Question-Answering tasks. It leverages a small encoder and cross-attention mechanism and effectively encodes contexts. With our method, the original language models can cover several times longer contexts while keeping the computing requirements close to the baseline. Our experiments demonstrate that after finetuning, there is improved performance across two held-in datasets, four held-out datasets, and also in two In Context Learning settings. Our code will be released at https://github. com/Alibaba-NLP/Vec-RA-ODQA . * Corresponding author Knowledge Also one episode of "The Alvin Show" from the 1960s was released. Alvin and the Chipmunks Alvin and the Chipmunks, originally David Seville and the Chipmunks or simply the Chipmunks, is an American animated music group created by Ross Bagdasarian Sr. for a novelty record in 1958. The group consists of three singing animated anthropomorphic chipmunks: Alvin, the