ACL2024
Chunk, Align, Select: A Simple Long-sequence Processing Method for Transformers
Jiawen Xie, Pengyu Cheng, Xiao Liang, Yong Dai, Nan Du
Abstract
Although dominant in natural language processing, transformer-based models remain challenged by the task of long-sequence processing,