ACL2024

Chunk, Align, Select: A Simple Long-sequence Processing Method for Transformers

Jiawen Xie, Pengyu Cheng, Xiao Liang, Yong Dai, Nan Du

摘要

Although dominant in natural language processing, transformer-based models remain challenged by the task of long-sequence processing,