ICLR2026
BoRA: Towards More Expressive Low-Rank Adaptation with Block Diversity
Shiwei Li, Xiandi Luo, Haozhao Wang, Xing Tang, Ziqiang Cui, Dugang Liu, Yuhua Li, Yichen Li, Xiuqiang He, Ruixuan Li
5 citations
Abstract
Low-rank adaptation (LoRA) is a parameter-efficient fine-tuning (PEFT) method widely used in large language models (LLMs). It approximates the update of a pretrained weight matrix by the product of two low-rank matrices, , where and . Increasing the dimension can raise the rank of LoRA weights (i.e., ), which typically improves fine-tuning performance but also significantly increases the number of trainable parameters. In this paper, we propose Block Diversified Low-Rank Adaptation (BoRA), which improves the rank of LoRA weights with a small number of additional parameters. Specifically, BoRA treats the product as a block matrix multiplication, where and are partitioned into blocks along the columns and rows, respectively (i.e., and ). Consequently, the product becomes the concatenation of the block products for . To enhance the diversity of different block products, BoRA introduces a unique diagonal matrix for each block multiplication, resulting in . By leveraging these block-wise diagonal matrices, BoRA increases the rank of LoRA weights by a factor of while only requiring additional parameters. Extensive experiments across multiple datasets and models demonstrate the superiority of BoRA, and ablation studies further validate its scalability.