ICLR2026

BoRA: Towards More Expressive Low-Rank Adaptation with Block Diversity

Shiwei Li, Xiandi Luo, Haozhao Wang, Xing Tang, Ziqiang Cui, Dugang Liu, Yuhua Li, Yichen Li, Xiuqiang He, Ruixuan Li

5 citations

Abstract

Low-rank adaptation (LoRA) is a parameter-efficient fine-tuning (PEFT) method widely used in large language models (LLMs). It approximates the update of a pretrained weight matrix WRm×nW\in\mathbb{R}^{m\times n} by the product of two low-rank matrices, BABA, where ARr×nA \in\mathbb{R}^{r\times n} and BRm×r(rmin{m,n})B\in\mathbb{R}^{m\times r} (r\ll\min\{m,n\}). Increasing the dimension rr can raise the rank of LoRA weights (i.e., BABA), which typically improves fine-tuning performance but also significantly increases the number of trainable parameters. In this paper, we propose Block Diversified Low-Rank Adaptation (BoRA), which improves the rank of LoRA weights with a small number of additional parameters. Specifically, BoRA treats the product BABA as a block matrix multiplication, where AA and BB are partitioned into bb blocks along the columns and rows, respectively (i.e., A=[A1,,Ab]A=[A_1,\dots,A_b] and B=[B1,,Bb]B=[B_1,\dots,B_b]^\top). Consequently, the product BABA becomes the concatenation of the block products BiAjB_iA_j for i,j[b]i,j\in[b]. To enhance the diversity of different block products, BoRA introduces a unique diagonal matrix Σi,jRr×r\Sigma_{i,j} \in \mathbb{R}^{r\times r} for each block multiplication, resulting in BiΣi,jAjB_i \Sigma_{i,j} A_j. By leveraging these block-wise diagonal matrices, BoRA increases the rank of LoRA weights by a factor of bb while only requiring b2rb^2r additional parameters. Extensive experiments across multiple datasets and models demonstrate the superiority of BoRA, and ablation studies further validate its scalability.