HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning

Tian, Chunlin; Shi, Zhan; Guo, Zhijiang; Li, Li; Xu, Chengzhong

Computer Science > Computation and Language

arXiv:2404.19245 (cs)

[Submitted on 30 Apr 2024 (v1), last revised 23 May 2024 (this version, v2)]

Title:HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning

Authors:Chunlin Tian, Zhan Shi, Zhijiang Guo, Li Li, Chengzhong Xu

View PDF HTML (experimental)

Abstract:Adapting Large Language Models (LLMs) to new tasks through fine-tuning has been made more efficient by the introduction of Parameter-Efficient Fine-Tuning (PEFT) techniques, such as LoRA. However, these methods often underperform compared to full fine-tuning, particularly in scenarios involving complex datasets. This issue becomes even more pronounced in complex domains, highlighting the need for improved PEFT approaches that can achieve better performance. Through a series of experiments, we have uncovered two critical insights that shed light on the training and parameter inefficiency of LoRA. Building on these insights, we have developed HydraLoRA, a LoRA framework with an asymmetric structure that eliminates the need for domain expertise. Our experiments demonstrate that HydraLoRA outperforms other PEFT approaches, even those that rely on domain knowledge during the training and inference phases.

Comments:	19 pages, 7 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2404.19245 [cs.CL]
	(or arXiv:2404.19245v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2404.19245

Submission history

From: Chunlin Tian [view email]
[v1] Tue, 30 Apr 2024 04:01:09 UTC (10,076 KB)
[v2] Thu, 23 May 2024 15:06:02 UTC (10,095 KB)

Computer Science > Computation and Language

Title:HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators