Computer Science > Computation and Language

arXiv:2410.13025 (cs)

[Submitted on 16 Oct 2024 (v1), last revised 2 Dec 2024 (this version, v2)]

Title:LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks

Authors:Akshara Prabhakar, Yuanzhi Li, Karthik Narasimhan, Sham Kakade, Eran Malach, Samy Jelassi

Abstract:Low-Rank Adaptation (LoRA) is a popular technique for parameter-efficient fine-tuning of Large Language Models (LLMs). We study how different LoRA modules can be merged to achieve skill composition -- testing the performance of the merged model on a target task that involves combining multiple skills, each skill coming from a single LoRA. This setup is favorable when it is difficult to obtain training data for the target task and when it can be decomposed into multiple skills. First, we identify practically occurring use-cases that can be studied under the realm of skill composition, e.g. solving hard math-word problems with code, creating a bot to answer questions on proprietary manuals or about domain-specialized corpora. Our main contribution is to show that concatenation of LoRAs (CAT), which optimally weights LoRAs that were individually trained on different skills, outperforms existing model- and data- merging techniques; for instance on math-word problems, CAT beats these methods by an average of 43% and 12% respectively. Thus, this paper advocates model merging as an efficient way to solve compositional tasks and underscores CAT as a simple, compute-friendly and effective procedure. To our knowledge, this is the first work demonstrating the superiority of model merging over data mixing for binary skill composition tasks. Code and data are available at this https URL

Comments:	COLING 2025 Industry track; 9 pages plus references and appendices
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2410.13025 [cs.CL]
	(or arXiv:2410.13025v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.13025

Submission history

From: Akshara Prabhakar [view email]
[v1] Wed, 16 Oct 2024 20:33:06 UTC (4,895 KB)
[v2] Mon, 2 Dec 2024 06:40:50 UTC (4,895 KB)

Computer Science > Computation and Language

Title:LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators