Computer Science > Computation and Language

arXiv:2410.07490 (cs)

[Submitted on 9 Oct 2024]

Title:MoDEM: Mixture of Domain Expert Models

Authors:Toby Simonds, Kemal Kurniawan, Jey Han Lau

Abstract:We propose a novel approach to enhancing the performance and efficiency of large language models (LLMs) by combining domain prompt routing with domain-specialized models. We introduce a system that utilizes a BERT-based router to direct incoming prompts to the most appropriate domain expert model. These expert models are specifically tuned for domains such as health, mathematics and science. Our research demonstrates that this approach can significantly outperform general-purpose models of comparable size, leading to a superior performance-to-cost ratio across various benchmarks. The implications of this study suggest a potential paradigm shift in LLM development and deployment. Rather than focusing solely on creating increasingly large, general-purpose models, the future of AI may lie in developing ecosystems of smaller, highly specialized models coupled with sophisticated routing systems. This approach could lead to more efficient resource utilization, reduced computational costs, and superior overall performance.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2410.07490 [cs.CL]
	(or arXiv:2410.07490v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.07490

Submission history

From: Toby Simonds [view email]
[v1] Wed, 9 Oct 2024 23:52:54 UTC (339 KB)

Computer Science > Computation and Language

Title:MoDEM: Mixture of Domain Expert Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:MoDEM: Mixture of Domain Expert Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators