Computer Science > Computation and Language

arXiv:2309.14393 (cs)

[Submitted on 25 Sep 2023 (v1), last revised 19 Jan 2024 (this version, v2)]

Title:LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models

Authors:Ahmad Faiz, Sotaro Kaneda, Ruhan Wang, Rita Osi, Prateek Sharma, Fan Chen, Lei Jiang

Abstract:The carbon footprint associated with large language models (LLMs) is a significant concern, encompassing emissions from their training, inference, experimentation, and storage processes, including operational and embodied carbon emissions. An essential aspect is accurately estimating the carbon impact of emerging LLMs even before their training, which heavily relies on GPU usage. Existing studies have reported the carbon footprint of LLM training, but only one tool, mlco2, can predict the carbon footprint of new neural networks prior to physical training. However, mlco2 has several serious limitations. It cannot extend its estimation to dense or mixture-of-experts (MoE) LLMs, disregards critical architectural parameters, focuses solely on GPUs, and cannot model embodied carbon footprints. Addressing these gaps, we introduce \textit{\carb}, an end-to-end carbon footprint projection model designed for both dense and MoE LLMs. Compared to mlco2, \carb~significantly enhances the accuracy of carbon footprint estimations for various LLMs. The source code is released at \url{this https URL}.

Comments:	15 pages, 8 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
Cite as:	arXiv:2309.14393 [cs.CL]
	(or arXiv:2309.14393v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2309.14393
Journal reference:	published in ICLR2024

Submission history

From: Lei Jiang [view email]
[v1] Mon, 25 Sep 2023 14:50:04 UTC (710 KB)
[v2] Fri, 19 Jan 2024 17:33:44 UTC (694 KB)

Computer Science > Computation and Language

Title:LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators