Computer Science > Computation and Language

arXiv:2406.10868 (cs)

[Submitted on 16 Jun 2024 (v1), last revised 19 Dec 2024 (this version, v4)]

Title: Identifying Query-Relevant Neurons in Large Language Models for Long-Form Texts

Authors: Lihu Chen, Adam Dejl, Francesca Toni

Abstract: Large Language Models (LLMs) possess vast amounts of knowledge within their parameters, prompting research into methods for locating and editing this knowledge. Previous work has largely focused on locating entity-related (often single-token) facts in smaller models. However, several key questions remain unanswered: (1) How can we effectively locate query-relevant neurons in decoder-only LLMs, such as Llama and Mistral? (2) How can we address the challenge of long-form (or free-form) text generation? (3) Are there localized knowledge regions in LLMs? In this study, we introduce Query-Relevant Neuron Cluster Attribution (QRNCA), a novel architecture-agnostic framework capable of identifying query-relevant neurons in LLMs. QRNCA allows for the examination of long-form answers beyond triplet facts by employing the proxy task of multi-choice question answering. To evaluate the effectiveness of our detected neurons, we build two multi-choice QA datasets spanning diverse domains and languages. Empirical evaluations demonstrate that our method outperforms baseline methods significantly. Further, analysis of neuron distributions reveals the presence of visible localized regions, particularly within different domains. Finally, we show potential applications of our detected neurons in knowledge editing and neuron-based prediction.

Comments:	AAAI 2025 Main Track
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2406.10868 [cs.CL]
	(or arXiv:2406.10868v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.10868

Submission history

From: Lihu Chen
[v1] Sun, 16 Jun 2024 09:36:32 UTC (416 KB)
[v2] Mon, 19 Aug 2024 09:46:39 UTC (1,687 KB)
[v3] Tue, 20 Aug 2024 09:25:23 UTC (1,687 KB)
[v4] Thu, 19 Dec 2024 16:22:07 UTC (1,691 KB)
