Computer Science > Computation and Language

arXiv:2408.08803 (cs)

[Submitted on 16 Aug 2024 (v1), last revised 19 Sep 2024 (this version, v2)]

Title:FourierKAN outperforms MLP on Text Classification Head Fine-tuning

Authors:Abdullah Al Imran, Md Farhan Ishmam

Abstract:In resource constraint settings, adaptation to downstream classification tasks involves fine-tuning the final layer of a classifier (i.e. classification head) while keeping rest of the model weights frozen. Multi-Layer Perceptron (MLP) heads fine-tuned with pre-trained transformer backbones have long been the de facto standard for text classification head fine-tuning. However, the fixed non-linearity of MLPs often struggles to fully capture the nuances of contextual embeddings produced by pre-trained models, while also being computationally expensive. In our work, we investigate the efficacy of KAN and its variant, Fourier KAN (FR-KAN), as alternative text classification heads. Our experiments reveal that FR-KAN significantly outperforms MLPs with an average improvement of 10% in accuracy and 11% in F1-score across seven pre-trained transformer models and four text classification tasks. Beyond performance gains, FR-KAN is more computationally efficient and trains faster with fewer parameters. These results underscore the potential of FR-KAN to serve as a lightweight classification head, with broader implications for advancing other Natural Language Processing (NLP) tasks.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2408.08803 [cs.CL]
	(or arXiv:2408.08803v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2408.08803

Submission history

From: Md Farhan Ishmam [view email]
[v1] Fri, 16 Aug 2024 15:28:02 UTC (125 KB)
[v2] Thu, 19 Sep 2024 14:18:59 UTC (170 KB)

Computer Science > Computation and Language

Title:FourierKAN outperforms MLP on Text Classification Head Fine-tuning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:FourierKAN outperforms MLP on Text Classification Head Fine-tuning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators