Computer Science > Computation and Language

arXiv:2305.05627 (cs)

[Submitted on 9 May 2023]

Title: An Exploration of Encoder-Decoder Approaches to Multi-Label Classification for Legal and Biomedical Text

Authors: Yova Kementchedjhieva, Ilias Chalkidis

Abstract: Standard methods for multi-label text classification largely rely on encoder-only pre-trained language models, whereas encoder-decoder models have proven more effective in other classification tasks. In this study, we compare four methods for multi-label classification, two based on an encoder only, and two based on an encoder-decoder. We carry out experiments on four datasets -- two in the legal domain and two in the biomedical domain, each with two levels of label granularity -- and always start from the same pre-trained model, T5. Our results show that encoder-decoder methods outperform encoder-only methods, with a growing advantage on more complex datasets and labeling schemes of finer granularity. Using encoder-decoder models in a non-autoregressive fashion, in particular, yields the best performance overall, so we further study this approach through ablations to better understand its strengths.

Comments:	9 pages, long paper at ACL 2023 Findings
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2305.05627 [cs.CL]
	(or arXiv:2305.05627v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.05627

Submission history

From: Ilias Chalkidis
[v1] Tue, 9 May 2023 17:13:53 UTC (6,630 KB)
