Computer Science > Computation and Language

arXiv:2305.11442 (cs)

[Submitted on 19 May 2023 (v1), last revised 25 May 2023 (this version, v2)]

Title:Zero-Shot Text Classification via Self-Supervised Tuning

Authors:Chaoqun Liu, Wenxuan Zhang, Guizhen Chen, Xiaobao Wu, Anh Tuan Luu, Chip Hong Chang, Lidong Bing

View PDF

Abstract:Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data, called self-supervised tuning. By exploring the inherent structure of free texts, we propose a new learning objective called first sentence prediction to bridge the gap between unlabeled data and text classification tasks. After tuning the model to learn to predict the first sentence in a paragraph based on the rest, the model is able to conduct zero-shot inference on unseen tasks such as topic classification and sentiment analysis. Experimental results show that our model outperforms the state-of-the-art baselines on 7 out of 10 tasks. Moreover, the analysis reveals that our model is less sensitive to the prompt design. Our code and pre-trained models are publicly available at this https URL .

Comments:	Accepted to the Findings of ACL 2023
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2305.11442 [cs.CL]
	(or arXiv:2305.11442v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.11442

Submission history

From: Chaoqun Liu [view email]
[v1] Fri, 19 May 2023 05:47:33 UTC (7,581 KB)
[v2] Thu, 25 May 2023 06:10:04 UTC (7,581 KB)

Computer Science > Computation and Language

Title:Zero-Shot Text Classification via Self-Supervised Tuning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Zero-Shot Text Classification via Self-Supervised Tuning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators