Computer Science > Computation and Language

arXiv:2401.05669 (cs)

[Submitted on 11 Jan 2024]

Title:ConcEPT: Concept-Enhanced Pre-Training for Language Models

Authors:Xintao Wang, Zhouhong Gu, Jiaqing Liang, Dakuan Lu, Yanghua Xiao, Wei Wang

Abstract:Pre-trained language models (PLMs) have been prevailing in state-of-the-art methods for natural language processing, and knowledge-enhanced PLMs are further proposed to promote model performance in knowledge-intensive tasks. However, conceptual knowledge, one essential kind of knowledge for human cognition, still remains understudied in this line of research. This limits PLMs' performance in scenarios requiring human-like cognition, such as understanding long-tail entities with concepts. In this paper, we propose ConcEPT, which stands for Concept-Enhanced Pre-Training for language models, to infuse conceptual knowledge into PLMs. ConcEPT exploits external taxonomies with entity concept prediction, a novel pre-training objective to predict the concepts of entities mentioned in the pre-training contexts. Unlike previous concept-enhanced methods, ConcEPT can be readily adapted to various downstream applications without entity linking or concept mapping. Results of extensive experiments show the effectiveness of ConcEPT in four tasks such as entity typing, which validates that our model gains improved conceptual knowledge with concept-enhanced pre-training.

Comments:	12pages. Work completed in 2023.01
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2401.05669 [cs.CL]
	(or arXiv:2401.05669v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2401.05669

Submission history

From: Xintao Wang [view email]
[v1] Thu, 11 Jan 2024 05:05:01 UTC (7,677 KB)

Computer Science > Computation and Language

Title:ConcEPT: Concept-Enhanced Pre-Training for Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:ConcEPT: Concept-Enhanced Pre-Training for Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators