Abstract
When extracting entity relations from unstructured Chinese texts, it is necessary to address the problems of relation overlap and entity overlap that may occur within a single sentence. Current mainstream entity relation extraction strategies mostly focus on the influence of syntactic structure information, while ignoring the potential relationships between the syntactic structure of the text and overlapping entity relation triples. A model based on the Syntactic Dependency Tree and Multi-dimensional Corner Marking strategy (SDT-MCM) is proposed for overlapping entity relation extraction in this paper. Firstly, SDT-MCM constructs a character adjacency graph based on its syntactic dependency tree. And graph convolutional neural networks (GCNs) are utilized to extract syntactic features from the character adjacency graph. These are then combined with the semantic features extracted by BERT to form joint features. A novel multidimensional corner marking strategy is introduced to annotate overlap situations. Finally, entities and relations are jointly extracted using a score-based classifier. To validate the effectiveness of SDT-MCM, experiments on five public datasets including DuIE2.0, CMeIE, FinRE, SanWen, and CHIP2022 were conducted for entity relation extraction and overlap entity-relation comparison. SDT-MCM achieved F1 scores of 75.25%, 45.30%, and 36.42% on DuIE2.0, FinRE, and SanWen datasets respectively, outperforming all the compared models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Feng, J., Zhang, T., Hang, T.T.: Survey of overlapping entities and relations extraction. Comput. Eng. Appli. 58(1), 1–11 (2022)
Liu, X., Ning, B., Wang, M., et al.: Fine-grained sentiment triplet extraction method based on syntactic enhancement. J. Comput. Res. Develop. 60(7), 1649–1660 (2023)
Zhang, X., Guo, R., Huang, D.: Named entity recognition based on dependency. In: J. Chin. Inform. Process. 35(6) 11 (2021)
Feng, J., Zhang, T., Hang, T.T.: Survey of overlapping entities and relations extraction. Comput. Eng. Appli. 58(1), 1–11 (2022)
Wei, Z., Su, J., Wang, Y., et al.: A novel cascade binary tagging framework for relational triple extraction. arXiv preprint arXiv:1909.03227, (2019)
Wang, Y., Yu, B., Zhang, Y, et al.: TPLinker: single-stage joint extraction of entities and relations through token pair linking. arXiv preprint arXiv:2010.13415, (2020)
Shang, Y-M., Huang, H., Mao, X.: Onerel: joint entity and relation extraction with one module in one step. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 6(10), pp. 11285–11293 (2022)
Tian, Y., Chen, G., Song, Y., et al.: Dependency-driven relation extraction with attentive graph convolutional networks. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp: 4458–4471, (2021)
Hao, Z., Liu C., Qin, L.: Entity and relationship joint extraction model of food inspection announcement based on Chinese character adjacency graph. Trans. Chin. Soc. Agricult. Eng. (Trans. CSAE) 39(14), 283–292 (2023)
Cui, Y., Che, W., Liu, T., et al.: Pre-training with whole word masking for chinese bert. IEEE/ACM Trans. Audio, Speech Lang. Process. 29, 3504–3514 (2021)
Vasiliev, Y.: Natural language processing with Python and spaCy: A practical introduction (2020)
Li, S., He, W., Shi, Y., et al.: Duie: a large-scale chinese dataset for information extraction. In: natural Language Processing and Chinese Computing: 8th CCF International Conference, NLPCC 2019, Dunhuang, China, 9–14 October 2019, Proceedings, Part II 8, pp:791–800 (2019)
Guan, T., Zan, H., Zhou, X., et al.: CMeIE: construction and evaluation of Chinese medical information extraction dataset. In: Natural Language Processing and Chinese Computing: 9th CCF International Conference, NLPCC 2020, Zhengzhou, China, 14--18 October 2020, Proceedings, Part I 9, pp: 270--282 (2020)
Gan, Z., Zan, H., Guan, T., et al.: Overview of CHIP 2020 shared task 2: entity and relation extraction in Chinese medical text. J. Chin. Inform. Process. 36(6), 101–108 (2022)
Rajpoot, P.K., Parikh, A.: Gpt-finre: in-context learning for financial relation extraction using large language models. arXiv preprint arXiv:2306.17519, (2023)
Xu, J., Wen, J., Sun, X., et al.: A discourse-level named entity recognition and relation extraction dataset for chinese literature text. arXiv preprint arXiv:1711.07010, (2017)
Fu, T J., Li, Peng H., Ma, W. Y.: GraphRel: modeling Text as Relational Graphs for Joint Entity and Relation Extraction. Meeting of the Association for Computational Linguistics (2019)
Acknowledgement
This paper is the results of the research project funded by the China Fundamental Research Funds for the Central Universities (No:2662022XXYJ001, 2662022JC004, 2662023XXPY005). Partly supported by the open funds of the National Key Laboratory of Crop Genetic Improvement (ZK202205).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Liu, X. et al. (2024). Overlapping Entity Relation Extraction Based on Syntactic Dependency Tree and Multi-dimensional Corner Marking Strategy. In: Huang, DS., Si, Z., Zhang, Q. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2024. Lecture Notes in Computer Science(), vol 14877. Springer, Singapore. https://doi.org/10.1007/978-981-97-5669-8_18
Download citation
DOI: https://doi.org/10.1007/978-981-97-5669-8_18
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-5668-1
Online ISBN: 978-981-97-5669-8
eBook Packages: Computer ScienceComputer Science (R0)