[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3548608.3559300acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccirConference Proceedingsconference-collections
research-article

Chinese Named Entity Recognition Method Based On Multi-head Attention Enhancing Word Information

Published: 14 October 2022 Publication History

Abstract

Chinese named entity recognition (CNER) is one of the important tasks in natural language processing. Unlike the English, Chinese lacks explicit word boundaries. Therefore, many models were designed to address this issue by incorporating word lexicon information into the CNER. However, lots of irrelevant information may be included when matching the entire lexicon for each character. Inspired by the SoftLexicon method, we propose a multi-head attention based model to simplify the introduced lexicon information to generate word-level attention vector. In this method, a word vector matched for each character is first obtained and further weighted by the relevance with the character-level vector to calculate the word-level attention vector. In this way, only the words existing in the sentence are matched, which reduces the scope of word matching. The effectiveness of this method is verified on multiple Chinese datasets.

References

[1]
Z. H. Huang, W. Xu, K. Yu, (2015) Bidirectional LSTM-CRF models for sequence tagging. J. arXiv preprint arXiv:1508.01991.
[2]
Q. Cai. (2019) Research on Chinese Naming Recognition Model Based on BERT Embedding . In: IEEE 10th International Conference on Software Engineering and Service Science (ICSESS), 2019, pp. 1-4.
[3]
N. Li, H. M. Guan, P Yang, W. Y. Dong. (2020) BERT-IDCNN-CRF for named entity recognition in Chinese. J. Journal of Shandong University(Natural Science), v.55(1): 106–113.(in chinese)
[4]
Y. Zhang, J Yang. (2018) Chinese ner using lattice lstm. J. arXivpreprint arXiv:1805.02023.
[5]
W. Liu, T. Xu, Q Xu, J. Y. song, Y. R. Zu. (2019) An encoding strategy based word-character LSTM for Chinese NER. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Minneapolis, Minnesota pp. 2379-2389.
[6]
T. Gui, R. Ma, Q. Zhang, L. Zhao, Y.G. Jiang, X Huang. (2019) CNN-Based Chinese NER with Lexicon Rethinking. In: IJCAI. Macao pp.4982-4988.
[7]
R. X. Ding, P. J. Xie, X. Y. Zhang, W. Lu, L. Li, L. Si. (2019) A neural multi-digraph model for Chinese NER with gazetteers. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Florence, Italy pp: 1462-1467.
[8]
X. Li, H. Yan, X. Qiu, X. Huang. (2020) FLAT: Chinese NER using flat-lattice transformer. J. arXiv preprint arXiv:2004.11795.
[9]
D. D. Zhao, D. G. Huang, J. N. Meng, F. Gu, P. Zhang. (2021) Chinese Named Entity Recognition Based on multi head attention and word fusion. J. Computer Engineering and Applications: 1-9. (in chinese)
[10]
R. Ma, M. Peng, Q. Zhang, X. Huang. (2019) Simplify the usage of lexicon in Chinese NER. J. arXiv preprint arXiv:1908.05969.
[11]
S. S. Zhong, X. Cheng, M. H. Zhao, Y. K. Zhang. Chinese named entity recognition method for introducing word set-level attention mechanism. J. Journal of Jilin University, (Engineering and Technology Edition) :1-7. (in chinese)
[12]
J. Devlin, M. W. Chang, K. Lee, K. Toutanova. (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. J. arXiv preprint arXiv:1810.04805

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICCIR '22: Proceedings of the 2022 2nd International Conference on Control and Intelligent Robotics
June 2022
905 pages
ISBN:9781450397179
DOI:10.1145/3548608
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 October 2022

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICCIR 2022

Acceptance Rates

Overall Acceptance Rate 131 of 239 submissions, 55%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 30
    Total Downloads
  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 28 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media