[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3421515.3421534acmotherconferencesArticle/Chapter ViewAbstractPublication PagessspsConference Proceedingsconference-collections
research-article

An Improved Chinese Named Entity Recognition Method with TB-LSTM-CRF

Published: 17 December 2020 Publication History

Abstract

Owing to the lack of natural delimiters, Chinese named entity recognition (NER) is more challenging than it in English. While Chinese word segmentation (CWS) is generally regarded as key and open problem for Chinese NER, its accuracy is critical for the downstream models trainings and it also often suffers from out-of-vocabulary (OOV). In this paper, we propose an improved Chinese NER model called TB-LSTM-CRF, which introduces a Transformer Block on top of LSTM-CRF. The proposed model with Transformer Block exploits the self-attention mechanism to capture the information from adjacent characters and sentence contexts. It is more practical with using small-size character embeddings. Compared with the baseline using LSTM-CRF, experiment results show our method TB-LSTM-CRF is competitive without the need of any external resources, for instance other dictionaries.

References

[1]
Chiu, J. P., & Nichols, E. (2016). Named entity recognition with bidirectional LSTM-CNNs. Transactions of the Association for Computational Linguistics, 4, 357-370.
[2]
Liu, L., Shang, J., Ren, X., Xu, F. F., Gui, H., Peng, J., & Han, J. (2018, April). Empower sequence labeling with task-aware neural language model. In Thirty-Second AAAI Conference on Artificial Intelligence.
[3]
Huang, S., Sun, X., & Wang, H. (2017, November). Addressing domain adaptation for chinese word segmentation with global recurrent structure. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers) (pp. 184-193).
[4]
Chen, X., Shi, Z., Qiu, X., & Huang, X. (2017). Adversarial multi-criteria learning for chinese word segmentation. arXiv preprint arXiv:1704.07556.
[5]
Liu, Z., Zhu, C., & Zhao, T. (2010, August). Chinese named entity recognition with a sequence labeling approach: based on characters, or based on words?. In International Conference on Intelligent Computing (pp. 634-640). Springer, Berlin, Heidelberg.
[6]
Li, H., Hagiwara, M., Li, Q., & Ji, H. (2014, May). Comparison of the Impact of Word Segmentation on Name Tagging for Chinese and Japanese. In LREC (pp. 2532-2536).
[7]
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. In Advances in neural information processing systems (pp. 5998-6008).
[8]
Ratinov, L., & Roth, D. (2009, June). Design challenges and misconceptions in named entity recognition. In Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL-2009) (pp. 147-155).
[9]
Weischedel, R., Pradhan, S., Ramshaw, L., Palmer, M., Xue, N., Marcus, M., ... & Houston, A. (2011). OntoNotes Release 4.0. LDC2011T03, Philadelphia, Penn.: Linguistic Data Consortium.
[10]
Peng, N., & Dredze, M. (2015, September). Named entity recognition for chinese social media with jointly trained embeddings. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (pp. 548-554).
[11]
Zhang, Y., & Yang, J. (2018). Chinese ner using lattice lstm. arXiv preprint arXiv:1805.02023.
[12]
Yang, J., Teng, Z., Zhang, M., & Zhang, Y. (2016, April). Combining discrete and neural features for sequence labeling. In International Conference on Intelligent Text Processing and Computational Linguistics (pp. 140-154). Springer, Cham.
[13]
Zhu, Y., & Wang, G. (2019, June). CAN-NER: Convolutional attention network for Chinese named entity recognition. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) (pp. 3384-3393).
[14]
Peng, N., & Dredze, M. (2016). Improving named entity recognition for chinese social media with word segmentation representation learning. arXiv preprint arXiv:1603.00786.

Cited By

View all
  • (2023)Intelligent Tool-Wear Prediction Based on Informer Encoder and Bi-Directional Long Short-Term MemoryMachines10.3390/machines1101009411:1(94)Online publication date: 11-Jan-2023
  • (2022)Research on Knowledge Graph Construction of Early Stage Auxiliary Decision-making System for Power Grid Planning Projects2022 IEEE 2nd International Conference on Electronic Technology, Communication and Information (ICETCI)10.1109/ICETCI55101.2022.9832288(412-416)Online publication date: 27-May-2022

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
SSPS '20: Proceedings of the 2020 2nd Symposium on Signal Processing Systems
July 2020
125 pages
ISBN:9781450388627
DOI:10.1145/3421515
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 December 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Chinese Named Entity Recognition
  2. Chinese Natural Language Processing
  3. Knowledge Graph
  4. LSTM-CRF
  5. Transformer Block

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

SSPS 2020

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)1
Reflects downloads up to 10 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Intelligent Tool-Wear Prediction Based on Informer Encoder and Bi-Directional Long Short-Term MemoryMachines10.3390/machines1101009411:1(94)Online publication date: 11-Jan-2023
  • (2022)Research on Knowledge Graph Construction of Early Stage Auxiliary Decision-making System for Power Grid Planning Projects2022 IEEE 2nd International Conference on Electronic Technology, Communication and Information (ICETCI)10.1109/ICETCI55101.2022.9832288(412-416)Online publication date: 27-May-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media