[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3651671.3651725acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlcConference Proceedingsconference-collections
research-article

Chinese Named Entity Recognition in the Ship News Field Based on Adversarial Transfer Learning

Published: 07 June 2024 Publication History

Abstract

In natural language processing (NLP), named entity recognition (NER) is a preliminary task that aims to recognize predefined types of named entities from a given text sequence. The accuracy of NER has been significantly increased by deep learning. However, deep learning models rely heavily on annotated data while many specific fields, e.g., the ship news field, often lack of extensive annotated data, which makes the NER task much more challenging. In this paper, we propose a new NER model based on adversarial learning, which is composed of BERT, convolutional neural networks (CNN) and conditional random fields (CRF). In addition, the Chinese word segmentation (CWS) task or the NER task from different domains is introduced to increase the accuracy of NER in the domain through the transfer of task-shared information. Adversarial learning can fully exploit task-shared information and filter out noise. Experiments are conducted on three different public datasets, as well as on our own ship news dataset. According to the results, our model outperforms the state-of-the-art baselines and achieves good performance in the ship news field.

References

[1]
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, and Shengping Liu. 2018. Adversarial transfer learning for Chinese named entity recognition with self-attention mechanism. In Proceedings of the 2018 conference on empirical methods in natural language processing. 182–192.
[2]
Huanlei Chen, Weiwen Zhang, Lianglun Cheng, and Haiming Ye. 2022. Diverse and High-Quality Data Augmentation Using GPT for Named Entity Recognition. In International Conference on Neural Information Processing. Springer, 272–283.
[3]
Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. 2011. Natural language processing (almost) from scratch. Journal of machine learning research 12, ARTICLE (2011), 2493–2537.
[4]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).
[5]
Yaroslav Ganin and Victor Lempitsky. 2015. Unsupervised domain adaptation by backpropagation. In International conference on machine learning. PMLR, 1180–1189.
[6]
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. Advances in neural information processing systems 27 (2014).
[7]
Hangfeng He and Xu Sun. 2016. F-score driven max margin neural network for named entity recognition in Chinese social media. arXiv preprint arXiv:1611.04234 (2016).
[8]
Shulin Hu, Huajun Zhang, Xuesong Hu, and Jinfu Du. 2022. Chinese Named Entity Recognition based on BERT-CRF Model. In 2022 IEEE/ACIS 22nd International Conference on Computer and Information Science (ICIS). IEEE, 105–108.
[9]
John Lafferty, Andrew McCallum, and Fernando CN Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. (2001).
[10]
Guillaume Lample, Miguel Ballesteros, Sandeep Subramanian, Kazuya Kawakami, and Chris Dyer. 2016. Neural architectures for named entity recognition. arXiv preprint arXiv:1603.01360 (2016).
[11]
Yann LeCun, Bernhard Boser, John S Denker, Donnie Henderson, Richard E Howard, Wayne Hubbard, and Lawrence D Jackel. 1989. Backpropagation applied to handwritten zip code recognition. Neural computation 1, 4 (1989), 541–551.
[12]
Gina-Anne Levow. 2006. The third international Chinese language processing bakeoff: Word segmentation and named entity recognition. In Proceedings of the Fifth SIGHAN workshop on Chinese language processing. 108–117.
[13]
Xiaoya Li, Jingrong Feng, Yuxian Meng, Qinghong Han, Fei Wu, and Jiwei Li. 2019. A unified MRC framework for named entity recognition. arXiv preprint arXiv:1910.11476 (2019).
[14]
Xiaonan Li, Hang Yan, Xipeng Qiu, and Xuan-Jing Huang. 2020. FLAT: Chinese NER Using Flat-Lattice Transformer. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 6836–6842.
[15]
Tianyu Liu, Jin-Ge Yao, and Chin-Yew Lin. 2019. Towards improving neural named entity recognition with gazetteers. In Proceedings of the 57th annual meeting of the association for computational linguistics. 5301–5307.
[16]
Wei Liu, Tongge Xu, Qinghua Xu, Jiayu Song, and Yueran Zu. 2019. An encoding strategy based word-character LSTM for Chinese NER. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2379–2389.
[17]
Ruotian Ma, Minlong Peng, Qi Zhang, Zhongyu Wei, and Xuan-Jing Huang. 2020. Simplify the Usage of Lexicon in Chinese NER. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 5951–5960.
[18]
Sara Meftah, Nasredine Semmar, Mohamed-Ayoub Tahiri, Youssef Tamaazousti, Hassane Essafi, and Fatiha Sadat. 2020. Multi-task supervised pretraining for neural domain adaptation. In Proceedings of the Eighth International Workshop on Natural Language Processing for Social Media. 61–71.
[19]
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).
[20]
Nanyun Peng and Mark Dredze. 2016. Improving Named Entity Recognition for Chinese Social Media with Word Segmentation Representation Learning. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 149–155.
[21]
Sebastian Ruder and Barbara Plank. 2018. Strong Baselines for Neural Semi-Supervised Learning under Domain Shift. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 1044–1054.
[22]
Maobin Weng and Weiwen Zhang. 2023. Named Entity Recognition Based on BERT-BiLSTM-SPAN in Low Resource Scenarios. In 2023 15th International Conference on Computer Research and Development (ICCRD). IEEE, 32–37.
[23]
Yao Xiao, Jingbo Peng, Luoyi Fu, and Haisong Zhang. 2021. Bag of Tricks for Chinese Named Entity Recognition. In 2021 International Joint Conference on Neural Networks (IJCNN). IEEE, 1–8.
[24]
Yue Zhang and Jie Yang. 2018. Chinese NER Using Lattice LSTM. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 1554–1564.
[25]
Joey Tianyi Zhou, Hao Zhang, Di Jin, and Xi Peng. 2019. Dual adversarial transfer for sequence labeling. IEEE transactions on pattern analysis and machine intelligence 43, 2 (2019), 434–446.

Index Terms

  1. Chinese Named Entity Recognition in the Ship News Field Based on Adversarial Transfer Learning

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICMLC '24: Proceedings of the 2024 16th International Conference on Machine Learning and Computing
    February 2024
    757 pages
    ISBN:9798400709234
    DOI:10.1145/3651671
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 June 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Adversarial learning
    2. BERT
    3. Chinese NER
    4. Ship news

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICMLC 2024

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 17
      Total Downloads
    • Downloads (Last 12 months)17
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 11 Dec 2024

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media