[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3357254.3357283acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaiprConference Proceedingsconference-collections
research-article

Chinese named entity recognition in power domain based on Bi-LSTM-CRF

Published: 16 August 2019 Publication History

Abstract

Efficient recognition of proprietary entities is an important basic work for text data mining and intelligent application in power domain. Traditional power domain Named Entity Recognition (NER) methods rely on feature engineering seriously, which unable to learn power entity features automatically. In order to learn entity features automatically and extract power domain named entities efficiently, a model based on Bidirectional Long Short-Term Memory Neural Networks (Bi-LSTM) and Conditional Random Field (CRF) was proposed in this paper. Word representations were fed into the neural networks as an additional feature and Skip-gram embeddings were trained on power domain corpus. Experimental results showed the precision rate reaches higher than 88.25% and the recalling rate reaches higher than 88.04%, which confirm the method based on Bi-LSTM and CRF is effective for named entity recognition in the power domain.

References

[1]
Qiu J. Research on Chinese textual data mining techniques and reliability applications in power system[D]. Hangzhou, China: Zhejiang University, 2016.
[2]
Li D.P, Chen Z.Y, et al. A wide area service oriented architecture design for plug and play of power grid equipment[J]. Procedia Computer Science, 2018, 129:353--357.
[3]
Chen Z.Y, Li D.P, et al. The application of power grid equipment plug and play based on wide area SOA[C]// IEEE International Conference on Energy Internet. 2018:19--23.
[4]
Wang M, Chang N.C, Wang L, Chen Z.Y, et al. Architecture of Network Analysis Service Based on Dispatching and Control Cloud[J]. Power System Technology, 2018, 42(8):2659--2665.
[5]
Gao X.Y, Chen Z.Y, Tang S, et al. Adaptive weighted imbalance learning with application to abnormal activity recognition[J]. Neurocomputing, 2016, 173(3):1927--1935.
[6]
Zhao T, Zhang Y, Zhang D. Application technology of big data in smart distribution grid and its prospect analysis[J]. Power System Technology, 2014, 38(12):3305--3312.
[7]
Cao J, Chen L.S, Qiu J, et al. Semantic Framework-Based Defect Text Mining Technique and Application in Power Grid[J]. Power System Technology, 2017, 41(2):637--643.
[8]
Klinger R, Kolarik C, Fluck J, et al. Detection of IUPAC and IUPAC-like chemical names[J]. Bioinformatics, 2008, 24(13):i268--i276.
[9]
Zhu F, Shen B.R, Bourdon J. Combined SVM-CRFs for Biological Named Entity Recognition with Maximal Bidirectional Squeezing[J]. PLoS ONE, 2012, 7(6):e39230-.
[10]
Qin Y, Zeng Y.F. Research of Clinical Named Entity Recognition Based on Bi-LSTM-CRF[J]. Journal of Shanghai Jiaotong University(Science), 2018, v.23(03):58--63.
[11]
Xu Y, Wang Y, Liu T, et al. Joint segmentation and named entity recognition using dual decomposition in Chinese discharge summaries[J]. Journal of the American Medical Informatics Association, 2014, 21(e1):e84--e92.
[12]
Collobert R, Weston J, Bottou L, et al. Natural language processing (almost) from scratch.[J]. Journal of Machine Learning Research, 2011, 12(1):2493--2537.
[13]
Peng N, Dredze M. Improving Named Entity Recognition for Chinese Social Media with Word Segmentation Representation Learning[C]// Meeting of the Association for Computational Linguistics. 2016.
[14]
Du X.M, Qin J.F, Guo S.Y, et al. Text Mining of Typical Defects in Power Equipment[J]. High Voltage Engineering, 2018, 44(4): 1078--1084.
[15]
Xu J.J, Sun X, He H.F, et al. Cross-Domain and Semi-Supervised Named Entity Recognition in Chinese Social Media: A Unified Model[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2018:1--1.
[16]
Ma X.Z, Hovy E. End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF[J]. Association for Computational Linguistics (ACL), 2016, p 1064--1074.
[17]
Zhao H, Wang F. A Deep Learning Model and Self-Training Algorithm for Theoretical Terms Extraction[J]. Journal of the China Society for Scientific and Technical Information, 2018, 37(09):67--82.
[18]
Sundermeyer M, Ney H, Schluter R. From Feedforward to Recurrent LSTM Neural Networks for Language Modeling[J]. IEEE/ACM Transactions on Audio Speech & Language Processing, 2015, 23(3):517--529.
[19]
Graves A, Jürgen S. Framewise phoneme classification with bidirectional LSTM and other neural network architectures[J]. Neural Networks, 2005, 18(5--6):602--610.
[20]
Lample G, Ballesteros M, Subramanian S, et al. Neural architectures for named entity recognition [C]//Proceedings of NAACL-2016, San Diego, US: ACL, 2016: 260--270.
[21]
Sun X, Ren F, Huang D. Dual-chain Unequal-state CRF for Chinese New Word Detection and POS Tagging[C]// International Conference on Natural Language Processing & Knowledge Engineering. 2009.
[22]
Li S.L, Guo Y.K. Biomedical Named Entity Recognition with CNN-BLSTM-CRF[J]. Journal of Chinese Information Processing, 2018, 32(1):116--122.
[23]
Mikolov T, Chen K, Corrado G, et al. Efficient Estimation of Word Representations in Vector Space[J]. Computer Science, 2013.
[24]
Zhan J, Qu D, Li Z. Recurrent neural network language model based on word vector features[J]. Pattern Recognition and Artificial Intelligence, 2015, (4): 299--305.
[25]
Kingma D.P, Ba J. Adam: A Method for Stochastic Optimization[J]. Computer Science, 2014.
[26]
Wong, Tzu T. Performance evaluation of classification algorithms by k-fold and leave-one-out cross validation[J]. Pattern Recognition, 2015, 48(9):2839--2846.

Cited By

View all
  • (2022)A Chinese word segmentation method based on dictionary and HMMProceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering10.1145/3573428.3573542(644-649)Online publication date: 21-Oct-2022
  • (2022)Radial Basis Function Attention for Named Entity RecognitionACM Transactions on Asian and Low-Resource Language Information Processing10.1145/353901422:1(1-18)Online publication date: 30-Nov-2022
  • (2022)LBPSC: A Hybrid Prediction Model for Chinese Named Entity Recognition in Water Environment2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC)10.1109/SMC53654.2022.9945417(223-228)Online publication date: 9-Oct-2022
  • Show More Cited By

Index Terms

  1. Chinese named entity recognition in power domain based on Bi-LSTM-CRF

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    AIPR '19: Proceedings of the 2nd International Conference on Artificial Intelligence and Pattern Recognition
    August 2019
    198 pages
    ISBN:9781450372299
    DOI:10.1145/3357254
    • Conference Chairs:
    • Li Ma,
    • Xu Huang
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 16 August 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Bi-LSTM
    2. CRF
    3. named entity recognition
    4. power domain

    Qualifiers

    • Research-article

    Funding Sources

    • Beijing Key Laboratory of Research and System Evaluation of Power Dispatching Automation Technology
    • Sichuan Science and Technology Program
    • Science and Technology Program of State Grid Corporation of China
    • Science and Technology Innovation Program of China Electric Power Research Institute

    Conference

    AIPR 2019

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)11
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 03 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)A Chinese word segmentation method based on dictionary and HMMProceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering10.1145/3573428.3573542(644-649)Online publication date: 21-Oct-2022
    • (2022)Radial Basis Function Attention for Named Entity RecognitionACM Transactions on Asian and Low-Resource Language Information Processing10.1145/353901422:1(1-18)Online publication date: 30-Nov-2022
    • (2022)LBPSC: A Hybrid Prediction Model for Chinese Named Entity Recognition in Water Environment2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC)10.1109/SMC53654.2022.9945417(223-228)Online publication date: 9-Oct-2022
    • (2021)Named Entity Recognition in Electric Power Metering Domain Based on Attention MechanismIEEE Access10.1109/ACCESS.2021.31231549(152564-152573)Online publication date: 2021

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media