[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3639233.3639349acmotherconferencesArticle/Chapter ViewAbstractPublication PagesnlpirConference Proceedingsconference-collections
research-article

Multi-perspective Enhancement Of Text Semantic Matching

Published: 05 March 2024 Publication History

Abstract

With the widespread adoption of social media, text matching tasks have gained a prominent role in the field of natural language processing. However, traditional text matching methods often overly rely on shallow features and do not adequately consider the contextual semantic information within the text. With the rise of deep neural networks, deep learning methods have ushered in new directions in the field of text matching, enabling us to better understand and process semantic information within text. We introduce a framework that combines various models and techniques to more accurately assess semantic similarity between Chinese texts. Within this framework, we integrate the contextual understanding capability of BERT with neural network architectures such as Bidirectional Long Short-Term Memory (BiLSTM), Convolutional Neural Networks (CNN), and self-attention mechanisms. Through the fusion of multi-perspective information, we are better equipped to handle semantic ambiguities and polysemy in text matching tasks, ultimately improving matching performance. Extensive experimental evaluations were conducted on publicly available Chinese datasets, BQ and LCQMC. The results demonstrate that our multi-perspective textual semantic matching approach has significantly outperformed existing methods in terms of performance.

References

[1]
Kim Y. Convolutional Neural Networks for Sentence Classification[C]//Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). 2014: 1746-1751.
[2]
He T, Huang W, Qiao Y, Text-attentional convolutional neural network for scene text detection[J]. IEEE transactions on image processing, 2016, 25(6): 2529-2541.
[3]
Lai Y, Feng Y, Yu X, Lattice cnns for matching based chinese question answering[C]//Proceedings of the AAAI conference on artificial intelligence. 2019, 33(01): 6634-6641.
[4]
Elman J L. Finding structure in time[J]. Cognitive science, 1990, 14(2): 179-211.
[5]
Hochreiter S, Schmidhuber J. Long short-term memory[J]. Neural computation, 1997, 9(8): 1735-1780.
[6]
Cho K, van Merrienboer B, Gulcehre C, Learning phrase representations using RNN encoder-decoder for statistical machine translation[C]//Conference on Empirical Methods in Natural Language Processing (EMNLP 2014). 2014.
[7]
Vaswani A, Shazeer N, Parmar N, Attention is all you need[J]. Advances in neural information processing systems, 2017, 30.
[8]
Kenton J D M W C, Toutanova L K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding[C]//Proceedings of NAACL-HLT. 2019: 4171-4186.
[9]
Hu B, Lu Z, Li H, Convolutional neural network architectures for matching natural language sentences[J]. Advances in neural information processing systems, 2014, 27.
[10]
Kalchbrenner N, Grefenstette E, Blunsom P. A convolutional neural network for modelling sentences[C]//52nd Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 2014.
[11]
Mueller J, Thyagarajan A. Siamese recurrent architectures for learning sentence similarity[C]//Proceedings of the AAAI conference on artificial intelligence. 2016, 30(1).
[12]
Wu Y, Wu W, Li Z, Learning Matching Models with Weak Supervision for Response Selection in Retrieval-based Chatbots[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 2018: 420-425.
[13]
Stickland A C, Murray I. Bert and pals: Projected attention layers for efficient adaptation in multi-task learning[C]//International Conference on Machine Learning. PMLR, 2019: 5986-5995.
[14]
Reimers N, Gurevych I. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Association for Computational Linguistics, 2019.
[15]
Nair V, Hinton G E. Rectified linear units improve restricted boltzmann machines[C]//Proceedings of the 27th international conference on machine learning (ICML-10). 2010: 807-814.
[16]
Su, J.: Text emotion classification (iv): Better loss function (2017). https:// spaces. ac. cn/ archi ves/4293. Accessed 30 March 2017.
[17]
Chen J, Chen Q, Liu X, The bq corpus: A large-scale domain-specific chinese corpus for sentence semantic equivalence identification[C]//Proceedings of the 2018 conference on empirical methods in natural language processing. 2018: 4946-4951.
[18]
Liu X, Chen Q, Deng C, Lcqmc: A large-scale chinese question matching corpus[C]//Proceedings of the 27th international conference on computational linguistics. 2018: 1952-1962.
[19]
Zhang X, Lu W, Li F, Deep feature fusion model for sentence semantic matching[J]. Computers, Materials and Continua, 2019.
[20]
Chen Q, Zhu X, Ling Z H, Enhanced LSTM for Natural Language Inference[C]//Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2017: 1657-1668.
[21]
Wang Z, Hamza W, Florian R. Bilateral multi-perspective matching for natural language sentences[C]//Proceedings of the 26th International Joint Conference on Artificial Intelligence. 2017: 4144-4150.
[22]
Cui Y, Che W, Liu T, Pre-training with whole word masking for chinese bert[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, 29: 3504-3514.
[23]
Sun Y, Wang S, Li Y, Ernie: Enhanced representation through knowledge integration[J]. arXiv preprint arXiv:1904.09223, 2019.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
NLPIR '23: Proceedings of the 2023 7th International Conference on Natural Language Processing and Information Retrieval
December 2023
336 pages
ISBN:9798400709227
DOI:10.1145/3639233
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 March 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. BERT
  2. multi-perspective
  3. semantic matching
  4. text analysis

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

NLPIR 2023

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 21
    Total Downloads
  • Downloads (Last 12 months)21
  • Downloads (Last 6 weeks)5
Reflects downloads up to 07 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media