Abstract
The long-tail distribution of word senses means that Word Sense Disambiguation (WSD) must handle both head senses with abundant training samples and tail senses with only a few. Traditional recognition methods work well for head senses with sufficient training data, but they cannot deal effectively with tail senses. Inspired by the diverse memory and recognition abilities observed in children's linguistic behavior, we propose a bi-matching mechanism for WSD. Because tail senses often appear in fixed collocations, we design a collocation feature matching method suited to tail senses, while the traditional definition matching method is retained for head senses; the two matching methods are then combined into a WSD model with a bi-matching mechanism (called Bi-MWSD). Bi-MWSD alleviates the difficulty of identifying tail senses caused by insufficient training samples. Experiments are conducted on the standard English all-words WSD evaluation framework and on a framework with augmented training data. Bi-MWSD outperforms the baseline models and achieves state-of-the-art performance under the data-augmented evaluation framework.
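To make the bi-matching idea concrete, the sketch below illustrates one plausible way to combine a definition-matching score with a collocation-matching score for a candidate sense. It is a minimal illustration, not the authors' architecture: it assumes a BERT encoder from the Hugging Face transformers library, and the mean pooling, the cosine similarities, the combination weight alpha, and the collocation prototypes are all hypothetical choices made for exposition.

```python
# Illustrative sketch of a bi-matching score (NOT the paper's exact model).
# Assumes `torch` and `transformers` are installed; all names and the
# weighting scheme are hypothetical.
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoder = AutoModel.from_pretrained("bert-base-uncased")

def embed(text: str) -> torch.Tensor:
    """Mean-pooled BERT embedding of a piece of text."""
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        hidden = encoder(**inputs).last_hidden_state  # (1, seq_len, dim)
    return hidden.mean(dim=1).squeeze(0)              # (dim,)

def bi_matching_score(context: str, collocation: str, gloss: str,
                      sense_collocations: list[str],
                      alpha: float = 0.5) -> float:
    """Combine definition matching (context vs. sense gloss) with
    collocation matching (local collocation vs. collocations stored
    for this sense)."""
    # Head-sense signal: similarity between the full context and the gloss.
    definition_score = torch.cosine_similarity(embed(context), embed(gloss), dim=0)
    # Tail-sense signal: similarity between the target word's local
    # collocation and a prototype of the sense's recorded collocations.
    if sense_collocations:
        proto = torch.stack([embed(c) for c in sense_collocations]).mean(dim=0)
        collocation_score = torch.cosine_similarity(embed(collocation), proto, dim=0)
    else:
        collocation_score = torch.tensor(0.0)
    return (alpha * definition_score + (1 - alpha) * collocation_score).item()

# Usage: score each candidate sense of "bank" and pick the best one.
senses = {
    "bank.n.01": ("sloping land beside a body of water", ["river bank"]),
    "bank.n.02": ("a financial institution", ["bank account", "bank loan"]),
}
context = "She sat on the bank and watched the river flow."
scores = {s: bi_matching_score(context, "river bank", gloss, colls)
          for s, (gloss, colls) in senses.items()}
print(max(scores, key=scores.get))
```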
Acknowledgements
Our work is supported by the National Natural Science Foundation of China (61976154), the National Key R&D Program of China (2019YFC1521200), the State Key Laboratory of Communication Content Cognition, People's Daily Online (No. A32003), and the National Natural Science Foundation of China (No. 62106176).
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, J., He, R., Guo, F. (2023). Bi-matching Mechanism to Combat Long-tail Senses of Word Sense Disambiguation. In: Amini, MR., Canu, S., Fischer, A., Guns, T., Kralj Novak, P., Tsoumakas, G. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022. Lecture Notes in Computer Science(), vol 13714. Springer, Cham. https://doi.org/10.1007/978-3-031-26390-3_36
DOI: https://doi.org/10.1007/978-3-031-26390-3_36
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-26389-7
Online ISBN: 978-3-031-26390-3