[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2187836.2187898acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

LINDEN: linking named entities with knowledge base via semantic knowledge

Published: 16 April 2012 Publication History

Abstract

Integrating the extracted facts with an existing knowledge base has raised an urgent need to address the problem of entity linking. Specifically, entity linking is the task to link the entity mention in text with the corresponding real world entity in the existing knowledge base. However, this task is challenging due to name ambiguity, textual inconsistency, and lack of world knowledge in the knowledge base. Several methods have been proposed to tackle this problem, but they are largely based on the co-occurrence statistics of terms between the text around the entity mention and the document associated with the entity. In this paper, we propose LINDEN, a novel framework to link named entities in text with a knowledge base unifying Wikipedia and WordNet, by leveraging the rich semantic knowledge embedded in the Wikipedia and the taxonomy of the knowledge base. We extensively evaluate the performance of our proposed LINDEN over two public data sets and empirical results show that LINDEN significantly outperforms the state-of-the-art methods in terms of accuracy.

References

[1]
S. Auer, C. Bizer, G. Kobilarov, J. Lehmann, and Z. Ives. Dbpedia: A nucleus for a web of open data. In Proceedings of ISWC, pages 11--15, 2007.
[2]
A. Bagga and B. Baldwin. Entity-based cross-document coreferencing using the vector space model. In Proceedings of COLING, pages 79--85, 1998.
[3]
R. Bekkerman and A. McCallum. Disambiguating web appearances of people in a social network. In Proceedings of WWW, pages 463--470, 2005.
[4]
R. Bunescu and M. Pasca. Using Encyclopedic Knowledge for Named Entity Disambiguation. In Proceedings of EACL, pages 9--16, 2006.
[5]
R. L. Cilibrasi and P. M. B. Vitanyi. The google similarity distance. IEEE Trans. on Knowl. and Data Eng., 19:370--383, March 2007.
[6]
S. Cucerzan. Large-Scale Named Entity Disambiguation Based on Wikipedia Data. In Proceedings of EMNLP-CoNLL, pages 708--716, 2007.
[7]
M. Dredze, P. McNamee, D. Rao, A. Gerber, and T. Finin. Entity disambiguation for knowledge base population. In Proceedings of COLING, pages 277--285, 2010.
[8]
C. Fellbaum, editor. WordNet: An Electronic Lexical Database. The MIT Press, 1998.
[9]
T. Hasegawa, S. Sekine, and R. Grishman. Discovering relations among named entities from large corpora. In Proceedings of ACL, pages 415--422, 2004.
[10]
L. Jiang, J. Wang, N. An, S. Wang, J. Zhan, and L. Li. Grape: A graph-based framework for disambiguating people appearances in web search. In Proceedings of ICDM, pages 199--208, 2009.
[11]
N. Kambhatla. Combining lexical, syntactic, and semantic features with maximum entropy models for extracting relations. In Proceedings of ACL, 2004.
[12]
S. Kulkarni, A. Singh, G. Ramakrishnan, and S. Chakrabarti. Collective annotation of Wikipedia entities in web text. In Proceedings of SIGKDD, pages 457--466, 2009.
[13]
D. Lin. An information-theoretic definition of similarity. In Proceedings of ICML, pages 296--304, 1998.
[14]
B. Malin. Unsupervised name disambiguation via social network similarity. In Proceedings of Workshop on Link Analysis, Counterterrorism, and Security, 2005.
[15]
B. Malin, E. Airoldi, and K. M. Carley. A network analysis model for disambiguation of names in lists. Comput. Math. Organ. Theory, 11:119--139, July 2005.
[16]
G. S. Mann and D. Yarowsky. Unsupervised personal name disambiguation. In Proceedings of CONLL, pages 33--40, 2003.
[17]
P. McNamee, H. Simpson, and H. T. Dang. Overview of the tac 2009 knowledge base population track. In Text Analysis Conference (TAC), 2009.
[18]
R. Mihalcea. Unsupervised large-vocabulary word sense disambiguation with graph-based algorithms for sequence data labeling. In Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, pages 411--418, 2005.
[19]
R. Mihalcea and A. Csomai. Wikify! linking documents to encyclopedic knowledge. In Proceedings of CIKM, pages 233--242, 2007.
[20]
D. Milne and I. H. Witten. An effective, low-cost measure of semantic relatedness obtained from Wikipedia links. In Proceedings of WIKIAI, 2008.
[21]
D. Milne and I. H. Witten. Learning to link with wikipedia. In Proceeding of CIKM, pages 509--518, 2008.
[22]
E. Minkov, W. W. Cohen, and A. Y. Ng. Contextual search and name disambiguation in email using graphs. In Proceedings of SIGIR, pages 27--34, 2006.
[23]
R. Navigli and M. Lapata. Graph connectivity measures for unsupervised word sense disambiguation. In Proceedings of the 20th international joint conference on Artificial intelligence, pages 1683--1688, 2007.
[24]
R. Navigli and P. Velardi. Structural semantic interconnections: a knowledge-based approach to word sense disambiguation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27:1075--1086, 2005.
[25]
T. Pedersen, A. Purandare, and A. Kulkarni. Name Discrimination by Clustering Similar Contexts. In Proceedings of CICLing, pages 226--237, 2005.
[26]
F. Suchanek, G. Kasneci, and G. Weikum. Yago: A Large Ontology from Wikipedia and WordNet. Journal of Web Semantics, 6(3):203--217, 2008.
[27]
F. M. Suchanek, G. Kasneci, and G. Weikum. Yago: A core of semantic knowledge unifying wordnet and wikipedia. In Proceedings of WWW, pages 697--706, 2007.
[28]
F. Wu and D. S. Weld. Autonomously semantifying wikipedia. In Proceedings of CIKM, pages 41--50, 2007.
[29]
F. Wu and D. S. Weld. Automatically refining the wikipedia infobox ontology. In Proceedings of WWW, pages 635--644, 2008.
[30]
D. Zelenko, C. Aone, and A. Richardella. Kernel methods for relation extraction. J. Mach. Learn. Res., 3:1083--1106, 2003.
[31]
T. Zesch, I. Gurevych, and M. Mühlhauser. Analyzing and Accessing Wikipedia as a Lexical Semantic Resource. In Biannual Conference of the Society for Computational Linguistics and Language Technology, 2007.

Cited By

View all
  • (2024)Knowledge Graph Embedding: A Survey from the Perspective of Representation SpacesACM Computing Surveys10.1145/3643806Online publication date: 2-Feb-2024
  • (2023)Ontology Learning Applications of Knowledge Base Construction for Microelectronic Systems InformationInformation10.3390/info1403017614:3(176)Online publication date: 9-Mar-2023
  • (2023)A simple semantic ranking approach for entity linkingThird International Seminar on Artificial Intelligence, Networking, and Information Technology (AINIT 2022)10.1117/12.2667421(43)Online publication date: 22-Feb-2023
  • Show More Cited By

Index Terms

  1. LINDEN: linking named entities with knowledge base via semantic knowledge

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '12: Proceedings of the 21st international conference on World Wide Web
    April 2012
    1078 pages
    ISBN:9781450312295
    DOI:10.1145/2187836
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    • Univ. de Lyon: Universite de Lyon

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 16 April 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. entity linking
    2. fact integration
    3. knowledge base
    4. semantic knowledge
    5. wikipedia

    Qualifiers

    • Research-article

    Conference

    WWW 2012
    Sponsor:
    • Univ. de Lyon
    WWW 2012: 21st World Wide Web Conference 2012
    April 16 - 20, 2012
    Lyon, France

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)20
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 23 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Knowledge Graph Embedding: A Survey from the Perspective of Representation SpacesACM Computing Surveys10.1145/3643806Online publication date: 2-Feb-2024
    • (2023)Ontology Learning Applications of Knowledge Base Construction for Microelectronic Systems InformationInformation10.3390/info1403017614:3(176)Online publication date: 9-Mar-2023
    • (2023)A simple semantic ranking approach for entity linkingThird International Seminar on Artificial Intelligence, Networking, and Information Technology (AINIT 2022)10.1117/12.2667421(43)Online publication date: 22-Feb-2023
    • (2023)The SG-CIM Entity Linking Method Based on BERT and Entity Name Embeddings2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)10.1109/AINIT59027.2023.10212510(362-366)Online publication date: 16-Jun-2023
    • (2022)Learning Entity Linking Features for Emerging EntitiesIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3197707(1-14)Online publication date: 2022
    • (2022)Knowledge Base Entity Lookup using Named Entity Recognition: a case study on YAGO2022 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)10.1109/ICCCIS56430.2022.10037689(429-434)Online publication date: 4-Nov-2022
    • (2022)An entity linking method based on graph feature and electricity domain knowledge base2022 International Conference on Computers, Information Processing and Advanced Education (CIPAE)10.1109/CIPAE55637.2022.00065(276-281)Online publication date: Aug-2022
    • (2022)Adversarial Transfer for Classical Chinese NER with Translation Word SegmentationNatural Language Processing and Chinese Computing10.1007/978-3-031-17120-8_24(298-310)Online publication date: 24-Sep-2022
    • (2022)Chinese Named Entity Recognition Based on Dynamically Adjusting Feature WeightsCollaborative Computing: Networking, Applications and Worksharing10.1007/978-3-030-92635-9_1(3-19)Online publication date: 1-Jan-2022
    • (2021)TQELProceedings of the VLDB Endowment10.14778/3476249.347630914:11(2642-2654)Online publication date: 27-Oct-2021
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media