research-article

High-quality Task Division for Large-scale Entity Alignment

Authors:

Bing Liu,

Wen Hua,

Guido Zuccon,

Genghong Zhao,

Xia ZhangAuthors Info & Claims

CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management

Pages 1258 - 1268

https://doi.org/10.1145/3511808.3557352

Published: 17 October 2022 Publication History

Get Access

Abstract

Entity Alignment (EA) aims to match equivalent entities that refer to the same real-world objects and is a key step for Knowledge Graph (KG) fusion. Most neural EA models cannot be applied to large-scale real-life KGs due to their excessive consumption of GPU memory and time. One promising solution is to divide a large EA task into several subtasks such that each subtask only needs to match two small subgraphs of the original KGs. However, it is challenging to divide the EA task without losing effectiveness. Existing methods display low coverage of potential mappings, insufficient evidence in context graphs, and largely differing subtask sizes.

In this work, we design the DivEA framework for large-scale EA with high-quality task division. To include in the EA subtasks a high proportion of the potential mappings originally present in the large EA task, we devise a counterpart discovery method that exploits the locality principle of the EA task and the power of trained EA models. Unique to our counterpart discovery method is the explicit modelling of the chance of a potential mapping. We also introduce an evidence passing mechanism to quantify the informativeness of context entities and find the most informative context graphs with flexible control of the subtask size. Extensive experiments show that DivEA achieves higher EA performance than alternative state-of-the-art solutions.

Supplementary Material

MP4 File (CIKM22-fp0406.mp4)

Presentation video of paper "High-quality Task Division for Large-scale Entity Alignment". In this video, we give a high-level and intuitive explanation of our framework DivEA.

Download
164.80 MB

References

[1]

Sö ren Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, and Zachary G. Ives. 2007. DBpedia: A Nucleus for a Web of Open Data. In The Semantic Web, 6th International Semantic Web Conference, Busan, Korea, November 11-15, 2007 (Lecture Notes in Computer Science, Vol. 4825), Karl Aberer, Key-Sun Choi, Natasha Fridman Noy, Dean Allemang, Kyung-Il Lee, Lyndon J. B. Nixon, Jennifer Golbeck, Peter Mika, Diana Maynard, Riichiro Mizoguchi, Guus Schreiber, and Philippe Cudré -Mauroux (Eds.). Springer, 722--735. https://doi.org/10.1007/978-3-540-76298-0_52

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

Large-scale optimization using immune algorithm

MRGN: Multiscale Relation-Gated Graph Network for Entity Alignment

Using combinatorial optimization to solve entity alignment: An efficient unsupervised model

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Upcoming Conference

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations