Abstract
The growing need of sharing data and digital resources within and across organizations has produced a novel attention on issues related to ontology and instance matching. After an introductory classification of the main techniques and tools for ontology matching, the chapter focuses on instance matching by providing an accurate classification of the matching techniques proposed in the literature, and a comparison of the recent instance matching tools according to the results achieved in the OAEI 2009 contest. Ontology and instance matching solutions developed in the BOEMIE project for multimedia resource management and ontology evolution are finally presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Shvaiko, P., Euzenat, J.: Ten Challenges for Ontology Matching. In: Chung, S. (ed.) OTM 2008, Part II. LNCS, vol. 5332. Springer, Heidelberg (2008)
Castano, S., Ferrara, A., Montanelli, S.: Dealing with Matching Variability of Semantic Web Data Using Contexts. In: Pernici, B. (ed.) CAiSE 2010. LNCS, vol. 6051, pp. 194–208. Springer, Heidelberg (2010)
Bouquet, P., Stoermer, H., Mancioppi, M., Giacomuzzi, D.: OkkaM: Towards a Solution to the Identity Crisis on the Semantic Web. In: Proc. of the 3rd Italian Semantic Web Workshop, Pisa, Italy (2006)
Wang, C., Lu, J., Zhang, G.: Integration of Ontology Data through Learning Instance Matching. In: Proc. of the 2006 IEEE/WIC/ACM Int. Conference on Web Intelligence (WI 2006), Washington, DC, USA, pp. 536–539 (2006)
Isaac, A., van der Meij, L., Schlobach, S., Wang, S.: An Empirical Study of Instance-Based Ontology Matching. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L.J.B., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 253–266. Springer, Heidelberg (2007)
Engmann, D., Maßmann, S.: Instance Matching with COMA++. In: Proc. of the Workshop on Datenbanksysteme in Business, Technologie und Web (BTW 2007), Aachen, Germany (2007)
Kalfoglou, Y., Schorlemmer, M.: Ontology Mapping: the State of the Art. The Knowledge Engineering Review 18(1) (2003)
Noy, N.: Semantic Integration: a Survey of Ontology-based Approaches. SIGMOD Record, Special Issue on Semantic Integration 33(4) (2004)
INTEROP: State of the Art and State of the Practice Including Initial Possible Research Orientations. Deliverable D8.1, NoE INTEROP - IST Project n. 508011 - 6th EU Framework Programme (2004)
Shvaiko, P., Euzenat, J.: A Survey of Schema-based Matching Approaches. Journal on Data Semantics IV (2005)
Euzenat, J., Shvaiko, P.: Ontology Matching. Springer, Heidelberg (2007)
Rahm, E., Bernstein, P.: A Survey of Approaches to Automatic Schema Matching. The VLDB Journal 10(4) (2001)
Navarro, G.: A Guided Tour to Approximate String Matching. ACM Computing Surveys 33(1), 31–88 (2001)
Levenshtein, V.: Binary Codes Capable of Correcting Deletions, Insertions, and Reversals. Soviet Physics Doklady 10(8) (1966)
Cormode, G., Muthukrishnan, S.: The String Edit Distance Matching Problem with Moves. In: Proc. of the 13th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2002), San Francisco, CA, USA, pp. 667–676 (2002)
Ukkonen, E., Wood, D.: Approximate String Matching with Suffix Automata. Algorithmica 10(5), 353–364 (1993)
Baeza-Yates, R.A.: Text-Retrieval: Theory and Practice. In: Proc. of the IFIP 12th World Computer Congress on Algorithms, Software, Architecture - Information Processing, Amsterdam, The Netherlands, pp. 465–476 (1992)
Navarro, G., Baeza-Yates, R.A., Sutinen, E., Tarhio, J.: Indexing Methods for Approximate String Matching. IEEE Data Engineering Bulletin 24(4), 19–27 (2001)
Madhavan, J., Bernstein, P.A., Rahm, E.: Generic Schema Matching with Cupid. In: Proc. of the Int. Conference on Very Large Data Bases (VLDB 2002), Hong Kong, China, pp. 49–58 (2002)
Castano, S., De Antonellis, V., De Capitani Di Vimercati, S.: Global viewing of heterogeneous data sources. IEEE Transactions on Knowledge and Data Engineering 13(2), 277–297 (2001)
Jeh, G., Widom, J.: SimRank: a Measure of Structural-Context Similarity. In: Proc. of the 8th ACM SIGKDD Int. Conference on Knowledge Discovery and Data Mining (KDD 2002), Edmonton, Alberta, Canada, pp. 538–543 (2002)
Shasha, D., Wang, J.T.L., Giugno, R.: Algorithmics and Applications of Tree and Graph Searching. In: Proc. of the 21st ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS 2002), Madison, Wisconsin, USA, pp. 39–52 (2002)
Melnik, S., Garcia-Molina, H., Rahm, E.: Similarity Flooding: A Versatile Graph Matching Algorithm and its Application to Schema Matching. In: Proc. of the 18th Int. Conference on Data Engineering (ICDE 2002), San Jose, CA, USA (2002)
Meilicke, C., Stuckenschmidt, H., Tamilin, A.: Repairing Ontology Mappings. In: Proc. of the 22nd Conference on Artificial Intelligence (AAAI 2007), Vancouver, BC, Canada, pp. 1408–1413 (2007)
Giunchiglia, F., Shvaiko, P.: Semantic Matching. Knowledge Engineering Review 18(3) (2003)
Borgida, A., Serafini, L.: Distributed Description Logics: Assimilating Information from Peer Sources. Journal on Data Semantics I, 153–184 (2003)
Castano, S., Ferrara, A., Lorusso, D., Näth, T.H., Möller, R.: Mapping Validation by Probabilistic Reasoning. In: Bechhofer, S., Hauswirth, M., Hoffmann, J., Koubarakis, M. (eds.) ESWC 2008. LNCS, vol. 5021, pp. 170–184. Springer, Heidelberg (2008)
Meilicke, C., Stuckenschmidt, H., Tamilin, A.: Reasoning Support for Mapping Revision. Journal of Logic and Computation 19(5), 807–829 (2009)
Doan, A., Madhavan, J., Domingos, P., Halevy, A.: Learning to Map between Ontologies on the Semantic Web. In: Proc. of the 11th Int. Conference on World Wide Web (WWW 2002), Honolulu, Hawaii, USA, pp. 662–673 (2002)
Lacher, M., Groh, G.: Facilitating the Exchange of Explicit Knowledge Through Ontology Mappings. In: Proc. of the 14th Int. FLAIRS Conference, Key West, FL, USA, pp. 305–309 (2001)
Peng, Y., Ding, Z., Pan, R.: Uncertainty in Ontology Mapping: A Bayesian Perspective. In: Proc. of the Information Interpretation and Integration Conference (I3CON), Gaithersburg, MD, USA (2004)
Smith, A., Elkan, C.: A Bayesian Network Framework for Reject Inference. In: Proc. of the 10th ACM SIGKDD Int. Conference on Knowledge Discovery and Data Mining (KDD 2004), New York, NY, USA, pp. 286–295 (2004)
Euzenat, J.: The Ontology Alignment Evaluation Initiative
Seddiqui, M.H., Aono, M.: An Efficient and Scalable Algorithm for Segmented Alignment of Ontologies of Arbitrary Size. Web Semantics: Science, Services and Agents on the World Wide Web 7(4), 344–356 (2009)
Cruz, I.F., Palandri Antonelli, F., Stroe, C.: AgreementMaker Efficient Matching for Large Real-World Schemas and Ontologies. In: Proc. of the 35th Int. Conference on Very Large Databases (VLDB 2009), Lyon, France, pp. 1586–1589 (2009)
David, J., Guillet, F., Briand, H.: Association Rule Ontology Matching Approach. Int. Journal on Semantic Web and Information Systems 3(2), 27–49 (2007)
Jean-Mary, Y.R., Shironoshita, E.P., Kabuka, M.R.: Ontology matching with semantic verification. Web Semantics: Science, Services and Agents on the World Wide Web 7(3), 235–251 (2009)
Gracia, J., Lopez, V., D’Aquin, M., Sabou, M., Motta, E., Mena, E.: Solving Semantic Ambiguity to Improve Semantic Web based Ontology Matching. In: Proc. of the 2nd Int. Workshop on Ontology Matching, Busan, Korea (2007)
Nagy, M., Vargas-Vera, M., Motta, E.: DSSim - Managing Uncertainty on the Semantic Web. In: Proc. of the 2nd Int. Workshop on Ontology Matching, Busan, Korea (2007)
Kensche, D., Quix, C., Chatti, M., Jarke, M.: GeRoMe: A Generic Role Based Metamodel for Model Management. Journal on Data Semantics VIII, 82–117 (2007)
Reul, Q., Pan, J.Z.: KOSIMap: Ontology Alignments Results for OAEI 2009. In: Proc. of the 4th Int. Workshop on Ontology Matching, Chantilly, VA, USA (2009)
Wang, P., Xu, B.: Lily: Ontology Alignment Results for OAEI 2009. In: Proc. of the 4th Int. Workshop on Ontology Matching, Chantilly, VA, USA (2009)
Bock, J., Liu, P., Hettenhausen, J.: MapPSO Results for OAEI 2009. In: Proc. of the 4th Int. Workshop on Ontology Matching, Chantilly, VA, USA (2009)
Li, J., Tang, J., Li, Y., Luo, Q.: RiMOM: A Dynamic Multistrategy Ontology Alignment Framework. IEEE Transactions on Knowledge and Data Engineering (TKDE) 21(8), 1218–1232 (2008)
Lambrix, P., Tan, H.: SAMBO-A System for Aligning and Merging Biomedical Ontologies. Web Semantics: Science, Services and Agents on the World Wide Web 4(3), 196–206 (2006)
Xu, P., Tao, H., Zang, T., Wang, Y.: Alignment Results of SOBOM for OAEI 2009. In: Proc. of the 4th Int. Workshop on Ontology Matching, Chantilly, VA, USA (2009)
Hamdi, F., Safar, B., Niraula, N.B., Reynaud, C.: TaxoMap in the OAEI 2009 Alignment Contest. In: Proc. of the 4th Int. Workshop on Ontology Matching, Chantilly, VA, USA (2009)
Gu, L., Baxter, R., Vickers, D., Rainsford, C.: Record Linkage: Current Practice and Future Directions. Technical report, CSIRO Mathematical and Information Sciences, Canberra, Australia (2003)
Zhou, R., Hansen, E.A.: Domain-Independent Structured Duplicate Detection. In: Proc. of the 21st National Conference on Artificial Intelligence (AAAI 2006), Boston, Massachusetts, USA (2006)
Singla, P., Domingos, P.: Multi-Relational Record Linkage. In: Proc. of the 3rd KDD Workshop on Multi-Relational Data Mining, Seattle, WA, USA (2004)
Sarawagi, S., Bhamidipaty, A.: Interactive Deduplication Using Active Learning. In: Proc. of the 8th ACM SIGKDD Int. Conference on Knowledge Discovery and Data Mining (KDD 2002), Edmonton, Alberta, Canada, pp. 269–278 (2002)
Verykios, V., Elmagarmid, A., Houstis, E.: Automating the Approximate Record-Matching Process. Information Sciences - Informatics and Computer Science: An Int. Journal 126(1), 83–98 (2000)
Christen, P.: A Two-Step Classification Approach to Unsupervised Record Linkage. In: Proc. of the 6th Australasian Data Mining Conference (AusDM 2007), Gold Coast, Australia, pp. 111–119 (2007)
Christen, P.: Automatic Record Linkage using Seeded Nearest Neighbour and Support Vector Machine Classification. In: Proc. of the 14th ACM SIGKDD Int. Conference on Knowledge Discovery and Data Mining, Las Vegas, Nevada, USA, pp. 151–159 (2008)
Pasula, H., Marthi, B., Milch, B., Russell, S., Shpitser, I.: Identity Uncertainty and Citation Matching. In: Proc. of the Conference on Advances in Neural Information Processing Systems (NIPS 2002), Vancouver, BC, Canada, pp. 1401–1408 (2002)
Dey, D., Sarkar, S., De, P.: Entity Matching in Heterogeneous Databases: A Distance Based Decision Model. In: Proc. of the 31th Annual Hawaii Int. Conference on System Sciences (HICSS 1998), Kohala Coast, Hawaii, USA, pp. 305–315 (1998)
Dey, D., Sarkar, S., De, P.: A Distance-Based Approach to Entity Reconciliation in Heterogeneous Databases. IEEE Transactions on Knowledge and Data Engineering 14(3), 567–582 (2002)
Guha, S., Koudas, N., Marathe, A., Srivastava, D.: Merging the Results of Approximate Match Operations. In: Proc. 30th Int. Conference on Very Large Databases (VLDB 2004), Toronto, Canada, pp. 636–647 (2004)
Winkler, W.: Frequency-Based Matching in Fellegi-Sunter Model of Record Linkage. Statistical research report series rr/2000/06, US Bureau of the Census, Washington, DC, USA (2000)
Wang, Y., Madnick, S.: The Inter-Database Instance Identification Problem in Integrating Autonomous Systems. In: Proc. of the 5th Int. Conference on Data Engineering (ICDE 1989), Washington, DC, USA, pp. 46–55 (1989)
Hernández, M., Stolfo, S.: Real-World Data is Dirty: Data Cleansing and the Merge/Purge Problem. Data Mining and Knowledge Discovery 2(1), 9–37 (1998)
Bhattacharya, I., Getoor, L.: Iterative Record Linkage for Cleaning and Integration. In: Proc. of the 9th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD 2004), New York, NY, USA (2004)
Yan, S., Lee, D., Kan, M., Giles, L.: Adaptive Sorted Neighborhood Methods for Efficient Record Linkage. In: Proc. of the 7th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL 2007), Vancouver, BC, Canada, pp. 185–194 (2007)
Bertolazzi, P., De Santis, L., Scannapieco, M.: Automatic Record Matching in Cooperative Information Systems. In: Proc. of the ICDT Int. Workshop on Data Quality in Cooperative Information Systems (DQCIS 2003), Siena, Italy (2003)
Newcombe, H.: Handbook of Record Linkage. Oxford University Press, Inc., Oxford (1988)
Euzenat, J., Ferrara, A., Hollink, L., Isaac, A., Joslyn, C., Malaisé, V., Meilicke, C., Nikolov, A., Pane, J., Sabou, M., Scharffe, F., Shvaiko, P., Spiliopoulos, V., Stuckenschmidt, H., Sváb-Zamazal, O., Svátek, V., Trojahn dos Santos, C., Vouros, G.A., Wang, S.: Results of the Ontology Alignment Evaluation Initiative 2009. In: Proc. of the 4th Int. Workshop on Ontology Matching, Chantilly, VA, USA (2009)
Lin, D.: An Information-Theoretic Definition of Similarity. In: Proc. of the 15th Int. Conference on Machine Learning, San Francisco, CA, USA, pp. 296–304 (1998)
Castano, S., Ferrara, A., Lorusso, D., Montanelli, S.: The HMatch 2.0 Suite for Ontology Matchmaking. In: Proc. of the 4th Workshop on Semantic Web Applications and Perspectives (SWAP 2007), Bari, Italy (2007)
Stoermer, H., Rassadko, N.: Results of OKKAM Feature Based Entity Matching Algorithm for Instance Matching Contest of OAEI 2009. In: Proc. of the 4th Int. Workshop on Ontology Matching, Chantilly, VA, USA (2009)
Castano, S., Ferrara, A., Montanelli, S.: Matching Ontologies in Open Networked Systems: Techniques and Applications. Journal on Data Semantics V (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Castano, S., Ferrara, A., Montanelli, S., Varese, G. (2011). Ontology and Instance Matching. In: Paliouras, G., Spyropoulos, C.D., Tsatsaronis, G. (eds) Knowledge-Driven Multimedia Information Extraction and Ontology Evolution. Lecture Notes in Computer Science(), vol 6050. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20795-2_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-20795-2_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20794-5
Online ISBN: 978-3-642-20795-2
eBook Packages: Computer ScienceComputer Science (R0)