Abstract
Useful information is often scattered over multiple sources. Therefore, automatic data integration that guarantees high data quality is extremely important. One of the crucial operations in data integration from different sources is the detection of different representations of the same piece of information (called coreferent data) and translation to a common, unified representation. That translation is also known as value mapping. However, values mappings are often not explicit i.e. the specific value may be mapped to more than one value. In this paper, we investigate automatic selection method which reduces the set of one-to-many mappings to the set of one-to-one mappings for attributes whose domains are partially ordered and where the given order relation reflects a notion of generality.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Ananthakrishna, R., Chaudhuri, S., Ganti, V.: Eliminating fuzzy duplicates in data warehouses. In: Proceedings of the 28th International Conference on Very Large Data Bases (2002)
Bronselaer, A., Szymczak, M., Zadrożny, S., De Tré, G.: Dynamical order construction in data fusion. Submitted for review in VLDB Journal (2014)
Cohen, W.W.: Integration of heterogeneous databases without common domains using queries based on textual similarity. In: Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data, SIGMOD 1998, pp. 201–212. ACM, New York (1998)
Do, H., Rahm, E.: Coma - a system for flexible combination of schema matching approaches. In: Proceedings of the 28th International Conference on Very Large Data Bases, pp. 610–621 (2002)
Doan, A., Lu, Y., Lee, Y., Han, J.: Object matching for information integration: A profiler-based approach. In: Proceedings of the IJCAI 2003 Workshop on Information Integration on the Web, pp. 53–58 (2003)
Fellegi, I.P., Sunter, A.B.: A theory for record linkage. Journal of the American Statistical Association 64, 1183–1210 (1969)
Kang, J., Lee, D., Mitra, P.: Identifying value mappings for data integration: An unsupervised approach. In: Ngu, A.H.H., Kitsuregawa, M., Neuhold, E.J., Chung, J.-Y., Sheng, Q.Z. (eds.) WISE 2005. LNCS, vol. 3806, pp. 544–551. Springer, Heidelberg (2005)
Madhavan, J., Bernstein, P.A., Rahm, E.: Generic schema matching with Cupid. In: Proceedings of the 27th International Conference on Very Large Data Bases, VLDB 2001, pp. 49–58. Morgan Kaufmann Publishers Inc., San Francisco (2001)
Naumann, F., Bilke, A., Bleiholder, J., Weis, M.: Data fusion in three steps: Resolving inconsistencies at schema-, tuple-, and value-level. Bulletin of The Technical Committee on Data Engineering, 21–31 (2006)
Prade, H.: Possibility sets, fuzzy sets and their relation to Lukasiewicz logic. In: Proceedings 12th Int. Symp. on Multiple-Valued Logic, pp. 223–227 (1982)
Rahm, E., Bernstein, P.A.: A survey of approaches to automatic schema matching. The VLDB Journal 10(4), 334–350 (2001)
Stevens, S.S.: On the theory of scales of measurement. Science 103(2684), 677–680 (1946)
Szymczak, M., Bronselaer, A., Zadrożny, S., De Tré, G.: Semantical mappings of attribute values for data integration. In: Proceedings of the 2014 NAFIPS Annual Meeting, pp. 1–8. IEEE (2014)
Szymczak, M., Koepke, J.: Matching methods for semantic annotationbased xml document transformations. In: New Developments in Fuzzy Sets, Intuitionistic Fuzzy Sets, Generalized Nets and Related Topics. Applications, pp. 297–308. SRI PAS (2012)
Szymczak, M., Zadrożny, S., De Tré, G.: Coreference detection in XML metadata. In: Proceedings of the 2013 Joint IFSA World Congress NAFIPS Annual Meeting, pp. 1354–1359 (2013)
Tejada, S., Knoblock, C.A., Minton, S.: Learning domain-independent string transformation weights for high accuracy object identification. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2002, pp. 350–359. ACM, New York (2002)
Zadeh, L.A.: Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets Syst. 100, 9–34 (1999)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Szymczak, M., Bronselaer, A., Zadrożny, S., De Tré, G. (2015). Selection of Semantical Mapping of Attribute Values for Data Integration. In: Angelov, P., et al. Intelligent Systems'2014. Advances in Intelligent Systems and Computing, vol 322. Springer, Cham. https://doi.org/10.1007/978-3-319-11313-5_51
Download citation
DOI: https://doi.org/10.1007/978-3-319-11313-5_51
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11312-8
Online ISBN: 978-3-319-11313-5
eBook Packages: EngineeringEngineering (R0)