Selection of Semantical Mapping of Attribute Values for Data Integration

  • Conference paper
Intelligent Systems'2014

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 322)

Abstract

Useful information is often scattered over multiple sources. Therefore, automatic data integration that guarantees high data quality is extremely important. One of the crucial operations in integrating data from different sources is the detection of different representations of the same piece of information (called coreferent data) and their translation to a common, unified representation. That translation is also known as value mapping. However, value mappings are often not explicit, i.e., a specific value may be mapped to more than one value. In this paper, we investigate an automatic selection method that reduces a set of one-to-many mappings to a set of one-to-one mappings for attributes whose domains are partially ordered and where the given order relation reflects a notion of generality.
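To make the reduction from one-to-many to one-to-one mappings concrete, here is a minimal Python sketch. It is not the method from the paper: the abstract does not state the selection rule, so the sketch assumes one plausible policy, namely keeping the most specific candidate when exactly one exists under the generality order and leaving the mapping unresolved otherwise. The function names, the example values, and the order relation are all hypothetical.

```python
from typing import Dict, Optional, Set, Tuple

# Hypothetical generality order: a pair (a, b) means "a is more specific than b".
Order = Set[Tuple[str, str]]


def is_below(a: str, b: str, order: Order) -> bool:
    """Return True if a is strictly more specific than b (transitive closure)."""
    stack, seen = [a], {a}
    while stack:
        current = stack.pop()
        for lower, upper in order:
            if lower == current and upper not in seen:
                if upper == b:
                    return True
                seen.add(upper)
                stack.append(upper)
    return False


def select_one_to_one(
    candidates: Dict[str, Set[str]], order: Order
) -> Dict[str, Optional[str]]:
    """Assumed policy: keep the unique most specific candidate, else None."""
    selected: Dict[str, Optional[str]] = {}
    for source, targets in candidates.items():
        # Minimal elements: candidates with no other candidate strictly below them.
        minimal = [
            t for t in targets
            if not any(is_below(other, t, order) for other in targets if other != t)
        ]
        selected[source] = minimal[0] if len(minimal) == 1 else None
    return selected


# Illustrative data only: "cafe" is more specific than "restaurant".
order: Order = {("cafe", "restaurant")}
candidates = {
    "coffee house": {"cafe", "restaurant"},  # comparable candidates
    "diner": {"restaurant"},                 # already one-to-one
    "pub": {"bar", "cafe"},                  # incomparable candidates
}
selected = select_one_to_one(candidates, order)
```

In this illustration, "coffee house" resolves to "cafe" because it lies below "restaurant" in the assumed order, while "pub" stays unresolved because "bar" and "cafe" are incomparable. A different policy, such as preferring the most general candidate, would only require selecting maximal instead of minimal elements.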


Corresponding author

Correspondence to Marcin Szymczak.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Szymczak, M., Bronselaer, A., Zadrożny, S., De Tré, G. (2015). Selection of Semantical Mapping of Attribute Values for Data Integration. In: Angelov, P., et al. Intelligent Systems'2014. Advances in Intelligent Systems and Computing, vol 322. Springer, Cham. https://doi.org/10.1007/978-3-319-11313-5_51

  • DOI: https://doi.org/10.1007/978-3-319-11313-5_51

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-11312-8

  • Online ISBN: 978-3-319-11313-5

  • eBook Packages: Engineering, Engineering (R0)
