[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3372938.3372954acmotherconferencesArticle/Chapter ViewAbstractPublication PagesbdiotConference Proceedingsconference-collections
research-article

Data Integration and NoSQL Systems: A State of the Art

Published: 07 January 2020 Publication History

Abstract

Data Integration is one of the older research problems in database area. It aims to combine data stored in different data sources and provide a unified view of data to the user.
With the spread of a new generation of database systems, called NoSQL systems, data integration becomes more challenging, since we have to integrate data stored in systems that implement different data models and query languages. Inspired by these motivations, we provide in this paper an overview of data integration challenges and solutions in the context of NoSQL systems.

References

[1]
Hanen Abbes and Faiez Gargouri. 2016. Big data integration: A MongoDB database and modular ontologies based approach. Procedia Computer Science 96 (2016), 446--455.
[2]
Souad Amghar, Safae Cherdal, and Salma Mouline. 2018. Which NoSQL database for IoT Applications?. In Proceedings of the International Conference on Selected Topics in Mobile and Wireless Networking (MoWNeT). IEEE, 131--137.
[3]
Grigoris Antoniou and Frank Van Harmelen. 2004. Web ontology language: Owl. In Handbook on ontologies. Springer, 67--92.
[4]
Paolo Atzeni, Francesca Bugiotti, and Luca Rossi. 2012. Uniform access to nonrelational database systems: The SOS platform. In Proceedings of International Conference on Advanced Information Systems Engineering. Springer, 160--174.
[5]
Paolo Atzeni, Francesca Bugiotti, and Luca Rossi. 2014. Uniform access to NoSQL systems. Information Systems 43 (2014), 117--133.
[6]
Jinchuan Chen, Yueguo Chen, Xiaoyong Du, Cuiping Li, Jiaheng Lu, Suyun Zhao, and Xuan Zhou. 2013. Big data challenge: a data management perspective. Frontiers of Computer Science 7, 2 (2013), 157--164.
[7]
Alejandro Corbellini, Cristian Mateos, Alejandro Zunino, Daniela Godoy, and Silvia Schiaffino. 2017. Persisting big-data: The NoSQL landscape. Information Systems 63 (2017), 1--23.
[8]
Olivier Curé, Myriam Lamolle, and Chan Le Duc. 2013. Ontology based data integration over document and column family oriented NOSQL. arXiv preprint arXiv:1307.2603 (2013).
[9]
AnHai Doan, Alon Halevy, and Zachary Ives. 2012. Principles of data integration. Elsevier.
[10]
Chamberlin Don. 2018. DON CHAMBERLIN SQL++ FOR SQL USERS: A TUTORIAL. Couchbase, Inc.
[11]
Avrilia Floratou, Umar Farooq Minhas, and Fatma Özcan. 2014. SQL-on-Hadoop: full circle back to shared-nothing database architectures. Proceedings of the VLDB Endowment 7, 12 (2014), 1295--1306.
[12]
Luis Gravano, Panagiotis G Ipeirotis, Nick Koudas, and Divesh Srivastava. 2003. Text joins in an RDBMS for web data integration. In Proceedings of the 12th international conference on World Wide Web. ACM, 90--101.
[13]
Thomas R Gruber. 1993. A translation approach to portable ontology specifications. Knowledge acquisition 5, 2 (1993), 199--220.
[14]
Alon Halevy, Anand Rajaraman, and Joann Ordille. 2006. Data integration: the teenage years. In Proceedings of the 32nd international conference on Very large data bases. VLDB Endowment, 9--16.
[15]
Denis Hünich and Ralph Müller-Pfefferkorn. 2010. Managing large datasets with iRODSâĂŤA performance analysis. In Proceedings of the International Multi-conference on Computer Science and Information Technology. IEEE, 647--654.
[16]
Maurizio Lenzerini. 2002. Data integration: A theoretical perspective. In Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. ACM, 233--246.
[17]
Alon Y Levy. 2000. Logic-based techniques in data integration. In Logic-based artificial intelligence. Springer, 575--595.
[18]
Guoliang Li. 2017. Human-in-the-loop data integration. Proceedings of the VLDB Endowment 10, 12 (2017), 2006--2017.
[19]
Kian Win Ong, Yannis Papakonstantinou, and Romain Vernoux. 2014. The SQL++ query language: Configurable, unifying and semi-structured. arXiv preprint arXiv:1405.3631 (2014).
[20]
Andrew Pavlo and Matthew Aslett. 2016. What's really new with NewSQL? ACM Sigmod Record 45, 2 (2016), 45--55.
[21]
Jorge Pérez, Marcelo Arenas, and Claudio Gutierrez. 2009. Semantics and complexity of SPARQL. ACM Transactions on Database Systems (TODS) 34, 3 (2009), 16.
[22]
Ágnes Vathy-Fogarassy and Tamás Hugyák. 2017. Uniform data access platform for SQL and NoSQL database systems. Information Systems 69 (2017), 93--105.
[23]
Pedro Manuel Vieira-Marques, Sergi Robles, Jordi Cucurull, Ricardo Joao Cruz-Correia, Guillermo Navarro, and Ramon Marti. 2006. Secure integration of distributed medical data using mobile agents. IEEE Intelligent Systems 21, 6 (2006), 47--54.
[24]
Hao Xu, Ben Keller, Antoine de Torcy, and Jason Coposky. 2017. QueryArrow: Semantically Unified Query and Update of Heterogeneous Data Stores. In Proceedings of the iRODS User Group Meeting 2017. 71.
[25]
Patrick Ziegler and Klaus R Dittrich. 2007. Data integration: problems, approaches, and perspectives. In Conceptual modelling in information systems engineering. Springer, 39--58.

Cited By

View all
  • (2020)Semantic Layer Construction for Big Data Integration2020 International Conference on Advanced Information Technologies (ICAIT)10.1109/ICAIT51105.2020.9261799(24-29)Online publication date: 4-Nov-2020
  • (2020)Storing, preprocessing and analyzing tweets: finding the suitable noSQL systemInternational Journal of Computers and Applications10.1080/1206212X.2020.184694644:6(586-595)Online publication date: 17-Nov-2020

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
BDIoT '19: Proceedings of the 4th International Conference on Big Data and Internet of Things
October 2019
476 pages
ISBN:9781450372404
DOI:10.1145/3372938
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 January 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Big Data
  2. Data Integration
  3. NoSQL database systems

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

BDIoT'19

Acceptance Rates

BDIoT '19 Paper Acceptance Rate 75 of 136 submissions, 55%;
Overall Acceptance Rate 75 of 136 submissions, 55%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)16
  • Downloads (Last 6 weeks)2
Reflects downloads up to 17 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2020)Semantic Layer Construction for Big Data Integration2020 International Conference on Advanced Information Technologies (ICAIT)10.1109/ICAIT51105.2020.9261799(24-29)Online publication date: 4-Nov-2020
  • (2020)Storing, preprocessing and analyzing tweets: finding the suitable noSQL systemInternational Journal of Computers and Applications10.1080/1206212X.2020.184694644:6(586-595)Online publication date: 17-Nov-2020

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media