Abstract
Semantic web provides information for humans as well as computers to semantically maintain a large-scale of data and provide a meaningful content of unstructured data. It offers new benefits for big-data research and applications. Big data is a new term refers to a massive collection of datasets from various sources in structured, semi-structured, and unstructured data collection. Their integration faces many problems such as the structural and the semantic heterogeneity as the processing of these data is difficult using traditional databases and software techniques. In this paper, the data resources are extracted and aggregated from different sources on the web following by using the geospatial ontology to transform this data into RDF format. RDF format is used to integrate the data semantically and construct the big-data semantic model that is used to store data. The major contribution of this research is to aggregate, integrate, and represent geospatial data semantically. A case study of cities data is used to illustrate the proposed workflow functionalities. The main result of this research is to solve the heterogeneous problem in different data sources with improving the data aggregation, integration, and representation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Wu, H., Yamaguchi, A.: Semantic web technologies for the big data in life sciences. Biosci. Trends 8(4), 192–201 (2014)
Ahmed, Z., Gerhard, D.: Web to Semantic Web & Role of Ontology (2010). arXiv preprint: arXiv:1008.1331
Jain, V., Singh, M.: Ontology-based information retrieval in semantic web: a survey. Int. J. Inf. Technol. Comput. Sci. (IJITCS) 5(10), 62 (2013)
Di Martino, B., Esposito, A., Nacchia, S., Maisto, S.A.: A semantic model for business process patterns to support cloud deployment. Comput. Sci. Res. Dev. 32(3–4), 257–267 (2017)
Kang, L., Yi, L., Dong, L.: Research on construction methods of big data semantic model. In: Proceedings of the World Congress on Engineering (WCE 2014), vol. 1, London, UK (2014)
Bansal, S.K.: Towards a semantic extract-transform-load (ETL) framework for big data integration. In: 2014 IEEE International Congress on Big Data (BigData Congress), Anchorage, pp. 522–529. IEEE (2014)
Bertino, E.: Big data – opportunities and challenges. In: IEEE 37th Annual Computer Software and Applications Conference, Kyoto, Japan, pp. 479–480 (2013)
Thirunarayan, K., Sheth, A.: Semantics-empowered approaches to big data processing for physical-cyber-social applications. In: Semantics for Big Data: Papers from the AAAI Symposium. AAAI Technical report FS-13-04, Arlington, Virginia, USA, pp. 68–75 (2013)
Arputhamary, B., Arockiam, L.: A review on big data integration. Int. J. Comput. Appl., 21–26 (2014)
Bizer, C., Boncz, P., Brodie, M.L., Erling, O.: The meaningful use of big data: four perspectives–four challenges. ACM SIGMOD Rec. 40(4), 56–60 (2012)
Bansal, S.K., Kagemann, S.: Integrating big data: a semantic extract-transform-load framework. Computer 48(3), 42–50 (2014)
Bergamaschi, S., Guerra, F., Orsini, M., Sartori, C., Vincini, M.: A semantic approach to ETL technologies. Data Knowl. Eng. 70(8), 717–731 (2011)
Jiang, L., Cai, H., Xu, B.: A domain ontology approach in the ETL process of data warehousing. In: 2010 IEEE 7th International Conference on e-Business Engineering (ICEBE), Shanghai, pp. 30–35 (2010)
Huang, O.R., Du, Y.L., Zhang, M.H., Zhang, C.: Application of ontology-based automatic ETL in marine data integration. In: IEEE Symposium on Electrical & Electronics Engineering (EEESYM), Kuala Lumpur, Malaysia, pp. 11–13 (2012)
Cruz, I.F., Ganesh, V.R., Mirrezaei, S.I.: Semantic extraction of geographic data from web tables for big data integration. In: Proceedings of the 7th Workshop on Geographic Information Retrieval, Orlando, FL, USA, pp. 19–26. ACM (2013)
Zhang, Y., Chiang, Y.Y., Szekely, P., Knoblock, C.A.: A semantic approach to retrieving, linking, and integrating heterogeneous geospatial data. In: Joint Proceedings of the Workshop on AI Problems and Approaches for Intelligent Environments and Workshop on Semantic Cities, pp. 31–37. ACM (2013)
Boury-Brisset, A.-C.: Managing semantic big data for intelligence. In: STIDS, pp. 41–47 (2013)
Xiong, J., Liu, Y., Liu, W.: Ontology-based integration and sharing of big data educational resources. In: IEEE 11th Web Information System and Application Conference (WISA), Tianjin, China, pp. 245–248 (2014)
Gollapudi, S.: Aggregating financial services data without assumptions: a semantic data reference architecture. In: 2015 IEEE International Conference on Semantic Computing (ICSC), Anaheim, CA, USA, pp. 312–315 (2015)
Saradha, A.: Semantic integration of heterogeneous web data for tourism domain using ontology-based resource description language. J. Comput. Appl. 3(3), 1 (2010)
Jadhao, H., Aghav, D.J., Vegiraju, A.: Semantic tool for analysing unstructured data. Int. J. Sci. Eng. Res. 3(8), 1–7 (2012)
MapCruzin data Homepage. http://www.mapcruzin.com/. Accessed 21 Oct 2017
DATA.GOV Homepage. https://catalog.data.gov/. Accessed 20 Oct 2017
United States Census Homepage. https://www.census.gov/. Accessed 1 Oct 2017
OST/SEC Homepage. http://www.nws.noaa.gov/. Accessed 20 Oct 2017
Cities data Homepage. https://www.uscitieslist.org/. Accessed 19 Oct 2017
Gaslamp media Homepage. https://www.gaslampmedia.com. Accessed 19 Oct 2017
David, J., Euzenat, J., Scharffe, F., Trojahn dos Santos, C.: The alignment API 4.0. Semant. Web Interoperability Usability Appl. 2(1), 3–10 (2011)
Euzenat, J.: An API for ontology alignment. In: International Semantic Web Conference, pp. 698–712. Springer, Heidelberg (2004)
Matthews, B.: Semantic web technologies. E-learning 6(6), 8 (2005)
RDF Homepage. https://www.w3.org/RDF/. Accessed 15 Oct 2017
RDF Homepage. http://www.webopedia.com/TERM/R/RDF.html. Accessed 1 Oct 2017
OWL Homepage. https://www.w3.org/2001/sw/wiki/OWL. Accessed 21 Oct 2017
SPARQL Query Language for RDF Homepage. https://www.w3.org/TR/rdf-sparql-query/. Accessed 20 Oct 2017
Protégé Homepage. http://protegewiki.stanford.edu/wiki/Main_Page. Accessed 19 Oct 2017
Do, H.H., Melnik, S., Rahm, E.: Comparison of schema matching evaluations. In: Net. ObjectDays: International Conference on Object-Oriented and Internet-Based Technologies, Concepts, and Applications for a Networked World, pp. 221–237. Springer, Heidelberg (2002)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Saber, A., Al-Zoghby, A.M., Elmougy, S. (2018). Big-Data Aggregating, Linking, Integrating and Representing Using Semantic Web Technologies. In: Hassanien, A., Tolba, M., Elhoseny, M., Mostafa, M. (eds) The International Conference on Advanced Machine Learning Technologies and Applications (AMLTA2018). AMLTA 2018. Advances in Intelligent Systems and Computing, vol 723. Springer, Cham. https://doi.org/10.1007/978-3-319-74690-6_33
Download citation
DOI: https://doi.org/10.1007/978-3-319-74690-6_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-74689-0
Online ISBN: 978-3-319-74690-6
eBook Packages: EngineeringEngineering (R0)