Abstract
Several data replication techniques have been developed to support high-performance access to remotely produced scientific data. Most of these techniques, however, do not guarantee replica consistency, because a replica is updated only periodically through the remote clients. We have developed two data replication techniques, called owner-initiated replication and client-initiated replication. Neither technique relies on file system-level locking functions, so both can be ported easily to any file system. In this paper we describe the design and implementation of the two techniques and present performance results on Linux clusters.
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
No, J., Park, C.W., Park, S.S. (2006). Data Replication Techniques for Data-Intensive Applications. In: Alexandrov, V.N., van Albada, G.D., Sloot, P.M.A., Dongarra, J. (eds) Computational Science – ICCS 2006. ICCS 2006. Lecture Notes in Computer Science, vol 3994. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11758549_141
DOI: https://doi.org/10.1007/11758549_141
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34385-1
Online ISBN: 978-3-540-34386-8
eBook Packages: Computer Science (R0)