[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article
Free access

Cluster-based file replication in large-scale distributed systems

Published: 01 June 1992 Publication History

Abstract

The increasing need for data sharing in large-scale distributed systems may place a heavy burden on critical resources such as file servers and networks. Our examination of the workload in one large commercial engineering environment shows that wide-spread sharing of unstable files among tens to hundreds of users is common. Traditional client-based file cacheing techniques are not scalable in such environments.
We propose Frolic, a scheme for cluster-based file replication in large-scale distributed file systems. A cluster is a group of workstations and one or more file servers on a local area network. Large distributed systems may have tens or hundreds of clusters connected by a backbone network. By dynamically creating and maintaining replicas of shared files on the file servers in the clusters using those files, we effectively reduce reliance on central servers supporting such files, as well as reduce the distances between the accessing sites and data. We propose and study algorithms for the two main issues in Frolic, 1) locating a valid file replica, and 2) maintaining consistency among replicas. Our simulation experiments using a statistical workload model based upon measurement data and real workload characteristics show that cluster-based file replication can significantly reduce file access delays and server and backbone network utilizations in large-scale distributed systems over a wide range of workload conditions. The workload characteristics most critical to replication performance are: the size of shared files, the number of clusters that modify a file, and the number of consecutive accesses to files from a particular cluster.

References

[1]
M. Baker, J. Hartman, M. Kupfer, K. Shirriff, and J. Ousterhout. Measurements of a distributed file system. In Proc. 13th A CM Symposium on Operating System Principles. ACM, October 1991.
[2]
P. Biswas and K.K. Ramakrishnan. File access characterization of VAX/VMS environments. In l Oth Int. Conf. on Distributed Computing Systems, November 1990.
[3]
John Howard, M. Kazar, S. Menees, D. Nichols, M. Satyanarayanan, R. Sidebotham, and M. West. Scale and performance in a distributed file system. A CM Transactions on Computer Systems, 6(1), February 1988.
[4]
Q. Kure. Optimization of File Migration in Distributed Systems. PhD thesis, University of California Berkeley, 1988.
[5]
K. Li. Memory coherence in shared virtual memory systems. A CM Transactions on Computer Systems, Vol 7, No 4, November 1989.
[6]
M. Molloy. Validation of MVA models for client/server systems. Hewlett-Packard, 1990.
[7]
J. Ousterhout, H. Da Costa, D. Harrison, J. Kunze, M. Kupfer, and J. Thompson. A trace-driven analysis of the Unix 4.2 BSD file system. In Proc. l Oth A CM Symposium on Operating System Principles. ACM, December 1985.
[8]
H.S. Sandhu. File replication and performance in large-scale distributed systems. Master's thesis, University of Toronto, January 1991.
[9]
H. Schwetman. Using CSIM to model complex systems. In Proceeding of the 1988 Winter Simulation Conference, 1988.
[10]
R. Sandberg, D. Goldberg, S. Kleiman, D. Walsh, and B. Lyon. Design and implementation of the Sun network file system. In Useuiz Conference and E~hibition, Portland, OR~ Summer 1985.
[11]
M. Stumm and S. Zhou. Algorithms implementing distributed shared memory. Computer, May 1990.
[12]
H.S. Sandhu and S. Zhou. A case study of file system workload in a large-scale distributed environment. Technical report, University of Toronto, 1992. In preparation.
[13]
S. Zhou and C. Siebenmann. Snooper user guide, January 1990. University of Toronto.

Cited By

View all
  • (2012)Fault Tolerance and Recovery for Group Communication Services in Distributed NetworksJournal of Computer Science and Technology10.1007/s11390-012-1224-127:2(298-312)Online publication date: 5-Mar-2012
  • (2009)A proactive low-overhead file replication scheme for structured P2P content delivery networksJournal of Parallel and Distributed Computing10.1016/j.jpdc.2009.02.00869:5(429-440)Online publication date: 1-May-2009
  • (2005)Peer-to-Peer Data Preservation through Storage AuctionsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2005.3416:3(246-257)Online publication date: 1-Mar-2005
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM SIGMETRICS Performance Evaluation Review
ACM SIGMETRICS Performance Evaluation Review  Volume 20, Issue 1
June 1992
260 pages
ISSN:0163-5999
DOI:10.1145/149439
Issue’s Table of Contents
  • cover image ACM Conferences
    SIGMETRICS '92/PERFORMANCE '92: Proceedings of the 1992 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
    June 1992
    267 pages
    ISBN:0897915070
    DOI:10.1145/133057
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 June 1992
Published in SIGMETRICS Volume 20, Issue 1

Check for updates

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)95
  • Downloads (Last 6 weeks)18
Reflects downloads up to 12 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2012)Fault Tolerance and Recovery for Group Communication Services in Distributed NetworksJournal of Computer Science and Technology10.1007/s11390-012-1224-127:2(298-312)Online publication date: 5-Mar-2012
  • (2009)A proactive low-overhead file replication scheme for structured P2P content delivery networksJournal of Parallel and Distributed Computing10.1016/j.jpdc.2009.02.00869:5(429-440)Online publication date: 1-May-2009
  • (2005)Peer-to-Peer Data Preservation through Storage AuctionsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2005.3416:3(246-257)Online publication date: 1-Mar-2005
  • (1993)Cache management algorithms for flexible filesystemsACM SIGMETRICS Performance Evaluation Review10.1145/174215.17421921:2(16-25)Online publication date: 1-Dec-1993
  • (2016)Semantic QoS synchronization of Web services2016 International Symposium on Networks, Computers and Communications (ISNCC)10.1109/ISNCC.2016.7746060(1-6)Online publication date: May-2016
  • (2015)A novel framework for bindings synchronization of Web servicesService Oriented Computing and Applications10.1007/s11761-014-0155-99:1(59-74)Online publication date: 1-Mar-2015
  • (2012)Analyzing IT-enabled effectiveness in government sectorACM SIGMIS Database: the DATABASE for Advances in Information Systems10.1145/2096140.209614442:4(38-62)Online publication date: 13-Jan-2012
  • (2011)"Beyond the social web: the geo-social revolution" by Salvatore Scellato with Ching-man Au Yeung as CoordinatorACM SIGWEB Newsletter10.1145/2020936.20209412011:Autumn(1-5)Online publication date: 1-Sep-2011
  • (2011)Towards designing more effective systems by understanding user experiencesACM SIGWEB Newsletter10.1145/2020936.20209402011:Autumn(1-3)Online publication date: 1-Sep-2011
  • (2011)Data Collection in Wireless Sensor Networks with Mobile ElementsACM Transactions on Sensor Networks10.1145/1993042.19930498:1(1-31)Online publication date: 1-Aug-2011
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media