[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article
Free access

Analysis of file I/O traces in commercial computing environments

Published: 01 June 1992 Publication History

Abstract

Improving the performance of the file system is becoming increasingly important to alleviate the effect of I/O bottlenecks in computer systems. To design changes to an existing file system or to architect a new file system it is important to understand current usage patterns. In this paper we analyze file I/O traces of several existing production computer sytems to understand file access behavior.
Our analysis suggests that a relatively small percentage of the files are active. The amount of total data active is also quite small for interactive environments. An average file encounters a relatively small number of file opens while receiving an order of magnitude larger number of reads to it. An average process opens quite a large number of files over a typical prime time period. What is more significant is that the effect of outliers on many of the characteristics we studied is dominant. A relatively small number of processes dominate the activity, and a very small number of files receive most of these operations.
In addition, we provide a comprehensive analysis of the dynamic sharing of files in each of these enviroments, addressing both the simultaneous and sequential sharing aspects, and the activity to these shared files. We observe that although only a third of the active files are sequentially shared, they receive a very large proportion of the total operations. We analyze the traces from a given environment across different lengths of time, such as one hour, three hour and whole work-day intervals and do this for 3 different environments. This gives us an idea of the shortest length of the trace needed to have confidence in the estimation of the parameters.

References

[1]
Biswas, P., Ramakdshnan, K.K., "File Access Characterization of VAX/VMS Environments," Proceedings of the 10th International Conference on Distributed Computing Systems, Paris, France, May 1990.
[2]
Bodnarchuk, R., Bunt, R., "A Synthetic Workload Model for a Distributed Systems File Server," Proceedings of the 1991 ACM Sigmetrics Conference on Measurement and Modeling of Computer Systems, Vol. 19, No. 1, May 1991.
[3]
Floyd, R., "Short-Term File Reference Patterns in a Unix Environment," Technical Report TR-177, Department of Computer Science, University of Rochester, 1986.
[4]
Gray, C.G., Cheriton, D.R., "Leases: An Efficient Fault- Tolerant Mechanism for Distributed File Cache Consistency," Proceedings of the 12th ACM Symposium on Operating Systems Principles, Litchfield Park, December 1989.
[5]
Howard, J.H., Kazar, M.L., Menees, S.G., Nichols, D.A., Satyanarayanan, M., Sidebotham, R.N., West, M.J., "Scale and Performance in a Distributed File System," ACM Transactions on Computer Systems 6(1), February 1988.
[6]
Jain, R., Turner, R., "Workload Characterization using Image Accounting," Proceedings of the Computer Performance Evaluation Users Group 18th Meeting, Washington D.C., October 1982.
[7]
Kronenberg, N., Levy, H., Strecker, W., "VAXclusters: A Closely-coupled Distributed System," ACM Transactions on Computer Systems, Vol. 4, No. 2, May 1986.
[8]
Kure, O., "Optimization of File Migration in Distributed Systems," Ph.D. thesis, Computer Science Division, University of California, Berkeley, CA, Report No. UCB/CSD/88/413, April 1988.
[9]
Lazowska, E., Zahorjan, J., Cheriton, D., Zwaenepoel, W., "File Access Performance of Diskless Workstations," ACM Transactions on Computer Systems 4(2), August 1986.
[10]
Majumdar, S., Bunt, R., "Measurement and Analysis of Locality Phases in File Referencing Behaviour", Proceedings of the Performance '86 and ACM Sigmetrics '86, Joint Conference on Computer Performance Modeling, Measurement and Evaluation, Raleigh, N.C., May 1986.
[11]
McCoy, K., "VMS File Systems Internals", Digital Press, Bedford, Mass., t 990.
[12]
McKusick, M.K., Joy, W.M., Leffier, S.J., Fabry, R.S., "A Fast File System for Unix," ACM Transactions on Computer Systems 2(3), 1984.
[13]
Morris, J.H., Satyanarayanan, M., Conner, M.H., Howard, J.H., Rosenthal, D.S., Smith, F.D., "Andrew: A Distributed Personal Computing Environment," Communications of the ACM 29(3), March 1986.
[14]
Murthy, V.S. Devarakonda, "File Usage Analysis and Resource Usage Prediction: A Measurement-Based Study", Ph.D. thesis, Department of Computer Science, University of Illinois at Urbana-Champaign, 1988.
[15]
Nelson, M.N., Welch, B.B., Ousterhout, J.K., "Caching in the Sprite Network File System," ACM Transactions on Computer Systems 6(1), February 1988.
[16]
Nichols, W.G., Emer, J.S., "Design and Implementation of the VAX Distributed File Service," Digital Technical Journal 9, June 1989.
[17]
Ousterhout, J.K., Da Costa, H., Harrison, D., Kunze, J., Kupfer, M., Thompson, J., "A Trace-Driven Analysis of the Unix 4.2 BSD File System," Proceeding of the 10th ACM Symposium on Operating System Principles, Orcas island, December 1985.
[18]
Ousterhout, J.K., Cherenson, A.R., Douglis, F., Nelson, M.N., Welch, B.B., "The Sprite Network Operating System," Computer 21 (2), February 1988.
[19]
Ousterhout, J.K., Douglis, F., "Beating the I/0 Bottleneck: A Case for Log Structured File Systems", Operating Systems Review, ACM, Vol. 23, No.i, January 1989.
[20]
Ramakrishnan, K.K., Emer, J.S., "Performance Analysis of Mass Storage Service Alternatives for Distributed Systems," IEEE Transactions on Software Engineering 15(2), February 1989.
[21]
Satyanarayanan, M., "A Survey of Distributed File Systems," Technical Report CMU-CS-89-116, Department of Computer Science, Carnegie Mellon University, February 1989.
[22]
Satyanarayanan, M., ",4 Study of File Size and Functional Lifetimes," Proceedings of the 8th ACM Symposium on Operating System Principles, Asilomar, December 1981.
[23]
Smith, A.J., "Analysis of Long-Term File Reference Patterns for Application to File Migration Algorithms," IEEE Transactions on Software Engineering 7(4), July 1981.
[24]
SPEC, "SPEC Benchmark Suite Release 1.0," SPEC Newsletter 2(2), Available from Waterside Associates, Fremont, CA., 1990.

Cited By

View all
  • (2024)IoT malware detection using static and dynamic analysis techniquesSecurity and Privacy10.1002/spy2.4447:6Online publication date: 19-Jul-2024
  • (2020)A trace-based study of SMB network file system workloads in an academic enterpriseProceedings of the 2020 Summer Simulation Conference10.5555/3427510.3427552(1-8)Online publication date: 20-Jul-2020
  • (2016)Offline Selective Data Deduplication for Primary Storage SystemsIEICE Transactions on Information and Systems10.1587/transinf.2015EDP7034E99.D:2(370-382)Online publication date: 2016
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM SIGMETRICS Performance Evaluation Review
ACM SIGMETRICS Performance Evaluation Review  Volume 20, Issue 1
June 1992
260 pages
ISSN:0163-5999
DOI:10.1145/149439
Issue’s Table of Contents
  • cover image ACM Conferences
    SIGMETRICS '92/PERFORMANCE '92: Proceedings of the 1992 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
    June 1992
    267 pages
    ISBN:0897915070
    DOI:10.1145/133057
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 June 1992
Published in SIGMETRICS Volume 20, Issue 1

Check for updates

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)93
  • Downloads (Last 6 weeks)9
Reflects downloads up to 12 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)IoT malware detection using static and dynamic analysis techniquesSecurity and Privacy10.1002/spy2.4447:6Online publication date: 19-Jul-2024
  • (2020)A trace-based study of SMB network file system workloads in an academic enterpriseProceedings of the 2020 Summer Simulation Conference10.5555/3427510.3427552(1-8)Online publication date: 20-Jul-2020
  • (2016)Offline Selective Data Deduplication for Primary Storage SystemsIEICE Transactions on Information and Systems10.1587/transinf.2015EDP7034E99.D:2(370-382)Online publication date: 2016
  • (2015)Performance Optimization for Managing Massive Numbers of Small Files in Distributed File SystemsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2014.237772026:12(3433-3448)Online publication date: 1-Dec-2015
  • (2014)Analysis of Fork/Join and Related Queueing SystemsACM Computing Surveys10.1145/262891347:2(1-71)Online publication date: 25-Aug-2014
  • (2012)A File Is Not a FileACM Transactions on Computer Systems10.1145/2324876.232487830:3(1-39)Online publication date: 1-Aug-2012
  • (2011)A file is not a fileProceedings of the Twenty-Third ACM Symposium on Operating Systems Principles10.1145/2043556.2043564(71-83)Online publication date: 23-Oct-2011
  • (2009)Effects of Data Scrubbing on Reliability in Storage SystemsIEICE Transactions on Information and Systems10.1587/transinf.E92.D.1639E92-D:9(1639-1649)Online publication date: 2009
  • (2009)Detailed analysis of I/O traces for large scale applications2009 International Conference on High Performance Computing (HiPC)10.1109/HIPC.2009.5433186(419-427)Online publication date: Dec-2009
  • (2006)Mirrored disk rouing and schedulingCluster Computing10.1007/s10586-006-0014-39:4(475-484)Online publication date: 1-Oct-2006
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media