No abstract available.
Disk failures in the real world: what does an MTTF of 1,000,000 hours mean to you?
Component failure in large-scale IT installations is becoming an ever larger problem as the number of components in a single cluster approaches a million.
In this paper, we present and analyze field-gathered disk replacement data from a number of large ...
Failure trends in a large disk drive population
It is estimated that over 90% of all new information produced in the world is being stored on magnetic media, most of it on hard disk drives. Despite their importance, there is relatively little published work on the failure patterns of disk drives, and ...
A five-year study of file-system metadata
For five years, we collected annual snapshots of filesystem metadata from over 60,000 Windows PC file systems in a large corporation. In this paper, we use these snapshots to study temporal changes in file size, file age, file-type frequency, directory ...
Proportional-share scheduling for distributed storage systems
Fully distributed storage systems have gained popularity in the past few years because of their ability to use cheap commodity hardware and their high scalability. While there are a number of algorithms for providing differentiated quality of service to ...
Argon: performance insulation for shared storage servers
Services that share a storage system should realize the same efficiency, within their share of time, as when they have the system to themselves. The Argon storage server explicitly manages its resources to bound the inefficiency arising from inter-...
Strong accountability for network storage
This paper presents the design, implementation, and evaluation of CATS, a network storage service with strong accountability properties. A CATS server annotates read and write responses with evidence of correct execution, and offers audit and challenge ...
On the scalability of storage sub-system back-end network
The aim of this on-going work is to study the scalability of the back-end network of storage sub-systems in terms of the number of disks that can be linked to the network. It is well known that without considering the limitation of back-end network, ...
SeFS: Unleashing the power of full-text search on file systems
Dropping prices in storage space capacity combined with a constant increase in the accumulated amount of digitized content create an unprecedented need for full-text search automation in local and distributed file systems. The current state of the art ...
FlexiCache: a flexible interface for customizing Linux file system buffer cache replacement policies
Recently, there has been a growing trend towards designing advanced file system buffer cache replacement algorithms such as ARC [1], PCC [2], and LIRS [3]. These algorithms perform much better than standard caching algorithms available in modern ...
CompulsiveFS: making NVRAM suitable for extremely reliable storage
Byte-addressable, non-volatile memory (NVRAM) technologies such as magnetoresistive random access memory and phase-change memory have recently emerged as viable competitors to Flash RAM. These new technologies have the ability to improve the performance,...
Performance evaluation of RAID6 systems
The aim of this on-going work is to study the performance of RAID6 protected storage systems under a Storage Performance Council-1 (SPC-1) benchmark based workload [3]. With the scale of modern storage systems becoming increasingly large, the ...
Design and implementation of verifiable audit trails for a versioning file system
We present constructs that create, manage, and verify digital audit trails for versioning file systems. Based upon a small amount of data published to a third party, a file system commits to a version history. At a later date, an auditor uses the ...
Architectures for controller based CDP
Continuous Data Protection (CDP) is a recent storage technology which enables reverting the state of the storage to previous points in time. We propose four alternative architectures for supporting CDP in a storage controller, and compare them ...
Jumbo store: providing efficient incremental upload and versioning for a utility rendering service
We have developed a new storage system called the Jumbo Store (JS) based on encoding directory tree snapshots as graphs called HDAGs whose nodes are small variable-length chunks of data and whose edges are hash pointers. We store or transmit each node ...
Data ONTAP GX: a scalable storage cluster
Data ONTAP GX is a clustered Network Attached File server composed of a number of cooperating filers. Each filer manages its own local file system, which consists of a number of disconnected flexible volumes. A separate namespace infrastructure runs ...
Trace: parallel trace replay with approximate causal events
- Michael P. Mesnier,
- Matthew Wachs,
- Raja R. Sambasivan,
- Julio Lopez,
- James Hendricks,
- Gregory R. Ganger,
- David O'Hallaron
//TRACE is a new approach for extracting and replaying traces of parallel applications to recreate their I/O behavior. Its tracing engine automatically discovers internode data dependencies and inter-I/O compute times for each node (process) in an ...
Karma: know-it-all replacement for a multilevel cache
Multilevel caching, common in many storage configurations, introduces new challenges to traditional cache management: data must be kept in the appropriate cache and replication avoided across the various cache levels. Some existing solutions focus on ...
AMP: adaptive multi-stream prefetching in a shared cache
Prefetching is a widely used technique in modern data storage systems. We study the most widely used class of prefetching algorithms known as sequential prefetching. There are two problems that plague the state-of-the-art sequential prefetching ...
Nache: design and implementation of a caching proxy for NFSv4
In this paper, we present Nache, a caching proxy for NFSv4 that enables a consistent cache of a remote NFS server to be maintained and shared across multiple local NFS clients. Nache leverages the features of NFSv4 to improve the performance of file ...
TFS: a transparent file system for contributory storage
Contributory applications allow users to donate unused resources on their personal computers to a shared pool. Applications such as SETI@home, Folding@home, and Freenet are now in wide use and provide a variety of services, including data processing and ...
Cobalt: separating content distribution from authorization in distributed file systems
How should a distributed file system manage access to protected content? On one hand, distributed storage should make data access pervasive: authorized users should be able to access their data from any location. On the other hand, content protection is ...
PARAID: a gear-shifting power-aware RAID
Reducing power consumption for server computers is important, since increased energy usage causes increased heat dissipation, greater cooling requirements, reduced computational density, and higher operating costs. For a typical data center, storage ...
Index Terms
- Proceedings of the 5th USENIX conference on File and Storage Technologies