DOI: 10.1145/1987816.1987836
Research article

Towards SIRF: self-contained information retention format

Published: 30 May 2011

Abstract

Many organizations are now required to preserve and maintain access to large volumes of digital content for decades. Preservation systems and processes are needed to support such long-term retention requirements and to keep those digital objects usable in the distant future, regardless of changes in technologies and designated communities. A key component of such preservation systems is the storage subsystem, where the digital objects reside for most of their lifecycle. We describe SIRF (Self-contained Information Retention Format), a logical storage container format specialized for long-term retention. SIRF includes a set of digital preservation objects and a catalog with metadata related to the entire contents of the container as well as to the individual objects and their interrelationships. SIRF is being developed by the Storage Networking Industry Association (SNIA) with the intention of creating a standardized, vendor-neutral storage format that will be interpretable by future preservation systems and that will simplify and reduce the costs of digital preservation.
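The container-plus-catalog idea in the abstract can be illustrated with a minimal sketch. This is not the SIRF specification: the class names, the field names, and the use of a SHA-256 fixity value are all assumptions made here for illustration. The sketch only shows the general shape of a container that holds preservation objects together with a catalog describing the container, each object, and the links between objects.

```python
import hashlib
import json
from dataclasses import asdict, dataclass, field
from typing import Dict, List


@dataclass
class CatalogEntry:
    """Per-object catalog record (hypothetical fields, not from the SIRF spec)."""
    object_id: str
    sha256: str  # fixity value computed when the object is added (assumption)
    related_to: List[str] = field(default_factory=list)  # ids of related objects


@dataclass
class Container:
    """Illustrative container: object payloads plus a self-describing catalog."""
    container_id: str
    objects: Dict[str, bytes] = field(default_factory=dict)
    catalog: Dict[str, CatalogEntry] = field(default_factory=dict)

    def add(self, object_id: str, payload: bytes, related_to=()) -> None:
        """Store the payload and record its catalog entry."""
        self.objects[object_id] = payload
        self.catalog[object_id] = CatalogEntry(
            object_id, hashlib.sha256(payload).hexdigest(), list(related_to)
        )

    def verify(self, object_id: str) -> bool:
        """Recompute the digest and compare it with the cataloged value."""
        expected = self.catalog[object_id].sha256
        return hashlib.sha256(self.objects[object_id]).hexdigest() == expected

    def catalog_json(self) -> str:
        """Serialize container-level and object-level metadata as JSON."""
        return json.dumps(
            {
                "container_id": self.container_id,
                "objects": [asdict(entry) for entry in self.catalog.values()],
            },
            indent=2,
        )


if __name__ == "__main__":
    box = Container("demo-container")
    box.add("report", b"annual report contents")
    box.add("report-context", b"description of the report", related_to=["report"])
    print(box.catalog_json())
```

Because the catalog travels with the objects, a future reader holding only the container could, under these assumptions, list what it holds, check each object against its recorded digest, and follow the recorded relationships without consulting an external database.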


Published In

SYSTOR '11: Proceedings of the 4th Annual International Conference on Systems and Storage
May 2011
189 pages
ISBN: 9781450307734
DOI: 10.1145/1987816

Sponsors

  • NetApp
  • Mellanox Technologies
  • Hewlett-Packard
  • Intel
  • Red Hat, Inc.
  • Marvell Technology Group
  • IBM

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. cloud storage
  2. digital archiving
  3. digital preservation
  4. long term retention
  5. metadata
  6. provenance
  7. standards
  8. storage container catalog
  9. storage subsystem

Qualifiers

  • Research-article


Acceptance Rates

SYSTOR '11 paper acceptance rate: 16 of 53 submissions (30%)
Overall acceptance rate: 108 of 323 submissions (33%)

Cited By

  • (2023) Towards Migration-Free "Just-in-Case" Data Archival for Future Cloud Data Lakes Using Synthetic DNA. Proceedings of the VLDB Endowment 16:8 (1923-1929). DOI: 10.14778/3594512.3594522. Online publication date: 22-Jun-2023
  • (2021) Cloud Services na perspectiva da Ciência da Informação: uma análise focada no uso de metadados. Informação & Informação 26:1 (459). DOI: 10.5433/1981-8920.2021v26n1p459. Online publication date: 31-Mar-2021
  • (2021) Cloud services e o padrão PREMIS. RDBCI Revista Digital de Biblioteconomia e Ciência da Informação 19. DOI: 10.20396/rdbci.v19i00.8661384. Online publication date: 5-Jan-2021
  • (2021) Cloud services e o padrão PREMIS. RDBCI Revista Digital de Biblioteconomia e Ciência da Informação 19 (1-21). DOI: 10.20396/rdbci.v19i0.8661384. Online publication date: 5-Jan-2021
  • (2019) On Value Preservation with Distributed Ledger Technologies, Intelligent Agents, and Digital Preservation. Blockchain and Applications (145-152). DOI: 10.1007/978-3-030-23813-1_18. Online publication date: 25-Jun-2019
  • (2015) Time Machine: Projecting the Digital Assets onto the Future Simulation Environment. Advances in Practical Applications of Agents, Multi-Agent Systems, and Sustainability: The PAAMS Collection (175-186). DOI: 10.1007/978-3-319-18944-4_15. Online publication date: 21-May-2015
  • (2014) A Proposal for a Reference Architecture for Long-Term Archiving, Preservation, and Retrieval of Big Data. Proceedings of the 2014 IEEE 13th International Conference on Trust, Security and Privacy in Computing and Communications (622-629). DOI: 10.1109/TrustCom.2014.80. Online publication date: 24-Sep-2014
  • (2013) Investigating the Needs, Capabilities and Decision Making Mechanisms in Digital Preservation. Information Resources Management Journal 26:3 (17-39). DOI: 10.4018/irmj.2013070102. Online publication date: Jul-2013
  • (2013) Sustaining accessibility of information through digital preservation: A literature review. Journal of Information Science 39:4 (442-458). DOI: 10.1177/0165551513480107. Online publication date: 18-Mar-2013
