[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3383583.3398624acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
abstract

Multi-tenancy Cloud Access and Preservation: Virginia Tech Digital Libraries Platform

Published: 01 August 2020 Publication History

Abstract

Virginia Tech Libraries has developed a cloud-native, microservervices-based digital libraries platform to consolidate diverse access and preservation infrastructure into a set of flexible, independent microservices in Amazon Web Services. We have been an implementer and contributor to various community digital library and repository projects including DSpace, Fedora, and Samvera3. However, the complexity and cost of maintaining disparate application stacks have reduced our capacity to build new infrastructure. Virginia Tech has a long history of participation in and contribution to community-driven Open Source projects and has, in that time, developed more than a dozen independent applications architected on these stacks. The cost of independently addressing vulnerabilities, which often requires work to mitigate incompatibilities; reworking each application to comply with developing branding guidelines; and feature development and improvement has burgeoned, threatening to overwhelm our capacity. Like many of our peers5, our maintenance obligations have made continued growth unsustainable and have pushed older applications to near abandonware. We have designed and developed the Digital Libraries Platform to address these concerns thus reducing our maintenance obligations and costs associated with feature development across digital libraries. This approach represents a departure from the monolithic architectures of our legacy systems and, as such, shares more infrastructure among individual digital library implementations. The shared infrastructure facilitates rapid inclusion of new and improved features into each digital library instance. New features can be developed independent of any digital library instance and integrated into that instance by inclusion of that feature in the React/Amplify template. Changes to the template super class, such as those necessitated by evolving branding guidelines, may be immediately inherited by the template instances that subscribe to it. The platform implements Terraform6 deployment templates, Lambda serverless functions, and other cloud assets to form a microservices architecture on which multiple template-based sites are built. Individual sites are configured in AWS DynamoDB, Amazon's NoSQL database service, and via modification of shared template. Additional services provide digital preservation support including auditing, file fixity validation, replication to external cloud storage providers, file format characterization, and deposit to third-party preservation services. This presentation also discusses the cost of operating these services in AWS and strategies for mitigating those costs. These strategies include containerization to allow deployment of high-cost, asynchronous services to local infrastructure to take full advantage of existing infrastructure and advantageous utility pricing while allowing for local redeployment. In the past, developers worked in local, independent environments. New features and fixes were submitted to a central development environment testing and validation, which significantly slowed development. Migrating development, review, integration, and deployment processes to AWS decreased the time and resource bottlenecks for those processes. Our AWS cost accounting demonstrates an 87% savings over our traditional, on-premises Fedora/Samvera approach For a team of four software developers, the total cost using a traditional server-based (a t2-medium EC2 instance) development approach is about $133 per month versus our serverless-based development approach using AWS Amplify at an average of $17 per month. As the Digital Libraries Platform project expands, we anticipate publishing a set of API documents allowing us and others to reimplement specific microservices independent of the architecture.

References

[1]
DSpace. 2020. DSpace - A Turnkey Institutional Repository. Retrieved April 6, 2020 from https://duraspace.org/dspace/
[2]
Fedora. 2020. Retrieved April 6, 2020 from Fedora - The Flexible, Modular Open-Source Repository. Retrieved from https://duraspace.org/dspace/
[3]
Samvera. 2020. Samvera - a welcoming, vibrant community. Retrieved April 6, 2020 from https://samvera.org/
[4]
Virginia Tech Digital Libraries Github. 2020. Retrieved June 15, 2020 from https://github.com/vt-digital-libraries-platform
[5]
Michael Giarlo and Justin Coyne. 2019. Sustaining a Large-Scale Repository Architecture: Behind the Scenes of the Stanford Digital Repository. Retrieved April 6, 2020 from https://www.conftool.net/or2019/index.php?page=browseSessions&form_session=362
[6]
Terraform. 2020. Terraform by HashiCorp. Retrieved April 6, 2020 from https://www.terraform.io/
[7]
PREMIS. 2018. PREMIS Data Dictionary for Preservation Metadata. Retrieved April 6, 2020 from https://www.loc.gov/standards/premis/
[8]
MetaArchive. 2020. MetaArchive: A Digital Preservation Network. Retrieved April 6, 2020 from https://metaarchive.org/
[9]
Academic Preservation Trust. 2020. Retrieved April 6, 2020 from http://aptrust.org/
[10]
File Information Toolset (FITS). 2020. Retrieved April 6, 2020 from https://projects.iq.harvard.edu/fits/
[11]
Nice Opaque Identifiers. 2006. Nice Opaque Identifiers. Retrieved April 6, 2020 from https://metacpan.org/pod/distribution/Noid/noid
[12]
International Image Interoperability Framework. 2020. Home IIIIF. Retrieved April 6, 2020 from https://iiif.io/
[13]
John Kunze. 2013. The ARK Identifier Scheme. Retrieved April 6, 2020 from https://tools.ietf.org/html/draft-kunze-ark-18

Cited By

View all
  • (2022)Enhanced Security and Privacy Issue in Multi-Tenant Environment of Green Computing Using Blockchain TechnologyBlockchain Applications in the Smart Era10.1007/978-3-030-89546-4_4(65-83)Online publication date: 20-Apr-2022

Index Terms

  1. Multi-tenancy Cloud Access and Preservation: Virginia Tech Digital Libraries Platform

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020
    August 2020
    611 pages
    ISBN:9781450375856
    DOI:10.1145/3383583
    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 August 2020

    Check for updates

    Author Tags

    1. cloud computing
    2. digital libraries
    3. digital preservation
    4. microservice

    Qualifiers

    • Abstract

    Conference

    JCDL '20
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 415 of 1,482 submissions, 28%

    Upcoming Conference

    JCDL '24
    The 2024 ACM/IEEE Joint Conference on Digital Libraries
    December 16 - 20, 2024
    Hong Kong , China

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 11 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Enhanced Security and Privacy Issue in Multi-Tenant Environment of Green Computing Using Blockchain TechnologyBlockchain Applications in the Smart Era10.1007/978-3-030-89546-4_4(65-83)Online publication date: 20-Apr-2022

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media