[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1109/IPDPS.2014.27guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

CALCioM: Mitigating I/O Interference in HPC Systems through Cross-Application Coordination

Published: 19 May 2014 Publication History

Abstract

Unmatched computation and storage performance in new HPC systems have led to a plethora of I/O optimizations ranging from application-side collective I/O to network and disk-level request scheduling on the file system side. As we deal with ever larger machines, the interference produced by multiple applications accessing a shared parallel file system in a concurrent manner becomes a major problem. Interference often breaks single-application I/O optimizations, dramatically degrading application I/O performance and, as a result, lowering machine wide efficiency. This paper focuses on CALCioM, a framework that aims to mitigate I/O interference through the dynamic selection of appropriate scheduling policies. CALCioM allows several applications running on a supercomputer to communicate and coordinate their I/O strategy in order to avoid interfering with one another. In this work, we examine four I/O strategies that can be accommodated in this framework: serializing, interrupting, interfering and coordinating. Experiments on Argonne's BG/P Surveyor machine and on several clusters of the French Grid'5000 show how CALCioM can be used to efficiently and transparently improve the scheduling strategy between two otherwise interfering applications, given specified metrics of machine wide efficiency.

Cited By

View all
  • (2024)Tango: A Cross-layer Approach to Managing I/O Interference over Local Ephemeral StorageProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC41406.2024.00020(1-15)Online publication date: 17-Nov-2024
  • (2023)HadaFSProceedings of the 21st USENIX Conference on File and Storage Technologies10.5555/3585938.3585952(215-230)Online publication date: 21-Feb-2023
  • (2023)I/O Access Patterns in HPC Applications: A 360-Degree SurveyACM Computing Surveys10.1145/361100756:2(1-41)Online publication date: 15-Sep-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
IPDPS '14: Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium
May 2014
1176 pages
ISBN:9781479938001

Publisher

IEEE Computer Society

United States

Publication History

Published: 19 May 2014

Author Tag

  1. I/O, Parallel File Systems, Cross-Application Contention, Interference, CALCioM

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 20 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Tango: A Cross-layer Approach to Managing I/O Interference over Local Ephemeral StorageProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC41406.2024.00020(1-15)Online publication date: 17-Nov-2024
  • (2023)HadaFSProceedings of the 21st USENIX Conference on File and Storage Technologies10.5555/3585938.3585952(215-230)Online publication date: 21-Feb-2023
  • (2023)I/O Access Patterns in HPC Applications: A 360-Degree SurveyACM Computing Surveys10.1145/361100756:2(1-41)Online publication date: 15-Sep-2023
  • (2023)Uncovering I/O demands on HPC platformsJournal of Parallel and Distributed Computing10.1016/j.jpdc.2023.104744182:COnline publication date: 1-Dec-2023
  • (2020)Taming I/O variation on QoS-less HPC storageProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.5555/3433701.3433715(1-13)Online publication date: 9-Nov-2020
  • (2020)GIFTProceedings of the 18th USENIX Conference on File and Storage Technologies10.5555/3386691.3386702(103-120)Online publication date: 24-Feb-2020
  • (2020)I/O performance of the Santos Dumont supercomputerInternational Journal of High Performance Computing Applications10.1177/109434201986852634:2(227-245)Online publication date: 1-Mar-2020
  • (2020)CARD: A Congestion-Aware Request Dispatching Scheme for Replicated Metadata Server ClusterProceedings of the 49th International Conference on Parallel Processing10.1145/3404397.3404411(1-11)Online publication date: 17-Aug-2020
  • (2020)Mapping and scheduling HPC applications for optimizing I/OProceedings of the 34th ACM International Conference on Supercomputing10.1145/3392717.3392764(1-12)Online publication date: 29-Jun-2020
  • (2020)A Survey and Classification of Software-Defined Storage SystemsACM Computing Surveys10.1145/338589653:3(1-38)Online publication date: 28-May-2020
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media