Research article. DOI: 10.1145/3219104.3219120

Understanding I/O Bottlenecks and Tuning for High Performance I/O on Large HPC Systems: A Case Study

Published: 22 July 2018

Abstract

As we move towards peta- to exascale machines, large-scale physics-based simulations are expected to generate large amounts of I/O traffic, driven by unprecedented growth in the volume and types of data. To tune and improve overall application performance, it is imperative to understand and characterize the I/O behavior of scientific applications, including complex checkpoint/restart options, on different hardware-software configurations such as large shared parallel file systems, node-local flash, and burst buffer technologies. In this work, we study the I/O behavior of WRF, a widely used scientific application for atmospheric research and operational weather forecasting, on high performance computing systems. WRF provides a rich collection of I/O strategies, such as different parallel I/O libraries (PnetCDF, NetCDF) and I/O quilting options with these libraries, as well as configurable I/O "knobs" that can be used to modify the I/O frequency. We evaluate the effectiveness of various I/O strategies within WRF, in conjunction with parallel file system parameter tuning, on the Comet and Stampede2 HPC systems. We discuss the impact of the various parallel I/O strategies and show how an I/O profiling tool can be used to analyze anomalous parallel I/O behavior. Overall, we discuss the tuning and performance insights gained from our evaluations.

    Published In

    PEARC '18: Proceedings of the Practice and Experience on Advanced Research Computing: Seamless Creativity
    July 2018
    652 pages
    ISBN: 9781450364461
    DOI: 10.1145/3219104
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. Parallel I/O
    2. WRF
    3. asynchronous I/O

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    PEARC '18

    Acceptance Rates

    PEARC '18 paper acceptance rate: 79 of 123 submissions (64%)
    Overall acceptance rate: 133 of 202 submissions (66%)

    Article Metrics

    • Downloads (last 12 months): 14
    • Downloads (last 6 weeks): 1
    Reflects downloads up to 31 Dec 2024

    Cited By

    • (2023) An Asynchronous Parallel I/O Framework for Mass Conservation Ocean Model. Applied Sciences 13:24 (13230). DOI: 10.3390/app132413230. Online publication date: 13-Dec-2023.
    • (2023) I/O in WRF: A Case Study in Modern Parallel I/O Techniques. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (1-13). DOI: 10.1145/3581784.3613216. Online publication date: 12-Nov-2023.
    • (2022) Access Patterns and Performance Behaviors of Multi-layer Supercomputer I/O Subsystems under Production Load. Proceedings of the 31st International Symposium on High-Performance Parallel and Distributed Computing (43-55). DOI: 10.1145/3502181.3531461. Online publication date: 27-Jun-2022.
    • (2022) Development of an equation-based parallelization method for multiphase particle-in-cell simulations. Engineering with Computers 39:5 (3577-3591). DOI: 10.1007/s00366-022-01768-6. Online publication date: 22-Dec-2022.
    • (2020) Optimizing Discrete Simulations of the Spread of HIV-1 to Handle Billions of Cells on a Workstation. Proceedings of the 2020 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation (67-78). DOI: 10.1145/3384441.3395987. Online publication date: 15-Jun-2020.
