[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/3019046.3019047acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

Scientific workflows at datawarp-speed: accelerated data-intensive science using NERSC's burst buffer

Published: 13 November 2016 Publication History

Abstract

Emerging exascale systems have the ability to accelerate the time-to-discovery for scientific workflows. However, as these workflows become more complex, their generated data has grown at an unprecedented rate, making I/O constraints challenging. To address this problem advanced memory hierarchies, such as burst buffers, have been proposed as intermediate layers between the compute nodes and the parallel file system. In this paper, we utilize Cray DataWarp burst buffer coupled with in-transit processing mechanisms, to demonstrate the advantages of advanced memory hierarchies in preserving traditional coupled scientific workflows. We consider in-transit workflow which couples simulation of subsurface flows with on-the-fly flow visualization. With respect to the proposed workflow, we study the performance of the Cray DataWarp Burst Buffer and provide a comparison with the Lustre parallel file system.

References

[1]
K.-L. Ma, C. Wang, H. Yu, and A. Tikhonova, "In-situ processing and visualization for ultrascale simulations," Journal of Physics: Conference Series, vol. 78, no. 1, P. 012043, 2007. {Online}. Available: http://stacks.iop.org/1742-6596/78/i=1/a=012043
[2]
A. Kageyama and T. Yamada, "An approach to exascale visualization: Interactive viewing of in-situ visualization," Computer Physics Communications, vol. 185, no. 1, pp. 79--85, 2014.
[3]
P. Kogge and J. Shalf, "Exascale computing trends: Adjusting to the," Computing in Science & Engineering, vol. 15, no. 6, pp. 16--26, 2013.
[4]
F. Zheng, H. Abbasi, C. Docan, J. Lofstead, S. Klasky, Q. Liu, M. Parashar, N. Podhorszki, K. Schwan, and M. Wolf, "PreDatA - preparatory data analytics on peta-scale machines," in Proc. of 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS'10), April 2010.
[5]
H. Abbasi, M. Wolf, G. Eisenhauer, S. Klasky, K. Schwan, and F. Zheng, "Datastager: scalable data staging services for petascale applications," in Proc. 18th International Symposium on High Performance Distributed Computing (HPDC'09), 2009.
[6]
C. Docan, M. Parashar, and S. Klasky, "DataSpaces: An Interaction and Coordination Framework for Coupled Simulation Workflows," in Proc. of 19th International Symposium on High Performance and Distributed Computing (HPDC'10), June 2010.
[7]
W. Bhimji, D. Bard, D. Paul, M. Romanus, A. Ovsyannikov, B. Friesen, M. Bryson, J. Correa, G. Lockwood, V. Tsulaia, S. Byna, S. Farrell, C. Daley, V. Beckner, B. V. Straalen, D. Trebotich, C. Tull, G. Weber, N. Wright, K. Antypas, and Prabhat, "Accelerating science with the NERSC Burst Buffer Early User Program," Cray User Group Meeting, 2016.
[8]
The HDF Group. (2000-2010) Hierarchical data format version 5. {Online}. Available: http://www.hdfgroup.org/HDF5
[9]
J. Lofstead, F. Zheng, S. Klasky, and K. Schwan, "Adaptable, Metadata Rich IO Methods for Portable High Performance IO," in Proc. 23th IEEE International Parallel and Distributed Processing Symposium (IPDPS'09), May 2009.
[10]
M. A. Jette, A. B. Yoo, and M. Grondona, "Slurm: Simple linux utility for resource management," in In Lecture Notes in Computer Science: Proceedings of Job Scheduling Strategies for Parallel Processing (JSSPP) 2003. Springer-Verlag, 2002, pp. 44--60.
[11]
M. Romanus, R. B. Ross, and M. Parashar, "Challenges and considerations for utilizing burst buffers in high-performance computing," CoRR, vol. abs/1509.05492, 2015. {Online}. Available: http://arxiv.org/abs/1509.05492
[12]
D. Trebotich, M. F. Adams, S. Molins, C. I. Steefel, and S. Chaopeng, "High-resolution simulation of pore-scale reactive transport processes associated with carbon sequestration," Computing in Science & Engineering, vol. 16, no. 6, pp. 22--31, 2014.
[13]
S. Molins, D. Trebotich, C. I. Steefel, and C. Shen, "An investigation of the effect of pore scale flow on average geochemical reaction rates using direct numerical simulation," Water Resour. Res., vol. 48, no. 3, pp. 43--82, 2012.
[14]
M. Adams, P. Colella, D. T. Graves, J. Johnson, N. Keen, T. J. Ligocki, D. F. Martin, P. McCorquodale, D. Modiano, P. Schwartz, T. Sternberg, and B. V. Straalen, Chombo Software Package for AMR Applications, Design Document. Lawrence Berkeley National Laboratory Technical Report LBNL-6616E.
[15]
C. I. Steefel, CrunchFlow Users Manual. Lawrence Berkeley National Laboratory, 2008.
[16]
S. Molins, D. Trebotich, L. Yang, J. B. Ajo-Franklin, T. J. Ligocki, C. Shen, and C. Steefel, "Pore-scale controls on calcite dissolution rates from flow-through laboratory and numerical experiments," Environmental Science and Technology, 2014.
[17]
D. Trebotich and D. Graves, "An adaptive finite volume method for the incompressible Navier-Stokes equations in complex geometries," Communications in Applied Mathematics and Computational Science, vol. 10, no. 1, pp. 43--82, 2015.
[18]
B. Pollak, "Portable operating system interface (posix)-part 1x: Realtime distributed systems communication application program interface (api)," IEEE Standard P, vol. 1003.
[19]
J. B. Ajo-Franklin, M. Voltolini, S. Molins, and L. Yang, "Coupled processes in a fractured reactive system: A dolomite dissolution study with relevance to gcs caprock integrity," Caprock Integrity in Geological Carbon Storage, Submitted December 2015. In review.

Cited By

View all
  • (2020)GIFTProceedings of the 18th USENIX Conference on File and Storage Technologies10.5555/3386691.3386702(103-120)Online publication date: 24-Feb-2020
  • (2019)WatCacheThe Journal of Supercomputing10.1007/s11227-017-2167-775:2(554-586)Online publication date: 1-Feb-2019
  • (2017)Performance analysis of emerging data analytics and HPC workloadsProceedings of the 2nd Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems10.1145/3149393.3149400(43-48)Online publication date: 12-Nov-2017

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
PDSW-DISCS '16: Proceedings of the 1st Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems
November 2016
66 pages
ISBN:9781509052165

Sponsors

In-Cooperation

Publisher

IEEE Press

Publication History

Published: 13 November 2016

Check for updates

Qualifiers

  • Research-article

Conference

SC16
Sponsor:

Acceptance Rates

Overall Acceptance Rate 17 of 41 submissions, 41%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 04 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2020)GIFTProceedings of the 18th USENIX Conference on File and Storage Technologies10.5555/3386691.3386702(103-120)Online publication date: 24-Feb-2020
  • (2019)WatCacheThe Journal of Supercomputing10.1007/s11227-017-2167-775:2(554-586)Online publication date: 1-Feb-2019
  • (2017)Performance analysis of emerging data analytics and HPC workloadsProceedings of the 2nd Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems10.1145/3149393.3149400(43-48)Online publication date: 12-Nov-2017

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media