[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1109/CCGRID.2007.51guideproceedingsArticle/Chapter ViewAbstractPublication PagesccgridConference Proceedingsconference-collections
Article

Exploiting Lustre File Joining for Effective Collective IO

Published: 14 May 2007 Publication History

Abstract

Lustre is a parallel file system that presents high aggregated IO bandwidth by striping file extents across many storage devices. However, our experiments indicate excessively wide striping can cause performance degradation. Lustre supports an innovative file joining feature that joins files in place. To mitigate striping overhead and benefit collective IO, we propose two techniques: split writing and hierarchical striping. In split writing, a file is created as separate subfiles, each of which is striped to only a few storage devices. They are joined as a single file at the file close time. Hierarchical striping builds on top of split writing and orchestrates the span of subfiles in a hierarchical manner to avoid overlapping and achieve the appropriate coverage of storage devices. Together, these techniques can avoid the overhead associated with large stripe width, while still being able to combine bandwidth available from many storage devices. We have prototyped these techniques in the ROMIO implementation of MPI-IO. Experimental results indicate that split writing and hierarchical striping can significantly improve the performance of Lustre collective IO in terms of both data transfer and management operations. On a Lustre file system configured with 46 object storage targets, our implementation improves collective write performance of a 16-process job by as much as 220%.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
CCGRID '07: Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid
May 2007
900 pages
ISBN:0769528333

Publisher

IEEE Computer Society

United States

Publication History

Published: 14 May 2007

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 07 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2020)Batch-file Operations to Optimize Massive Files AccessingACM Transactions on Storage10.1145/339428616:3(1-25)Online publication date: 16-Jul-2020
  • (2019)Performance of ODROID-MC1 for scientific flow problemsFuture Generation Computer Systems10.1016/j.future.2018.12.05995:C(149-162)Online publication date: 1-Jun-2019
  • (2018)Dac-ManProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.5555/3291656.3291753(1-13)Online publication date: 11-Nov-2018
  • (2018)Dac-ManProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC.2018.00075(1-13)Online publication date: 11-Nov-2018
  • (2017)Reducing I/O variability using dynamic I/O path characterization in petascale storage systemsThe Journal of Supercomputing10.1007/s11227-016-1904-773:5(2069-2097)Online publication date: 1-May-2017
  • (2016)An ephemeral burst-buffer file system for scientific applicationsProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.5555/3014904.3014997(1-12)Online publication date: 13-Nov-2016
  • (2016)The composite-file file systemProceedings of the 14th Usenix Conference on File and Storage Technologies10.5555/2930583.2930585(15-22)Online publication date: 22-Feb-2016
  • (2014)A user-friendly approach for tuning parallel file operationsProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1109/SC.2014.24(229-236)Online publication date: 16-Nov-2014
  • (2013)Structuring PLFS for extensibilityProceedings of the 8th Parallel Data Storage Workshop10.1145/2538542.2538564(20-26)Online publication date: 17-Nov-2013
  • (2013)Insights for exascale IO APIs from building a petascale IO APIProceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis10.1145/2503210.2503238(1-12)Online publication date: 17-Nov-2013
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media