[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/1413370.1413427acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

A scalable parallel framework for analyzing terascale molecular dynamics simulation trajectories

Published: 15 November 2008 Publication History

Abstract

As parallel algorithms and architectures drive the longest molecular dynamics (MD) simulations towards the millisecond scale, traditional sequential post-simulation data analysis methods are becoming increasingly untenable. Inspired by the programming interface of Google's MapReduce, we have built a new parallel analysis framework called HiMach, which allows users to write trajectory analysis programs sequentially, and carries out the parallel execution of the programs automatically. We introduce (1) a new MD trajectory data analysis model that is amenable to parallel processing, (2) a new interface for defining trajectories to be analyzed, (3) a novel method to make use of an existing sequential analysis tool called VMD, and (4) an extension to the original MapReduce model to support multiple rounds of analysis. Performance evaluations on up to 512 cores demonstrate the efficiency and scalability of the HiMach framework on a Linux cluster.

References

[1]
Y.-S. Hwang, R. Das, J. H. Saltz, M. Hodoscek, B. R. Brooks, "Parallelizing Molecular Dynamics Programs for Distributed-Memory Machines," IEEE Computational Science and Engineering, vol. 02, no. 2, pp. 18--29, 1995.
[2]
J. C. Phillips, R. Braun, W. Wang, J. Gumbart, E. Tajkhorshid, E. Villa, C. Chipot, R. D. Skeel, L. Kalé, and K. Schaulten, "Scalable molecular dynamics with NAMD," J. Comp. Chem., vol. 26, no. 16, pp. 1781--1802, 2005.
[3]
D. A. Case, T. E. Cheatham III, T. Darden, H. Gohlke, R. Luo, K. M. Merz Jr., A. Onufriev, C. Simmerling, B. Wang, and R. J. Woods, "The Amber biomolecular simulation programs," J. Comp. Chem., vol. 26, no. 16, pp. 1668--1688, 2005.
[4]
M. Christen, P. H. Hünenberger, D. Bakowies, R. Baron, R. Bürgi, D. P. Geerke, T. N. Heinz, M. A. Kastenholz, V. Kräutler, C. Oostenbrink, C. Peter, D. Trzesniak, and W. F. van Gunsteren, "The GROMOS software for biomolecular simulation: GROMOS05," J. Comp. Chem., vol. 26, no. 16, pp. 1719--1751, 2005.
[5]
K. J. Bowers, E. Chow, H. Xu, R. O. Dror, M. P. Eastwood, B. A. Gregersen, J. L. Klepeis, I. Kolossváry, M. A. Moraes, F. D. Sacerdoti, J. K. Salmon, Y. Shan, and D. E. Shaw, "Scalable algorithms for molecular dynamics simulations on commodity clusters," in Proc. 2006 ACM/IEEE Conf. on Supercomputing (SC06), Tampa, FL, November 2006.
[6]
B. Hess, C. Kutzner, D. van der Spoel, and E. Lindahl, "GROMACS 4: Algorithms for highly efficient, load-balanced, and scalable molecular simulation," J. Chem. Theory and Comp., vol. 4, no. 3, pp. 435--447, 2008.
[7]
B. G. Fitch, A. Rayshubskiy, M. Eleftheriou, T. J. C. Ward, M. E. Giampapa, M. C. Pitman, J. W. Pitera, W. C. Swope, and R. S. Germain, "Blue Matter: Scaling of N-body simulations to one atom per node," IBM J. Research and Develop., vol. 52, no. 1/2, pp. 145--158, 2008.
[8]
R. Fine, G. Dimmler, and C. Levinthal, "FASTRUN: A special purpose, hardwired computer for molecular simulation," Proteins, vol. 11, no. 4, pp. 242--253, 1991, (erratum: 14(3): 421--422, 1992).
[9]
S. Toyoda, H. Miyagawa, K. Kitamura, T. Amisaki, E. Hashimoto, H. Ikeda, A. Kusumi, and N. Miyakawa, "Development of MD Engine: High-speed accelerator with parallel processor design for molecular dynamics simulations," J. Comp. Chem., vol. 20, no. 2, pp. 185--199, 1999.
[10]
M. Taiji, T. Narumi, Y. Ohno, N. Futatsugi, A. Suenaga, N. Takada, and A. Konagaya, "Protein Explorer: A petaflops special-purpose computer system for molecular dynamics simulations," in Proc. 2003 ACM/IEEE Conf. on Supercomputing (SC03), Phoenix, AZ, November 2003.
[11]
D. E. Shaw, M. M. Deneroff, R. O. Dror, J. S. Kuskin, R. H. Larson, J. K. Salmon, C. Young, B. Batson, K. J. Bowers, J. C. Chao, M. P. Eastwood, J. Gagliardo, J. P. Grossman, R. C. Ho, D. J. Ierardi, I. Kolossváry, J. L. Klepeis, T. Layman, C. McLeavey, M. A. Moraes, R. Mueller, E. C. Priest, Y. Shan, J. Spengler, M. Theobald, B. Towles, and S. C. Wang, "Anton, a special-purpose machine for molecular dynamics simulation," in Proc. 34th Ann. Intl. Symp. on Computer Architecture, San Diego, June 2007, pp. 1--12.
[12]
K. Barker, K. Davis, A. Hoisie, D. Kerbyson, M. Lang, S. Pakin, and J. C. Sancho, "Entering the petaflop era: The architecture and performance of roadrunner," in Proc. 2008 ACM/IEEE Conf. on Supercomputing (SC08), Austin, TX, November 2008.
[13]
J. Dean and S. Ghemawat, "MapReduce: Simplified data processing on large clusters," in Proc. 6th Symp. on Operating System Design and Implementation (OSDI04), San Francisco, CA, December 2004.
[14]
J. Dean and S. Ghemawat,"MapReduce: Simplified data processing on large clusters," Comm. ACM, vol. 51, no. 1, pp. 107--113, 2008.
[15]
W. Humphrey, A. Dalke, and K. Schulten, "VMD---Visual Molecular Dynamics," J. Mol. Graphics, vol. 14, no. 1, pp. 33--38, 1996.
[16]
PDB File Format, http://www.rcsb.org/pdb/home/home.do.
[17]
J. S. Vitter, "External memory algorithms and data structures: Dealing with massive data," ACM Computing Surveys, vol. 33, no. 2, pp. 209--271, 2001.
[18]
Hadoop, http://hadoop.apache.org/core/.
[19]
S. Ghemawat, H. Gobioff, and S.-T. Leung, "The Google file system," in Proc. 19th ACM Symp. Operating Systems Principles, Bolton Landing, NY, October 2003.
[20]
pyMPI, http://pympi.sourceforge.net/.
[21]
P. H. Carns, W. B. Ligon III, R. B. Ross, and R. Thakur, "PVFS: A parallel file system for Linux clusters," in Proc. 4th Ann. Linux Showcase and Conf., Atlanta, GA, October 2000.
[22]
PVFS, http://www.pvfs.org/.

Cited By

View all
  • (2019)M3XProceedings of the 2019 USENIX Conference on Usenix Annual Technical Conference10.5555/3358807.3358859(617-631)Online publication date: 10-Jul-2019
  • (2019)Modeling high-throughput applications for in situ analyticsInternational Journal of High Performance Computing Applications10.1177/109434201984726333:6(1185-1200)Online publication date: 1-Nov-2019
  • (2018)KeyBin2Proceedings of the 47th International Conference on Parallel Processing10.1145/3225058.3225149(1-10)Online publication date: 13-Aug-2018
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SC '08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing
November 2008
739 pages
ISBN:9781424428359

Sponsors

Publisher

IEEE Press

Publication History

Published: 15 November 2008

Check for updates

Qualifiers

  • Research-article

Conference

SC '08
Sponsor:

Acceptance Rates

SC '08 Paper Acceptance Rate 59 of 277 submissions, 21%;
Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)1
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2019)M3XProceedings of the 2019 USENIX Conference on Usenix Annual Technical Conference10.5555/3358807.3358859(617-631)Online publication date: 10-Jul-2019
  • (2019)Modeling high-throughput applications for in situ analyticsInternational Journal of High Performance Computing Applications10.1177/109434201984726333:6(1185-1200)Online publication date: 1-Nov-2019
  • (2018)KeyBin2Proceedings of the 47th International Conference on Parallel Processing10.1145/3225058.3225149(1-10)Online publication date: 13-Aug-2018
  • (2018)Task-parallel Analysis of Molecular Dynamics TrajectoriesProceedings of the 47th International Conference on Parallel Processing10.1145/3225058.3225128(1-10)Online publication date: 13-Aug-2018
  • (2017)Spline-based decomposition of streamed particle trajectories for efficient transfer and analysisProceedings of the European Association for Computer Graphics: Short Papers10.2312/egsh.20171010(41-44)Online publication date: 24-Apr-2017
  • (2017)MelissaProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3126908.3126922(1-14)Online publication date: 12-Nov-2017
  • (2015)SmartProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/2807591.2807650(1-12)Online publication date: 15-Nov-2015
  • (2014)Petascale Tcl with NAMD, VMD, and Swift/TProceedings of the 1st First Workshop for High Performance Technical Computing in Dynamic Languages10.1109/HPTCDL.2014.7(6-17)Online publication date: 16-Nov-2014
  • (2014)A flexible framework for asynchronous in situ and in transit analytics for scientific simulationsProceedings of the 14th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing10.1109/CCGrid.2014.92(277-286)Online publication date: 26-May-2014
  • (2011)Simplified parallel domain traversalProceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/2063384.2063397(1-11)Online publication date: 12-Nov-2011
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media