[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article

MilkyWay-2 supercomputer: system and application

Published: 01 June 2014 Publication History

Abstract

On June 17, 2013, MilkyWay-2 (Tianhe-2) supercomputer was crowned as the fastest supercomputer in the world on the 41th TOP500 list. This paper provides an overview of the MilkyWay-2 project and describes the design of hardware and software systems. The key architecture features of MilkyWay-2 are highlighted, including neo-heterogeneous compute nodes integrating commodity-off-the-shelf processors and accelerators that share similar instruction set architecture, powerful networks that employ proprietary interconnection chips to support the massively parallel message-passing communications, proprietary 16-core processor designed for scientific computing, efficient software stacks that provide high performance file system, emerging programming model for heterogeneous systems, and intelligent system administration. We perform extensive evaluation with wide-ranging applications from LINPACK and Graph500 benchmarks to massively parallel software deployed in the system.

References

[1]
Yang X J, Liao X K, Lu K, Hu Q F, Song J Q, Su J S. The Tianhe-1a supercomputer: its hardware and software. Journal of Computer Science and Technology, 2011, 26(3): 344---351
[2]
Zhang H, Wang K, Zhang J, Wu N, Dai Y. A fast and fair shared buffer for high-radix router. Journal of Circuits, Systems, and Computers, 2013
[3]
Kirk D. Nvidia cuda software and GPU parallel computing architecture. In: Proceedings of the 6th International Symposium on Memory Management. 2007, 103---104
[4]
Sherlekar S. Tutorial: Intel many integrated core (MIC) architecture. In: Proceedings of the 18th IEEE International Conference on Parallel and Distributed Systems. 2012, 947
[5]
Gaster B, Howes L, Kaeli D R, Mistry P, Schaa D. Heterogeneous Computing with OpenCL. Morgan Kaufmann Publishers Inc., 2011
[6]
Lee S, Vetter J S. Early evaluation of directive-based GPU programming models for productive exascale computing. In: Proceedings of the 2012 International Conference for High Performance Computing, Networking, Storage and Analysis. 2012, 1---11
[7]
Wienke S, Springer P, Terboven C, Mey D. Openacc: first experiences with real-world applications. In: Proceedings of the 18th International Conference on Parallel Processing. 2012, 859---870
[8]
PGI Accelerator Compilers. Portland Group Inc, 2011
[9]
Yang X L, Tang T, Wang G B, Jia J, Xu X H. MPtoStream: an openMP compiler for CPU-GPU heterogeneous parallel systems. Science China Information Sciences, 2012, 55(9): 1961---1971
[10]
Dolbeau R, Bihan S, Bodin F. Hmpp: a hybrid multi-core parallel programming environment. In: Proceedings of the 2007 Workshop on General Purpose Processing on Graphics Processing Units. 2007, 1---5
[11]
Checconi F, Petrini F, Willcock J, Lumsdaine A, Choudhury A R, Sabharwal Y. Breaking the speed and scalability barriers for graph exploration on distributed-memory machines. In: Proceedings of the 2012 International Conference for High Performance Computing, Networking, Storage and Analysis. 2012, 1---12
[12]
Beamer S, Buluç A, Asanovic K, Patterson D. Distributed memory breadth-first search revisited: enabling bottom-up search. In: Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing Workshops and PhD Forum. 2013, 1618---1627
[13]
Subramaniam S, Mehrotra M, Gupta D. Virtual high throughput screening (VHIS)-a perspective. Bioinformation, 2007, 3(1): 14---17
[14]
Tanrikulu Y, Krüger B, Proschak E. The holistic integration of virtual screening in drug discovery. Drug Discovery Today, 2013, 18(7): 358---364
[15]
Zhang X, Wong S E, Lightstone F C. Message passing interface and multithreading hybrid for parallel molecular docking of large databases on petascale high performance computing machines. Journal of Computational Chemistry, 2013, 34(11): 915---927
[16]
Lang P T, Brozell S R, Mukherjee S, Pettersen E F, Meng E C, Thomas V, Rizzo R C, Case D A, James T L, Kuntz I D. Dock 6: combining techniques to model RNA-small molecule complexes. RNA, 2009, 15(6): 1219---1230
[17]
Gao Z, Li H, Zhang H, Liu X, Kang L, Luo X, Zhu W, Chen K, Wang X, Jiang H. PDTD: a web-accessible protein database for drug target identification. BMC Bioinformatics, 2008, 9(1): 104
[18]
Yang C, Xue W, Fu H, Gan L, Li L, Xu Y, Lu Y, Sun J, Yang G, Zheng W. A peta-scalable CPU-GPU algorithm for global atmospheric simulations. In: Proceedings of the 18th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. 2013, 1---12

Cited By

View all
  • (2024)Towards Highly Compatible I/O-Aware Workflow Scheduling on HPC SystemsProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC41406.2024.00031(1-15)Online publication date: 17-Nov-2024
  • (2024)File I/O Cache Performance of Supercomputer Fugaku Using an Out-of-Core Direct Numerical Simulation Code of TurbulenceComputational Science – ICCS 202410.1007/978-3-031-63778-0_13(173-187)Online publication date: 2-Jul-2024
  • (2022)Towards scalable resource management for supercomputersProceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis10.5555/3571885.3571916(1-15)Online publication date: 13-Nov-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Frontiers of Computer Science: Selected Publications from Chinese Universities
Frontiers of Computer Science: Selected Publications from Chinese Universities  Volume 8, Issue 3
June 2014
183 pages
ISSN:2095-2228
EISSN:2095-2236
Issue’s Table of Contents

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 01 June 2014

Author Tags

  1. MilkyWay-2 supercomputer
  2. benchmark optimization
  3. heterogeneous programing model
  4. interconnect network
  5. neo-heterogeneous architecture
  6. performance evaluation
  7. petaflops computing
  8. system management

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Towards Highly Compatible I/O-Aware Workflow Scheduling on HPC SystemsProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC41406.2024.00031(1-15)Online publication date: 17-Nov-2024
  • (2024)File I/O Cache Performance of Supercomputer Fugaku Using an Out-of-Core Direct Numerical Simulation Code of TurbulenceComputational Science – ICCS 202410.1007/978-3-031-63778-0_13(173-187)Online publication date: 2-Jul-2024
  • (2022)Towards scalable resource management for supercomputersProceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis10.5555/3571885.3571916(1-15)Online publication date: 13-Nov-2022
  • (2020)GPU Acceleration of a High-Order CFD ProgramProceedings of the 2020 4th International Conference on High Performance Compilation, Computing and Communications10.1145/3407947.3407963(123-128)Online publication date: 27-Jun-2020
  • (2020)Parallelization and Optimization of a Combustion Simulation Application on GPU PlatformProceedings of the 2020 4th International Conference on High Performance Compilation, Computing and Communications10.1145/3407947.3407960(50-55)Online publication date: 27-Jun-2020
  • (2020)Optimizing the SSD Burst Buffer by Traffic DetectionACM Transactions on Architecture and Code Optimization10.1145/337770517:1(1-26)Online publication date: 4-Mar-2020
  • (2020)Spatially Bursty I/O on Supercomputers: Causes, Impacts and SolutionsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2020.300557231:12(2908-2922)Online publication date: 9-Jul-2020
  • (2020)Design and Implementation of the Tianhe-2 Data Storage and Management SystemJournal of Computer Science and Technology10.1007/s11390-020-9799-435:1(27-46)Online publication date: 1-Jan-2020
  • (2019)Big Data Framework for Scalable and Efficient Biomedical Literature Mining in the CloudProceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval10.1145/3342827.3342843(80-86)Online publication date: 28-Jun-2019
  • (2019)GeckoProceedings of the 10th International Workshop on Programming Models and Applications for Multicores and Manycores10.1145/3303084.3309489(21-30)Online publication date: 17-Feb-2019
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media