Abstract
OpenSHMEM is an open standard that brings together several long-standing, vendor-specific SHMEM implementations that allows applications to use SHMEM in a platform-independent fashion. Several implementations of OpenSHMEM have become available on clusters interconnected by InfiniBand networks, which has gradually become the de facto high performance network interconnect standard. In this paper, we present a detailed comparison and analysis of the performance of different OpenSHMEM implementations, using micro-benchmarks and application kernels. This study, done on TACC Stampede system using up to 4,096 cores, provides a useful guide for application developers to understand and contrast various implementations and to select the one that works best for their applications.
This research is supported in part by National Science Foundation grants #OCI-0926691, #OCI-1148371 and #CCF-1213084.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Co-Array Fortran, http://www.co-array.org
Brightwell, R., Pedretti, K.: An Intra-Node Implementation of OpenSHMEM Using Virtual Address Space Mapping. In: The 5th Conference on Partitioned Global Address Space (PGAS) (2011)
Bonachea, D.: GASNet Specification v1.1. Tech. Rep. UCB/CSD-02-1207, U. C. Berkeley (2008)
HPCToolkit, http://hpctoolkit.org/
Jose, J., Kandalla, K., Luo, M., Panda, D.: Supporting Hybrid MPI and OpenSHMEM over InfiniBand: Design and Performance Evaluation. In: 41st International Conference on Parallel Processing, ICPP (2012)
Mellanox Scalable SHMEM, http://www.mellanox.com/page/products_dyn?product_family=133&mtag=scalableshmem
MVAPICH2-X: Unified MPI+PGAS Communication Runtime over OpenFabrics/Gen2 for Exascale Systems, http://mvapich.cse.ohio-state.edu/
OpenMPI: Open Source High Performance Computing, http://www.open-mpi.org/
OpenSHMEM, http://openshmem.org/
OSU Micro-benchmarks, http://mvapich.cse.ohio-state.edu/benchmarks/
Potluri, S., Kandalla, K., Bureddy, D., Li, M., Panda, D.K.: Efficient Intranode Desgins for OpenSHMEM on Multicore Clusters. In: The 6th Conference on Partitioned Global Address Space, PGAS (2012)
Shainer, G., Wilde, T., Lui, P., Liu, T., Kagan, M., Dubman, M., Shahar, Y., Graham, R., Shamis, P., Poole, S.: The Co-design Architecture for Exascale Systems, a Novel Approach for Scalable Designs. Computer Science-Research and Development, 1–7 (2013)
Silicon Graphics International: SHMEM API for Parallel Programming, http://www.shmem.org/
TACC Stampede Cluster, http://www.xsede.org/resources/overview
UPC Consortium: UPC Language Specifications, v1.2. Tech. Rep. LBNL-59208, Lawrence Berkeley National Lab (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Jose, J., Zhang, J., Venkatesh, A., Potluri, S., Panda, D.K.(. (2014). A Comprehensive Performance Evaluation of OpenSHMEM Libraries on InfiniBand Clusters. In: Poole, S., Hernandez, O., Shamis, P. (eds) OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools. OpenSHMEM 2014. Lecture Notes in Computer Science, vol 8356. Springer, Cham. https://doi.org/10.1007/978-3-319-05215-1_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-05215-1_2
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05214-4
Online ISBN: 978-3-319-05215-1
eBook Packages: Computer ScienceComputer Science (R0)