[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1109/CCGRID.2009.85acmotherconferencesArticle/Chapter ViewAbstractPublication PagesccgridConference Proceedingsconference-collections
Article

Natively Supporting True One-Sided Communication in MPI on Multi-core Systems with InfiniBand

Published: 18 May 2009 Publication History

Abstract

As high-end computing systems continue to grow in scale, the performance that applications can achieve on such large scale systems depends heavily on their ability to avoid explicitly synchronized communication with other processes in the system. Accordingly, several modern and legacy parallel programming models (such as MPI, UPC, Global Arrays) have provided many programming constructs that enable implicit communication using one-sided communication operations. While MPI is the most widely used communication model for scientific computing, the usage of one-sided communication is restricted; this is mainly owing to the inefficiencies in current MPI implementations that internally rely on synchronization between processes even during one-sided communication, thus losing the potential of such constructs. In our previous work, we had utilized native one-sided communication primitives offered by high-speed networks such as InfiniBand (IB) to allow for true one-sided communication in MPI. In this paper, we extend this work to natively take advantage of one-sided atomic operations on cache-coherent multi-core/multi-processor architectures while still utilizing the benefits of networks such as IB. Specifically, we present a sophisticated hybrid design that uses locks that migrate between IB hardware atomics and multi-core CPU atomics to take advantage of both. We demonstrate the capability of our proposed design with a wide range of experiments illustrating its benefits in performance as well as its potential to avoid explicit synchronization.

References

[1]
Berkeley Unified Parallel C (UPC) Project. http://upc.lbl.gov/.
[2]
Blue Gene System Architecture Overview. http://www.research.ibm.com/journal/rd/492/gara.html.
[3]
Global Arrays. http://www.emsl.pnl.gov/docs/global/.
[4]
GROMACS. http://www.gromacs.org/.
[5]
PETSc. http://www-unix.mcs.anl.gov/petsc/.
[6]
Argonne National Laboratory. MPICH2. http://wwwunix.mcs.anl.gov/mpi/mpich2/.
[7]
B. W. Barrett, G. M. Shipman, and A. Lumsdaine. Analysis of Implementation Options for MPI-2 One-Sided. In Proceedings, Euro PVM/MPI, Paris, France, October 2007.
[8]
D. Bonachea. GASNet Specification, v1.1. Technical Report UCB/CSD-02-1207, Computer Science Division, University of California at Berkeley, October 2002.
[9]
S. Booth and F. E. Mourao. Single Sided MPI Implementations for SUN MPI. In Supercomputing, 2000.
[10]
D. Buntinas, D. K. Panda, and W. Gropp. NIC-Based Atomic Remote Memory Operations in Myrinet/GM. Workshop on Novel Uses of System Area Networks (SAN-1), February 2002.
[11]
A. Devulapalli and P. Wyckoff. Distributed Queue Based Locking Using Advanced Network Features. In ICPP, 2005.
[12]
M. Frigo and S.G. Johnson. The Design and Implementation of FFTW3. Proceedings of the IEEE, 2005.
[13]
M. Goudreau, K. Lang, S. B. Rao, T. Suel, and T. Tsantilas. Portable and Effcient Parallel Computing Using the BSP Model. IEEE Transactions on Computers, pages 670-689, 1999.
[14]
W. Gropp, E. Lusk, and A. Skjellum. Using MPI: Portable Parallel Programming with the Message Passing Interface, 2nd edition. MIT Press, Cambridge, MA, 1999.
[15]
J. Hilland, P. Culley, J. Pinkerton, and R. Recio. RDMA Protocol Verbs Specification (Version 1.0). Technical report, RDMA Consortium, April 2003.
[16]
InfiniBand Trade Association. InfiniBand Architecture Specification, Release 1.0, October 24 2000.
[17]
F. E. Mourao and J. G. Silva. Implementing MPI's One-Sided Communications for WMPI. In EuroPVM/MPI, September 1999.
[18]
S. Narravula, A. Mamidala, A. Vishnu, K. Vaidyanathan, and D. K. Panda. High Performance Distributed Lock Management Services using Network-based Remote Atomic Operations. CCGrid, 2007.
[19]
J. Nieplocha and B. Carpenter. ARMCI: A Portable Remote Memory Copy Library for Distributed Array Libraries and Compiler Run-Time Systems. Lecture Notes in Computer Science, 1586, 1999.
[20]
Fabrizio Petrini, Wu chun Feng, Adolfy Hoisie, Salvador Coll, and Eitan Frachtenberg. The quadrics network (qsnet): High-performance clustering technology. In HotI 01, pages 125-130, 2001.
[21]
G. Santhanaraman, S. Narravula, and D. K. Panda. Designing Passive Synchronization for MPI-2 One-Sided Communication to Maximize Overlap. In IPDPS, 2008.
[22]
J. Traff, H. Ritzdorf, and R. Hempel. The Implementation of MPI-2 One-Sided Communication for the NEC SX. In Proceedings of Supercomputing , 2000.

Cited By

View all
  • (2018)Enabling highly scalable remote memory access programming with MPI-3 one sidedCommunications of the ACM10.1145/326441361:10(106-113)Online publication date: 26-Sep-2018
  • (2016)Reducing Communication in Distributed Asynchronous Iterative MethodsProcedia Computer Science10.1016/j.procs.2016.05.50180:C(1906-1916)Online publication date: 1-Jun-2016
  • (2016)An implementation and evaluation of the MPI 3.0 one-sided communication interfaceConcurrency and Computation: Practice & Experience10.1002/cpe.375828:17(4385-4404)Online publication date: 10-Dec-2016
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
CCGRID '09: Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
May 2009
633 pages
ISBN:9780769536224

Publisher

IEEE Computer Society

United States

Publication History

Published: 18 May 2009

Check for updates

Author Tags

  1. InfiniBand
  2. MPI2
  3. RMA
  4. locks
  5. one-sided
  6. passive synchronization

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Enabling highly scalable remote memory access programming with MPI-3 one sidedCommunications of the ACM10.1145/326441361:10(106-113)Online publication date: 26-Sep-2018
  • (2016)Reducing Communication in Distributed Asynchronous Iterative MethodsProcedia Computer Science10.1016/j.procs.2016.05.50180:C(1906-1916)Online publication date: 1-Jun-2016
  • (2016)An implementation and evaluation of the MPI 3.0 one-sided communication interfaceConcurrency and Computation: Practice & Experience10.1002/cpe.375828:17(4385-4404)Online publication date: 10-Dec-2016
  • (2014)Enabling highly-scalable remote memory access programming with MPI-3 One SidedScientific Programming10.1155/2014/57190222:2(75-91)Online publication date: 1-Apr-2014
  • (2014)Intra-Epoch Message Scheduling To Exploit Unused or Residual Overlapping PotentialProceedings of the 21st European MPI Users' Group Meeting10.1145/2642769.2642779(13-19)Online publication date: 9-Sep-2014
  • (2014)Scalable MPI-3.0 RMA on the Blue Gene/Q SupercomputerProceedings of the 21st European MPI Users' Group Meeting10.1145/2642769.2642778(7-12)Online publication date: 9-Sep-2014
  • (2014)Nonblocking epochs in MPI one-sided communicationProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1109/SC.2014.44(475-486)Online publication date: 16-Nov-2014
  • (2014)A fast and resource-conscious MPI message queue mechanism for large-scale jobsFuture Generation Computer Systems10.1016/j.future.2013.07.00330:C(265-290)Online publication date: 1-Jan-2014
  • (2013)Enabling highly-scalable remote memory access programming with MPI-3 one sidedProceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis10.1145/2503210.2503286(1-12)Online publication date: 17-Nov-2013
  • (2013)Efficient and truly passive MPI-3 RMA using InfiniBand atomicsProceedings of the 20th European MPI Users' Group Meeting10.1145/2488551.2488573(91-96)Online publication date: 15-Sep-2013
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media