Poster
DOI: 10.1145/3588195.3595955

Accelerating MPI Collectives with Process-in-Process-based Multi-object Techniques

Published: 07 August 2023

Abstract

In the exascale computing era, optimizing MPI collective performance in high-performance computing (HPC) applications is critical. Current algorithms suffer performance degradation from system call overhead, page faults, and extra data copies, limiting the efficiency and scalability of HPC applications. To address these issues, we propose PiP-MColl, a Process-in-Process-based Multi-object Inter-process MPI Collective design that maximizes small-message MPI collective performance at scale. PiP-MColl features efficient collective algorithms with multiple senders and receivers, and it leverages Process-in-Process shared-memory techniques to eliminate unnecessary system calls, page faults, and extra data copies, improving intra- and inter-node message rate and throughput. Our design also boosts performance for larger messages, yielding comprehensive improvement across a wide range of message sizes. Experimental results show that PiP-MColl outperforms popular MPI libraries, including OpenMPI, MVAPICH2, and Intel MPI, by up to 4.6X for MPI collectives such as MPI_Scatter and MPI_Allgather.
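
The mechanism named in the abstract can be illustrated briefly. Conventional intra-node collectives move data either through a staging buffer in a shared-memory segment (costing an extra copy) or through cross-memory-attach system calls such as process_vm_readv (costing a system call per transfer, plus page faults on first touch of remote mappings). Because Process-in-Process maps all ranks into a single address space, a rank can instead read a peer's buffer with a plain memcpy. The C sketch below is a minimal illustration of that idea under stated assumptions, not the PiP-MColl implementation: plain POSIX threads stand in for PiP tasks (both share one address space), the exchange shown is an allgather-style pattern in which every rank acts as both sender and receiver, and all names (NRANKS, CHUNK, contrib, rank_main) are hypothetical.

/*
 * Minimal sketch: direct allgather over a shared address space.
 * PiP maps MPI ranks into one address space; here plain pthreads
 * stand in for PiP tasks, so every "rank" can read a peer's buffer
 * with an ordinary memcpy -- no shared-memory staging copy, no
 * cross-memory-attach system call, no page fault on a remote
 * mapping. All names are hypothetical.
 */
#include <pthread.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

#define NRANKS 4
#define CHUNK  8                    /* ints contributed per rank */

static int *contrib[NRANKS];        /* buffer pointers published by each rank */
static pthread_barrier_t barrier;

static void *rank_main(void *arg) {
    int me = (int)(long)arg;

    /* Each rank fills and publishes its own contribution. */
    int *mine = malloc(CHUNK * sizeof *mine);
    for (int i = 0; i < CHUNK; i++) mine[i] = me * 100 + i;
    contrib[me] = mine;

    /* Make every contribution visible before anyone reads. */
    pthread_barrier_wait(&barrier);

    /* "Multi-object" phase: every rank is simultaneously a sender
     * (its buffer is read by all peers) and a receiver (it copies
     * from all peers), so no single root serializes the exchange. */
    int *recv = malloc(NRANKS * CHUNK * sizeof *recv);
    for (int peer = 0; peer < NRANKS; peer++)
        memcpy(recv + peer * CHUNK, contrib[peer], CHUNK * sizeof *recv);

    pthread_barrier_wait(&barrier);  /* all reads done; safe to free */
    printf("rank %d: recv[last] = %d\n", me, recv[NRANKS * CHUNK - 1]);
    free(mine);
    free(recv);
    return NULL;
}

int main(void) {
    pthread_t tid[NRANKS];
    pthread_barrier_init(&barrier, NULL, NRANKS);
    for (long r = 0; r < NRANKS; r++)
        pthread_create(&tid[r], NULL, rank_main, (void *)r);
    for (int r = 0; r < NRANKS; r++)
        pthread_join(tid[r], NULL);
    pthread_barrier_destroy(&barrier);
    return 0;
}

Under actual PiP the same pattern would run across real processes spawned into one address space, so each rank keeps privatized global variables while peer buffer pointers remain directly dereferenceable.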


Cited By

  • (2024) POSTER: Optimizing Collective Communications with Error-bounded Lossy Compression for GPU Clusters. In Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 454-456. DOI: 10.1145/3627535.3638467. Online publication date: 2-Mar-2024.
  • (2023) PiP-MColl: Process-in-Process-based Multi-object MPI Collectives. In 2023 IEEE International Conference on Cluster Computing (CLUSTER), 354-364. DOI: 10.1109/CLUSTER52292.2023.00037. Online publication date: 31-Oct-2023.

    Published In

    HPDC '23: Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing
    August 2023, 350 pages
    ISBN: 9798400701559
    DOI: 10.1145/3588195
    General Chair: Ali R. Butt
    Program Chairs: Ningfang Mi, Kyle Chard
    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.


    Publisher

    Association for Computing Machinery, New York, NY, United States


    Author Tags

    1. distributed systems
    2. message passing interface
    3. mpi collective
    4. parallel algorithms
    5. process-in-process

    Qualifiers

    • Poster

    Funding Sources

    • US Department of Energy

    Conference

    HPDC '23

    Acceptance Rates

    Overall Acceptance Rate 166 of 966 submissions, 17%
