Parallel performance wizard: A performance system for the analysis of partitioned global-address-space applications
HH Su, M Billingsley… - The International Journal …, 2010 - journals.sagepub.com
HH Su, M Billingsley, AD George
The International Journal of High Performance Computing …, 2010•journals.sagepub.comGiven the complexity of high-performance parallel programs, developers often must rely on
performance analysis tools to help them improve the performance of their applications.
While many tools support analysis of message-passing programs, tool support is limited for
applications written in programming models that present a partitioned global address space
(PGAS) to the programmer such as UPC and SHMEM. Existing tools that support message-
passing models are difficult to extend to support PGAS models due to differences between …
performance analysis tools to help them improve the performance of their applications.
While many tools support analysis of message-passing programs, tool support is limited for
applications written in programming models that present a partitioned global address space
(PGAS) to the programmer such as UPC and SHMEM. Existing tools that support message-
passing models are difficult to extend to support PGAS models due to differences between …
Given the complexity of high-performance parallel programs, developers often must rely on performance analysis tools to help them improve the performance of their applications. While many tools support analysis of message-passing programs, tool support is limited for applications written in programming models that present a partitioned global address space (PGAS) to the programmer such as UPC and SHMEM. Existing tools that support message-passing models are difficult to extend to support PGAS models due to differences between the two paradigms and the techniques used in their implementations. In this paper, we present our work on Parallel Performance Wizard (PPW), a performance analysis system for PGAS and MPI application analysis. We discuss new concepts, namely the generic-operation-type abstraction and GASP-enabled data collection, developed to facilitate support for multiple programming models and then give an overview of PPW’s automatic analysis and visualization capabilities. Finally, to show the usefulness of our system, we present results on PPW’s overhead, storage requirements and scalability before demonstrating its effectiveness via application case studies.
Sage Journals