More Web Proxy on the site http://driver.im/

research-article

A runtime system for finite element methods in a partitioned global address space

Authors:

Daniel Grünewald,

Frank HannigAuthors Info & Claims

CF '20: Proceedings of the 17th ACM International Conference on Computing Frontiers

Pages 39 - 48

https://doi.org/10.1145/3387902.3392628

Published: 23 May 2020 Publication History

Abstract

With approaching exascale performance, applications in the domain of high-performance computing (HPC) have to scale to an ever-increasing amount of compute nodes. The Global Address Space Programming Interface (GASPI) communication API promises to handle this challenge by providing a highly flexible and efficient programming model in a partitioned global address space (PGAS).

Suitable applications targeting supercomputers include the domain of mesh-based solvers for partial differential equations (PDEs) due to their high computational intensity. The implementation of such solvers is highly interdisciplinary, which therefore requires an abstraction of hardware-specific parallelization techniques from developing numerical algorithms.

We present an open-source run-time system (RTS) that distributes and parallelizes device-agnostic kernels, which define algorithms on unstructured grids. We describe how the RTS abstracts common parts of iterative solvers and further explain how to parallelize and distribute these components. We further show the efficiency of our approach for several microbenchmarks and an implementation of the discontinuous Galerkin method (DGM). The results show that we can almost completely hide all synchronization overhead and that the RTS only imposes a small computational cost.

References

[1]

K.-J. Bathe. Finite Element Procedures. 2nd. Prentice Hall, 2014. ISBN: 978-0-9790049-5-7.

[2]

G. Nikishkov. "Introduction to the Finite Element Method". In: (2004).

[3]

D. Unat, A. Dubey, T. Hoefler, J. Shalf, M. Abraham, M. Bianco, B. Chamberlain, R. Cledat, H. Edwards, H. Finkel, K. Fürlinger, F. Hannig, E. Jeannot, A. Kamil, J. Keasler, P. Kelly, V. Leung, H. Ltaief, N. Maruyama, C. Newburn, and M. Pericàs. "Trends in Data Locality Abstractions for HPC Systems". In: IEEE Transactions on Parallel and Distributed Systems 28.10 (Oct. 2017), pp. 3007--3020.

[4]

R. Rabenseifner, G. Hager, and G. Jost. "Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes". In: Proceedings of the 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing. Feb. 2009, pp. 427--436.

Digital Library

[5]

H. Jin, D. Jespersen, P. Mehrotra, R. Biswas, L. Huang, and B. Chapman. "High Performance Computing Using MPI and OpenMP on Multi-Core Parallel Systems". In: Parallel Computing 37.9 (2011), pp. 562--575.

Digital Library

[6]

C. Simmendinger, M. Rahn, and D. Grünewald. "The GASPI API: A Failure Tolerant PGAS API for Asynchronous Dataflow on Heterogeneous Architectures". In: Sustained Simulation Performance 2014. Springer, 2015, pp. 17--32.

[7]

F. ITWM. GPI-2. http://www.gpi-site.com/. Accessed: 2019-10-07.

[8]

D. Pflüger, M. Mehl, J. Valentin, F. Lindner, D. Pfander, S. Wagner, D. Graziotin, and Y. Wang. "The Scalability-Efficiency/Maintainability-Portability Trade-off in Simulation Software Engineering: Examples and a Preliminary Systematic Literature Review". In: The Computing Research Repository (CoRR) (2016). arXiv: 1608.04336 [cs.SE].

[9]

C. Schmitt, S. Kronawitter, F. Hannig, J. Teich, and C. Lengauer. "Automating the Development of High-Performance Multigrid Solvers". In: Proceedings of the IEEE 106.11 (Nov. 2018), pp. 1969--1984.

[10]

B. Chamberlain, D. Callahan, and H. Zima. "Parallel Programmability and the Chapel Language". In: The International Journal of High Performance Computing Applications 21.3 (2007), pp. 291--312.

Digital Library

[11]

T. W. Jan S. Hesthaven. Nodal Discontinuous Galerkin Methods. Springer Publishing, 2008. ISBN: 978-0-387-72067-8.

[12]

D. S. Lo. Finite Element Mesh Generation. CRC Press, 2015. ISBN: 978-1-4822-6687-0.

[13]

GASPI Forum. GASPI: Global Address Space Programming Interface. Tech. rep. Version 17.1. 2017. URL: https://raw.git.hubusercontent.com/GASPI-Forum/GASPI-Forum.github.io/master/standards/GASPI-17.1.pdf.

[14]

Daniel Grünewald. Asynchronous Constraint Execution. 2020. URL: https://github.com/cc-hpc-itwm/ACE.

[15]

G. Karypis and V. Kumar. "A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs". In: SIAM Journal on Scientific Computing (SISC) (Dec. 1998), pp. 359--392.

[16]

M. Blatt, A. Burchardt, A. Dedner, C. Engwer, J. Fahlke, B. Flemisch, C. Gersbacher, C. Gräser, F. Gruber, C. Grüninger, D. Kempf, R. Klöfkorn, T. Malkmus, S. Müthing, M. Nolte, M. Piatkowski, and O. Sander. "The Distributed and Unified Numerics Environment, Version 2.4". In: Archive of Numerical Software 4.100 (2016), pp. 13--29.

[17]

S. Abhyankar, J. Brown, E. M. Constantinescu, D. Ghosh, B. F. Smith, and H. Zhang. "PETSc/TS: A Modern Scalable ODE/DAE Solver Library". In: The Computing Research Repository (CoRR) (2018). arXiv: 1806.01437 [math.NA].

[18]

J. R. Stewart and H. C. Edwards. "A Framework Approach for Developing Parallel Adaptive Multiphysics Applications". In: Finite Elements in Analysis and Design (July 2004), pp. 1599--1617.

Digital Library

[19]

G. R. Mudalige, M. B. Giles, I. Reguly, C. Bertolli, and P. H. J. Kelly. "OP2: An Active Library Framework for Solving Unstructured Mesh-based Applications on Multi-Core and Many-Core Architectures". In: Proceedings of Innovative Parallel Computing (InPar). 2012, pp. 1--12.

[20]

Z. DeVito, N. Joubert, F. Palacios, S. Oakley, M. Medina, M. Barrientos, E. Elsen, F. Ham, A. Aiken, K. Duraisamy, et al. "Liszt: A Domain Specific Language for Building Portable Mesh-based PDE Solvers". In: Proc. of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC). ACM, 2011, 9:1--9:12.

Digital Library

[21]

M. S. Alnæs, A. Logg, K. B. Ølgaard, M. E. Rognes, and G. N. Wells. "Unified Form Language: A Domain-specific Language for Weak Formulations of Partial Differential Equations". In: ACM Transactions on Mathematical Software (TOMS) 40.2 (Mar. 2014), 9:1--9:37.

Digital Library

[22]

N. Möller, E. Petit, Q. Carayol, Q. Dinh, and W. Jalby. "Asynchronous One-Sided Communications for Scalable Fast Multipole Method in Electromagnetic Simulations". In: Proceedings of the Open Workshop on Data Locality (COLOC) in conjunction with Euro-Par 2017. Aug. 2017. URL: https://hal.archives-ouvertes.fr/hal-01998352.

[23]

J. Breitbart, M. Schmidtobreick, and V. Heuveline. "Evaluation of the Global Address Space Programming Interface (GASPI)". In: Proceedings of the IEEE International Parallel Distributed Processing Symposium Workshops (IPDPSW). May 2014, pp. 717--726.

Digital Library

[24]

R. Belli and T. Hoefler. "Notified Access: Extending Remote Memory Access Programming Models for Producer-Consumer Synchronization". In: Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS). May 2015, pp. 871--881.

Digital Library

[25]

B. Chapman, T. Curtis, S. Pophale, S. Poole, J. Kuehn, C. Koelbel, and L. Smith. "Introducing OpenSHMEM: SHMEM for the PGAS Community". In: Proc. of the Conference on Partitioned Global Address Space Programming Model (PGAS) (New York, NY, USA). ACM, 2010, 2:1--2:3.

Digital Library

[26]

V. End. "On Collective Communication and Notified Read in the Global Address Space Programming Interface (GASPI)". PhD thesis. Georg-August-Universität Göttingen, 2016.

Cited By

Alhaddad SFörstner JGroth SGrünewald DGrynko YHannig FKenter TPfreundt FPlessl CSchotte MSteinke TTeich JWeiser MWende F(2021)HighPerMeshes – A Domain-Specific Language for Numerical Algorithms on Unstructured GridsEuro-Par 2020: Parallel Processing Workshops10.1007/978-3-030-71593-9_15(185-196)Online publication date: 14-Mar-2021
https://doi.org/10.1007/978-3-030-71593-9_15
Alhaddad SFörstner JGroth SGrünewald DGrynko YHannig FKenter TPfreundt FPlessl CSchotte MSteinke TTeich JWeiser MWende F(2021)The HighPerMeshes framework for numerical algorithms on unstructured gridsConcurrency and Computation: Practice and Experience10.1002/cpe.661634:14Online publication date: 21-Sep-2021
https://doi.org/10.1002/cpe.6616

Index Terms

A runtime system for finite element methods in a partitioned global address space
1. Computing methodologies

Recommendations

Partitioned Global Address Space Languages

The Partitioned Global Address Space (PGAS) model is a parallel programming model that aims to improve programmer productivity while at the same time aiming for high performance. The main premise of PGAS is that a globally shared address space improves ...
Productivity and performance using partitioned global address space languages
PASCO '07: Proceedings of the 2007 international workshop on Parallel symbolic computation

Partitioned Global Address Space (PGAS) languages combine the programming convenience of shared memory with the locality and performance control of message passing. One such language, Unified Parallel C (UPC) is an extension of ISO C defined by a ...
High order central discontinuous Galerkin-finite element methods for the Camassa-Holm equation

In this paper, we consider the numerical approximation of the Camassa-Holm equation which supports peakon solutions. We develop and test a high order central discontinuous Galerkin-finite element methods for solving this equation. In our numerical ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

CF '20: Proceedings of the 17th ACM International Conference on Computing Frontiers

May 2020

298 pages

ISBN:9781450379564

DOI:10.1145/3387902

General Chairs:
Maurizio Palesi
University of Catania, IT
,
Gianluca Palermo
Politecnico di Milano, IT
,
Program Chairs:
Cat Graves
Hewlett Packard Labs
,
Eishi Arima
ITC University of Tokyo, JP

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 May 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Badges

Author Tags

Qualifiers

Research-article

Funding Sources

Bundesministerium für Bildung und Forschung

Conference

CF '20

Sponsor:

SIGMICRO

CF '20: Computing Frontiers Conference

May 11 - 13, 2020

Sicily, Catania, Italy

Acceptance Rates

Overall Acceptance Rate 273 of 785 submissions, 35%

Upcoming Conference

CF '25

Sponsor:
sigmicro

22nd ACM International Conference on Computing Frontiers

May 28 - 30, 2025

Cagliari , Italy

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
102
Total Downloads

Downloads (Last 12 months)7
Downloads (Last 6 weeks)1

Reflects downloads up to 25 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Alhaddad SFörstner JGroth SGrünewald DGrynko YHannig FKenter TPfreundt FPlessl CSchotte MSteinke TTeich JWeiser MWende F(2021)HighPerMeshes – A Domain-Specific Language for Numerical Algorithms on Unstructured GridsEuro-Par 2020: Parallel Processing Workshops10.1007/978-3-030-71593-9_15(185-196)Online publication date: 14-Mar-2021
https://doi.org/10.1007/978-3-030-71593-9_15
Alhaddad SFörstner JGroth SGrünewald DGrynko YHannig FKenter TPfreundt FPlessl CSchotte MSteinke TTeich JWeiser MWende F(2021)The HighPerMeshes framework for numerical algorithms on unstructured gridsConcurrency and Computation: Practice and Experience10.1002/cpe.661634:14Online publication date: 21-Sep-2021
https://doi.org/10.1002/cpe.6616

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents