[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3387902.3392628acmconferencesArticle/Chapter ViewAbstractPublication PagescfConference Proceedingsconference-collections
research-article

A runtime system for finite element methods in a partitioned global address space

Published: 23 May 2020 Publication History

Abstract

With approaching exascale performance, applications in the domain of high-performance computing (HPC) have to scale to an ever-increasing amount of compute nodes. The Global Address Space Programming Interface (GASPI) communication API promises to handle this challenge by providing a highly flexible and efficient programming model in a partitioned global address space (PGAS).
Suitable applications targeting supercomputers include the domain of mesh-based solvers for partial differential equations (PDEs) due to their high computational intensity. The implementation of such solvers is highly interdisciplinary, which therefore requires an abstraction of hardware-specific parallelization techniques from developing numerical algorithms.
We present an open-source run-time system (RTS) that distributes and parallelizes device-agnostic kernels, which define algorithms on unstructured grids. We describe how the RTS abstracts common parts of iterative solvers and further explain how to parallelize and distribute these components. We further show the efficiency of our approach for several microbenchmarks and an implementation of the discontinuous Galerkin method (DGM). The results show that we can almost completely hide all synchronization overhead and that the RTS only imposes a small computational cost.

References

[1]
K.-J. Bathe. Finite Element Procedures. 2nd. Prentice Hall, 2014. ISBN: 978-0-9790049-5-7.
[2]
G. Nikishkov. "Introduction to the Finite Element Method". In: (2004).
[3]
D. Unat, A. Dubey, T. Hoefler, J. Shalf, M. Abraham, M. Bianco, B. Chamberlain, R. Cledat, H. Edwards, H. Finkel, K. Fürlinger, F. Hannig, E. Jeannot, A. Kamil, J. Keasler, P. Kelly, V. Leung, H. Ltaief, N. Maruyama, C. Newburn, and M. Pericàs. "Trends in Data Locality Abstractions for HPC Systems". In: IEEE Transactions on Parallel and Distributed Systems 28.10 (Oct. 2017), pp. 3007--3020.
[4]
R. Rabenseifner, G. Hager, and G. Jost. "Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes". In: Proceedings of the 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing. Feb. 2009, pp. 427--436.
[5]
H. Jin, D. Jespersen, P. Mehrotra, R. Biswas, L. Huang, and B. Chapman. "High Performance Computing Using MPI and OpenMP on Multi-Core Parallel Systems". In: Parallel Computing 37.9 (2011), pp. 562--575.
[6]
C. Simmendinger, M. Rahn, and D. Grünewald. "The GASPI API: A Failure Tolerant PGAS API for Asynchronous Dataflow on Heterogeneous Architectures". In: Sustained Simulation Performance 2014. Springer, 2015, pp. 17--32.
[7]
F. ITWM. GPI-2. http://www.gpi-site.com/. Accessed: 2019-10-07.
[8]
D. Pflüger, M. Mehl, J. Valentin, F. Lindner, D. Pfander, S. Wagner, D. Graziotin, and Y. Wang. "The Scalability-Efficiency/Maintainability-Portability Trade-off in Simulation Software Engineering: Examples and a Preliminary Systematic Literature Review". In: The Computing Research Repository (CoRR) (2016). arXiv: 1608.04336 [cs.SE].
[9]
C. Schmitt, S. Kronawitter, F. Hannig, J. Teich, and C. Lengauer. "Automating the Development of High-Performance Multigrid Solvers". In: Proceedings of the IEEE 106.11 (Nov. 2018), pp. 1969--1984.
[10]
B. Chamberlain, D. Callahan, and H. Zima. "Parallel Programmability and the Chapel Language". In: The International Journal of High Performance Computing Applications 21.3 (2007), pp. 291--312.
[11]
T. W. Jan S. Hesthaven. Nodal Discontinuous Galerkin Methods. Springer Publishing, 2008. ISBN: 978-0-387-72067-8.
[12]
D. S. Lo. Finite Element Mesh Generation. CRC Press, 2015. ISBN: 978-1-4822-6687-0.
[13]
GASPI Forum. GASPI: Global Address Space Programming Interface. Tech. rep. Version 17.1. 2017. URL: https://raw.git.hubusercontent.com/GASPI-Forum/GASPI-Forum.github.io/master/standards/GASPI-17.1.pdf.
[14]
Daniel Grünewald. Asynchronous Constraint Execution. 2020. URL: https://github.com/cc-hpc-itwm/ACE.
[15]
G. Karypis and V. Kumar. "A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs". In: SIAM Journal on Scientific Computing (SISC) (Dec. 1998), pp. 359--392.
[16]
M. Blatt, A. Burchardt, A. Dedner, C. Engwer, J. Fahlke, B. Flemisch, C. Gersbacher, C. Gräser, F. Gruber, C. Grüninger, D. Kempf, R. Klöfkorn, T. Malkmus, S. Müthing, M. Nolte, M. Piatkowski, and O. Sander. "The Distributed and Unified Numerics Environment, Version 2.4". In: Archive of Numerical Software 4.100 (2016), pp. 13--29.
[17]
S. Abhyankar, J. Brown, E. M. Constantinescu, D. Ghosh, B. F. Smith, and H. Zhang. "PETSc/TS: A Modern Scalable ODE/DAE Solver Library". In: The Computing Research Repository (CoRR) (2018). arXiv: 1806.01437 [math.NA].
[18]
J. R. Stewart and H. C. Edwards. "A Framework Approach for Developing Parallel Adaptive Multiphysics Applications". In: Finite Elements in Analysis and Design (July 2004), pp. 1599--1617.
[19]
G. R. Mudalige, M. B. Giles, I. Reguly, C. Bertolli, and P. H. J. Kelly. "OP2: An Active Library Framework for Solving Unstructured Mesh-based Applications on Multi-Core and Many-Core Architectures". In: Proceedings of Innovative Parallel Computing (InPar). 2012, pp. 1--12.
[20]
Z. DeVito, N. Joubert, F. Palacios, S. Oakley, M. Medina, M. Barrientos, E. Elsen, F. Ham, A. Aiken, K. Duraisamy, et al. "Liszt: A Domain Specific Language for Building Portable Mesh-based PDE Solvers". In: Proc. of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC). ACM, 2011, 9:1--9:12.
[21]
M. S. Alnæs, A. Logg, K. B. Ølgaard, M. E. Rognes, and G. N. Wells. "Unified Form Language: A Domain-specific Language for Weak Formulations of Partial Differential Equations". In: ACM Transactions on Mathematical Software (TOMS) 40.2 (Mar. 2014), 9:1--9:37.
[22]
N. Möller, E. Petit, Q. Carayol, Q. Dinh, and W. Jalby. "Asynchronous One-Sided Communications for Scalable Fast Multipole Method in Electromagnetic Simulations". In: Proceedings of the Open Workshop on Data Locality (COLOC) in conjunction with Euro-Par 2017. Aug. 2017. URL: https://hal.archives-ouvertes.fr/hal-01998352.
[23]
J. Breitbart, M. Schmidtobreick, and V. Heuveline. "Evaluation of the Global Address Space Programming Interface (GASPI)". In: Proceedings of the IEEE International Parallel Distributed Processing Symposium Workshops (IPDPSW). May 2014, pp. 717--726.
[24]
R. Belli and T. Hoefler. "Notified Access: Extending Remote Memory Access Programming Models for Producer-Consumer Synchronization". In: Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS). May 2015, pp. 871--881.
[25]
B. Chapman, T. Curtis, S. Pophale, S. Poole, J. Kuehn, C. Koelbel, and L. Smith. "Introducing OpenSHMEM: SHMEM for the PGAS Community". In: Proc. of the Conference on Partitioned Global Address Space Programming Model (PGAS) (New York, NY, USA). ACM, 2010, 2:1--2:3.
[26]
V. End. "On Collective Communication and Notified Read in the Global Address Space Programming Interface (GASPI)". PhD thesis. Georg-August-Universität Göttingen, 2016.

Cited By

View all
  • (2021)HighPerMeshes – A Domain-Specific Language for Numerical Algorithms on Unstructured GridsEuro-Par 2020: Parallel Processing Workshops10.1007/978-3-030-71593-9_15(185-196)Online publication date: 14-Mar-2021
  • (2021)The HighPerMeshes framework for numerical algorithms on unstructured gridsConcurrency and Computation: Practice and Experience10.1002/cpe.661634:14Online publication date: 21-Sep-2021

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CF '20: Proceedings of the 17th ACM International Conference on Computing Frontiers
May 2020
298 pages
ISBN:9781450379564
DOI:10.1145/3387902
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 May 2020

Permissions

Request permissions for this article.

Check for updates

Badges

Author Tags

  1. GASPI
  2. PGAS
  3. finite element methods
  4. runtime system

Qualifiers

  • Research-article

Funding Sources

  • Bundesministerium für Bildung und Forschung

Conference

CF '20
Sponsor:
CF '20: Computing Frontiers Conference
May 11 - 13, 2020
Sicily, Catania, Italy

Acceptance Rates

Overall Acceptance Rate 273 of 785 submissions, 35%

Upcoming Conference

CF '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)1
Reflects downloads up to 25 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2021)HighPerMeshes – A Domain-Specific Language for Numerical Algorithms on Unstructured GridsEuro-Par 2020: Parallel Processing Workshops10.1007/978-3-030-71593-9_15(185-196)Online publication date: 14-Mar-2021
  • (2021)The HighPerMeshes framework for numerical algorithms on unstructured gridsConcurrency and Computation: Practice and Experience10.1002/cpe.661634:14Online publication date: 21-Sep-2021

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media