Efficient AMG on Heterogeneous Systems

Jiri Kraus &
Malte Förster

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 7174)

Abstract

In many numerical simulation codes, the core of the application is the solution of linear systems of equations. Because these systems often arise from the discretization of differential equations, the corresponding matrices are very sparse. Multigrid methods, in particular algebraic multigrid (AMG), are a popular way to solve such sparse linear systems because of their numerical scalability. Since memory bandwidth is usually the bottleneck of linear solvers for sparse systems, these solvers benefit especially from high-throughput architectures like GPUs. We show that this holds even for a rather complex hierarchical method like AMG. All presented benchmarks are based on the new open source library LAMA and compare run times on different GPUs to those of an efficient OpenMP-parallel CPU implementation. Because the memory access pattern is especially crucial on GPUs, we focus on the performance of different sparse matrix formats.
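
The abstract does not name the sparse matrix formats that were benchmarked. As a minimal sketch, assuming two formats that are commonly compared in this setting, CSR (compressed sparse row) and ELL (ELLPACK), the following Python code stores the same matrix both ways and computes a sparse matrix-vector product with each. All function names are hypothetical and are not part of LAMA's API; the code only illustrates the data layouts, not GPU performance.

```python
# Illustrative sketch only: CSR and ELL storage of one sparse matrix.
# Function names are hypothetical; this is not LAMA code.

def dense_to_csr(a):
    """Convert a dense list-of-rows matrix to CSR arrays."""
    values, col_idx, row_ptr = [], [], [0]
    for row in a:
        for j, v in enumerate(row):
            if v != 0:
                values.append(v)
                col_idx.append(j)
        row_ptr.append(len(values))  # end of this row in values/col_idx
    return values, col_idx, row_ptr


def csr_to_ell(values, col_idx, row_ptr):
    """Convert CSR to ELL: every row is padded to the longest row length.

    Storage is slot-major: ell_vals[k][i] is the k-th stored entry of row i.
    Padding entries use value 0.0 and column 0, so they add nothing to y.
    Assumes the matrix has at least one row.
    """
    n = len(row_ptr) - 1
    width = max(row_ptr[i + 1] - row_ptr[i] for i in range(n))
    ell_vals = [[0.0] * n for _ in range(width)]
    ell_cols = [[0] * n for _ in range(width)]
    for i in range(n):
        for k, p in enumerate(range(row_ptr[i], row_ptr[i + 1])):
            ell_vals[k][i] = values[p]
            ell_cols[k][i] = col_idx[p]
    return ell_vals, ell_cols


def spmv_csr(values, col_idx, row_ptr, x):
    """y = A x with A in CSR; each row has its own number of entries."""
    y = []
    for i in range(len(row_ptr) - 1):
        acc = 0.0
        for p in range(row_ptr[i], row_ptr[i + 1]):
            acc += values[p] * x[col_idx[p]]
        y.append(acc)
    return y


def spmv_ell(ell_vals, ell_cols, n_rows, x):
    """y = A x with A in ELL; every row does the same number of steps."""
    y = [0.0] * n_rows
    for k in range(len(ell_vals)):      # loop over slots
        for i in range(n_rows):         # on a GPU: one thread per row i
            y[i] += ell_vals[k][i] * x[ell_cols[k][i]]
    return y


if __name__ == "__main__":
    # 1D Poisson stencil (2 on the diagonal, -1 next to it), a typical
    # matrix arising from the discretization of a differential equation.
    a = [[2.0, -1.0, 0.0, 0.0],
         [-1.0, 2.0, -1.0, 0.0],
         [0.0, -1.0, 2.0, -1.0],
         [0.0, 0.0, -1.0, 2.0]]
    x = [1.0, 2.0, 3.0, 4.0]
    vals, cols, ptr = dense_to_csr(a)
    ell_vals, ell_cols = csr_to_ell(vals, cols, ptr)
    print(spmv_csr(vals, cols, ptr, x))
    print(spmv_ell(ell_vals, ell_cols, len(a), x))
```

In this sketch, CSR stores only the nonzeros, but rows of different lengths give each row a different amount of work. ELL pads every row to the same length and stores the entries slot by slot, so that with one GPU thread per row, neighbouring threads would read neighbouring memory locations. The price is the storage and bandwidth spent on padding when row lengths vary strongly.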

Author information

Authors and Affiliations

Fraunhofer Institute for Algorithms and Scientific Computing SCAI, Schloss Birlinghoven, 53754, Sankt Augustin, Germany
Jiri Kraus & Malte Förster

Editor information

Editors and Affiliations

High Performance Computing Center Stuttgart (HLRS), Universität Stuttgart, Nobelstraße 19, 70569, Stuttgart, Germany
Rainer Keller
Institute of Computer Science and Engineering, Karlsruhe Institute of Technology (KIT), Haid-und-Neu-Straße 7, 76131, Karlsruhe, Germany
David Kramer
Institute for Applied and Numerical Mathematics, SRG New Frontiers in High Performance Computing and Karlsruhe Institute of Technology (KIT), 4, Fritz-Erler-Straße 23, 76133, Karlsruhe, Germany
Jan-Philipp Weiss

About this chapter

Cite this chapter

Kraus, J., Förster, M. (2012). Efficient AMG on Heterogeneous Systems. In: Keller, R., Kramer, D., Weiss, JP. (eds) Facing the Multicore - Challenge II. Lecture Notes in Computer Science, vol 7174. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30397-5_12

DOI: https://doi.org/10.1007/978-3-642-30397-5_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30396-8
Online ISBN: 978-3-642-30397-5
eBook Packages: Computer Science (R0)
