Employing nested OpenMP for the parallelization of multi-zone computational fluid dynamics applications

Published: 01 May 2006

Abstract

In this paper we describe the parallelization of the multi-zone versions of the NAS Parallel Benchmarks using multi-level OpenMP parallelism. For our study we use the NanosCompiler, which supports nesting of OpenMP directives and provides clauses to control the grouping of threads, load balancing, and synchronization. We report the benchmark results, compare the timings with those of two hybrid parallelization paradigms (MPI+OpenMP and MLP), and discuss OpenMP implementation issues that affect the performance of multi-level parallel applications.
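As a rough illustration of the two-level structure the abstract describes, the sketch below nests one OpenMP parallel loop over zones around a second parallel loop over the points inside each zone. It is not the authors' code: it uses only standard OpenMP (`num_threads`, `schedule(dynamic)`, `omp_set_max_active_levels`) rather than the NanosCompiler thread-grouping clauses, and the `zone_t` type, the `update_zones` function, and the arithmetic in the inner loop are hypothetical placeholders.

```c
#include <stddef.h>
#ifdef _OPENMP
#include <omp.h>
#endif

/* Hypothetical zone descriptor. Zones may differ in size, which is
   why load balancing matters in multi-zone codes. */
typedef struct {
    size_t n;   /* number of grid points in this zone */
    double *u;  /* solution values for this zone */
} zone_t;

/* Apply one placeholder update to every point of every zone.
   outer_threads: threads sharing the loop over zones (must be >= 1).
   inner_threads: threads each outer thread requests for a zone (>= 1). */
void update_zones(zone_t *zones, int nzones,
                  int outer_threads, int inner_threads)
{
#ifdef _OPENMP
    /* Allow two active levels so the inner region is not serialized. */
    omp_set_max_active_levels(2);
#endif

    /* Outer level: zones are handed out dynamically, a simple stand-in
       for the explicit load-balancing control mentioned in the abstract. */
    #pragma omp parallel for num_threads(outer_threads) schedule(dynamic)
    for (int z = 0; z < nzones; ++z) {
        double *u = zones[z].u;
        long n = (long)zones[z].n;

        /* Inner level: loop-level parallelism within a single zone. */
        #pragma omp parallel for num_threads(inner_threads)
        for (long i = 0; i < n; ++i) {
            u[i] = 0.5 * u[i] + 1.0;  /* placeholder for the zone solver */
        }
    }
}
```

Built with an OpenMP flag (for example `-fopenmp` with GCC), the call `update_zones(zones, nzones, 4, 2)` requests up to eight threads in total; built without it, the pragmas are ignored and the same code runs serially. Every zone here gets the same inner team size, whereas the thread grouping described in the abstract is meant to let that allocation be controlled explicitly.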

Published In

Journal of Parallel and Distributed Computing  Volume 66, Issue 5
Special issue: 18th International parallel and distributed processing symposium
May 2006
158 pages

Publisher

Academic Press, Inc.

United States

Author Tags

  1. NAS benchmarks
  2. Nested parallelism
  3. OpenMP
  4. Parallel programming models

Qualifiers

  • Article
