[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/782052.782057dlproceedingsArticle/Chapter ViewAbstractPublication PagescasconConference Proceedingsconference-collections
Article
Free access

Automatic parallelization for symmetric shared-memory multiprocessors

Published: 12 November 1996 Publication History

Abstract

The trend in workstation hardware is towards symmetric shared-memory multiprocessors (SMPs). User expectations are for (largely) automatic exploitation of parallelism on an SMP, similar to automatic exploitation of modern processor features such as caches and instruction scheduling.In this paper, we present our solution to automatic SMP parallelization. Our solution is unique in its robust support for unbalanced processor loads and nesting of parallel loops and parallel sections, in conjunction with its tight integration with high-order transformations for improved uniprocessor performance, so that the speedup due to parallelism is truly a multiplicative speedup over highly optimized uniprocessor execution times.

References

[1]
{1} AIX Version 4.1 General Programming Concepts: Writing and Debugging Programs, 1994.
[2]
{2} Jeanne Ferrante, Vivek Sarkar, and Wendy Thrash. On Estimating and Enhancing Cache Effectiveness. Lecture Notes in Computer Science, (589):328-343, 1991. Proceedings of the Fourth International Workshop on Languages and Compilers for Parallel Computing, Santa Clara, California, USA, August 1991. Edited by U. Banerjee, D. Gelernter, A. Nicolau, D. Padua.
[3]
{3} Susan Flynn Hummel and Edith Schonberg. Low-Overhead Scheduling of Nested Parallelism. IBM Journal of Research and Development, 1991.
[4]
{4} Susan Flynn Hummel, Edith Schonberg, and Lawrence Flynn. Factoring: A Practical and Robust Method for Scheduling Parallel Loops. Supercomputing 91, Nov. 1991.
[5]
{5} Clyde Kruskal and Alan Weiss. Allocating Independent Subtasks on Parallel Processors. IEEE Transactions on Software Engineering, SE-11(10), October 1985.
[6]
{6} Jim Q. Ning, Andre Engberts, and W. Kozaczynski. Automated Support for Legacy Code Understanding. Communications of ACM, 5(37):50-57, 1994.
[7]
{7} Constantine D. Polychronopoulos and David J. Kuck. Guided Self-Scheduling: A Practical Scheduling Scheme for Parallel Supercomputers. IEEE Transactions on Computers, C-36(12), December 1987.
[8]
{8} IBM Shared Memory System POWER/4 User's Guide and Technical Reference, 1993.
[9]
{9} Vivek Sarkar. Determining Average Program Execution Times and their Variance. Proceedings of the 1989 SIGPLAN Conference on Programming Language Design and Implementation, 24(7):298-312, July 1989.
[10]
{10} Vivek Sarkar. Automatic Partitioning of a Program Dependence Graph into Parallel Tasks. IBM Journal of Research and Development, 35(5/6), 1991.
[11]
{11} Vivek Sarkar. Automatic Selection of High Order Transformations in the IBM ASTI Optimizer. Technical Report ADTI-96-004, Application Development Technology Institute, IBM Software Solutions Division, September 1996. Submitted to special issue of IBM Journal of Research and Development.
[12]
{12} Vivek Sarkar and Radhika Thekkath. A General Framework for Iteration-Reordering Loop Transformations. Proceedings of the ACM SIGPLAN '92 Conference on Programming Language Design and Implementation, pages 175-187, June 1992.
[13]
{13} IBM VS FORTRAN Version 2: Programming Guide for CMS and MVS, 1993.

Cited By

View all
  • (2015)Experiences in extending parallware to support OpenACCProceedings of the Second Workshop on Accelerator Programming using Directives10.1145/2832105.2832112(1-12)Online publication date: 15-Nov-2015
  • (2009)Exploring parallelization strategies for NUFFT data translationProceedings of the seventh ACM international conference on Embedded software10.1145/1629335.1629361(187-196)Online publication date: 12-Oct-2009
  • (2008)OpenMP tasks in IBM XL compilersProceedings of the 2008 conference of the center for advanced studies on collaborative research: meeting of minds10.1145/1463788.1463810(207-221)Online publication date: 27-Oct-2008
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
CASCON '96: Proceedings of the 1996 conference of the Centre for Advanced Studies on Collaborative research
November 1996
504 pages

Sponsors

  • IBM Canada: IBM Canada
  • NRC: National Research Council - Canada
  • CRSNG: Natural Sci and EngRch Council of Canada

Publisher

IBM Press

Publication History

Published: 12 November 1996

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 24 of 90 submissions, 27%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)23
  • Downloads (Last 6 weeks)6
Reflects downloads up to 09 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2015)Experiences in extending parallware to support OpenACCProceedings of the Second Workshop on Accelerator Programming using Directives10.1145/2832105.2832112(1-12)Online publication date: 15-Nov-2015
  • (2009)Exploring parallelization strategies for NUFFT data translationProceedings of the seventh ACM international conference on Embedded software10.1145/1629335.1629361(187-196)Online publication date: 12-Oct-2009
  • (2008)OpenMP tasks in IBM XL compilersProceedings of the 2008 conference of the center for advanced studies on collaborative research: meeting of minds10.1145/1463788.1463810(207-221)Online publication date: 27-Oct-2008
  • (2007)Designer-controlled generation of parallel and flexible heterogeneous MPSoC specificationProceedings of the 44th annual Design Automation Conference10.1145/1278480.1278676(787-790)Online publication date: 4-Jun-2007
  • (1997)False Sharing Elimination by Selection of Runtime Scheduling ParametersProceedings of the international Conference on Parallel Processing10.5555/645533.656492(396-403)Online publication date: 11-Aug-1997
  • (1997)Optimal weighted loop fusion for parallel programsProceedings of the ninth annual ACM symposium on Parallel algorithms and architectures10.1145/258492.258520(282-291)Online publication date: 1-Jun-1997

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media