[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/1897179.1897237guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

An efficient algorithm for the physical mapping of clustered task graphs onto multiprocessor architectures

Published: 19 January 2000 Publication History

Abstract

The most important issue in sequential program parallelisation is the efficient assignment of computations into different processing elements. In the past, too many approaches were devoted in efficient program parallelization considering various models for the parallel programs and the target architectures. The most widely used parallelism description model is the task graph model with precedence constraints. Nevertheless, as far as physical mapping of tasks onto parallel architectures is concerned, little research has given practical results. It is well known that the physical mapping problem is NP-hard in the strong sense, thus allowing only for heuristic approaches. Most researchers or tool programmers use exhaustive algorithms, or the classical method of simulated annealing. This paper presents an alternative approach onto the mapping problem. Given the graph of clustered tasks, and the graph of the target distributed architecture, our heuristic finds a mapping by first placing the highly communicative tasks on adjacent nodes of the processor network. Once these "backbone" tasks are mapped, there is no backtracking, thus achieving low complexity. Therefore, the remaining tasks are placed beginning from those close to the "backbone" tasks. The paper concludes with performance and comparison results which reveal the method's efficiency.

References

[1]
H. Ali et H. El-Rewini. Task Allocation in Distributed Systems: A Split Graph Model. Journal of Combinatorial Mathematics and Combinatorial Computing, vol. 14, pp. 15-32, October 1993.
[2]
H. El-Rewini, T. G. Lewis and H. Ali. Task Scheduling in Parallel and Distributed Systems. Prentice Hall, 1994.
[3]
A. Gerasoulis and T. Yang. On the Granularity and Clustering of Directed Acyclic Task Graphs. IEEE Trans. Parallel Distrib. Syst., vol. 4, no. 6, pp. 686- 701, Jan. 1993.
[4]
N. Koziris, G. Papakonstantinou and P. Tsanakas. Optimal Time and Efficient Space Free Scheduling for Nested Loops. The Computer Journal, vol. 39, no 5, pp 439-448, 1996.
[5]
T. Lewis and Hesham El-Rewini. Parallax: A Tool for Parallel Program Scheduling. IEEE Parallel & Distributed Technology, vol. 1, no. 2, May 1993, pp 62-72.
[6]
V. Lo, S. Rajopadhye, S. Gupta, D. Keldsen, M. Mohamed, B. Nitzberg, J. Telle and X. Zhong. OREGAMI: Tools for Mapping Parallel Computations to Parallel Architectures. Int'l Journal of Parallel Programming, vol. 20, no. 3, 1991, pp. 237-270.
[7]
V. Lo. Heuristic Algorithms for Task Assignment in Distributed Systems. IEEE Trans. Comput., vol. C- 37, no. 11, pp. 1384-1397, Nov. 1988.
[8]
B. Monien And H. Sudborough. Embedding one Interconnection Network in Another. Computational Graph Theory, Springer-Verlag, Computing Supplement 7, pp 257-282, 1990.
[9]
C. H. Papadimitriou and M. Yannakakis. Toward an Architecture-Independent Analysis of Parallel Algorithms. SIAM J. Comput., vol. 19, pp. 322- 328, 1990.
[10]
V. Sarkar. Partitioning and Scheduling Parallel Programs for Execution on Multiprocessors. Cambridge, MA: MIT Press, 1989.
[11]
H. Stone. Multiprocessor Scheduling with the Aid of Network Flow Algorithms. IEEE Trans. Soft. Engin., vol. SE-3, no. 1, pp. 85-93, Jul. 1977.
[12]
T. Yang and A. Gerasoulis. PYRROS: Static Task Scheduling and Code Generation for Message Passing Multiprocessors. Proc 6th Int'l Conf. Supercomputing (ICS92), ACM Press, New York, N. Y., 1992, pp. 428-437.

Cited By

View all
  • (2018)An optimized hybrid algorithm in term of energy and performance for mapping real time workloads on 2d based on-chip networksApplied Intelligence10.1007/s10489-018-1246-748:12(4792-4804)Online publication date: 1-Dec-2018
  • (2017)Task Mapping on SMART NoCProceedings of the 54th Annual Design Automation Conference 201710.1145/3061639.3062323(1-6)Online publication date: 18-Jun-2017
  • (2009)Communication-Aware Hierarchical Online-Placement in Heterogeneous Reconfigurable SystemsProceedings of the 2009 IEEE/IFIP International Symposium on Rapid System Prototyping10.1109/RSP.2009.23(61-67)Online publication date: 23-Jun-2009
  • Show More Cited By
  1. An efficient algorithm for the physical mapping of clustered task graphs onto multiprocessor architectures

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    EURO-PDP'00: Proceedings of the 8th Euromicro conference on Parallel and distributed processing
    January 2000
    415 pages
    ISBN:0769505007

    Publisher

    IEEE Computer Society

    United States

    Publication History

    Published: 19 January 2000

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 13 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2018)An optimized hybrid algorithm in term of energy and performance for mapping real time workloads on 2d based on-chip networksApplied Intelligence10.1007/s10489-018-1246-748:12(4792-4804)Online publication date: 1-Dec-2018
    • (2017)Task Mapping on SMART NoCProceedings of the 54th Annual Design Automation Conference 201710.1145/3061639.3062323(1-6)Online publication date: 18-Jun-2017
    • (2009)Communication-Aware Hierarchical Online-Placement in Heterogeneous Reconfigurable SystemsProceedings of the 2009 IEEE/IFIP International Symposium on Rapid System Prototyping10.1109/RSP.2009.23(61-67)Online publication date: 23-Jun-2009
    • (2008)Application-specific networks-on-chip topology customization using network partitioningProceedings of the 1st international forum on Next-generation multicore/manycore technologies10.1145/1463768.1463786(1-6)Online publication date: 24-Nov-2008
    • (2008)Tailoring circuit-switched network-on-chip to application-specific system-on-chip by two optimization schemesACM Transactions on Design Automation of Electronic Systems10.1145/1297666.129767813:1(1-31)Online publication date: 6-Feb-2008
    • (2006)Efficient Techniques for Clustering and Scheduling onto Embedded MultiprocessorsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2006.8717:7(667-680)Online publication date: 1-Jul-2006
    • (2004)Bandwidth-Constrained Mapping of Cores onto NoC ArchitecturesProceedings of the conference on Design, automation and test in Europe - Volume 210.5555/968879.969207Online publication date: 16-Feb-2004
    • (2001)TOPPERProceedings of the 8th Panhellenic conference on Informatics10.5555/1756269.1756291(336-350)Online publication date: 8-Nov-2001

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media