Parallel loops are one of the main sources of parallelism in scientific applications, and many parallel loops do not have a uniform iteration execution time. To achieve good performance for such applications on a parallel computer, the iterations of a parallel loop must be assigned to processors so that each processor receives roughly the same amount of work in terms of execution time. Parallel computers with large numbers of processors tend to have distributed memory, and running a parallel loop on a distributed-memory machine also requires data distribution to be considered. This research investigates the scheduling of non-uniform parallel loops on both shared-memory and distributed-memory parallel computers.
We present Safe Self-Scheduling (SSS), a new scheduling scheme that combines the advantages of static and dynamic scheduling. SSS has two phases: a static scheduling phase and a dynamic self-scheduling phase, which together reduce scheduling overhead while achieving a well-balanced workload. The techniques introduced in SSS can also be used by other self-scheduling schemes. The static scheduling phase further improves performance by maintaining a high cache hit ratio, a result of the increased affinity of iterations to processors. SSS is also well suited to distributed-memory machines.
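For concreteness, the following is a minimal shared-memory sketch of the general two-phase idea behind SSS: each processor first executes a statically assigned block of iterations and then self-schedules the remainder in small chunks from a shared counter. The static fraction ALPHA, the chunk size, the thread count, and the loop body are illustrative assumptions, not the actual SSS allocation formulas from the dissertation.

```cpp
// Two-phase (static + dynamic self-scheduling) loop sketch for shared memory.
#include <algorithm>
#include <atomic>
#include <cmath>
#include <cstdio>
#include <thread>
#include <vector>

constexpr int    N     = 1000;   // total loop iterations (assumed)
constexpr int    P     = 4;      // number of worker threads (assumed)
constexpr double ALPHA = 0.5;    // fraction of work assigned statically (assumed)
constexpr int    CHUNK = 8;      // dynamic-phase chunk size (assumed)

std::vector<double> result(N);
std::atomic<int> next_iter;      // first iteration not yet claimed dynamically

// Non-uniform loop body: cost grows with the iteration index.
void body(int i) {
    double s = 0.0;
    for (int k = 0; k < i; ++k) s += std::sin(k * 1e-3);
    result[i] = s;
}

void worker(int id) {
    // Phase 1: an even share of the first ALPHA*N iterations, assigned statically.
    int static_total = static_cast<int>(ALPHA * N);
    int lo = id * static_total / P;
    int hi = (id + 1) * static_total / P;
    for (int i = lo; i < hi; ++i) body(i);

    // Phase 2: self-schedule the remaining iterations in small chunks.
    for (;;) {
        int start = next_iter.fetch_add(CHUNK);
        if (start >= N) break;
        int end = std::min(start + CHUNK, N);
        for (int i = start; i < end; ++i) body(i);
    }
}

int main() {
    next_iter = static_cast<int>(ALPHA * N);   // dynamic phase begins here
    std::vector<std::thread> threads;
    for (int id = 0; id < P; ++id) threads.emplace_back(worker, id);
    for (auto& t : threads) t.join();
    std::printf("result[N-1] = %f\n", result[N - 1]);
    return 0;
}
```

The static phase keeps most iterations bound to a fixed processor (which is what preserves affinity and cache reuse), while the shared counter lets faster processors absorb the load imbalance left over from the non-uniform iteration times.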
We introduce methods for duplicating data on a number of processors. These methods eliminate data movement during the computation and improve scalability with respect to problem size. We describe a systematic approach to implementing a given self-scheduling scheme on a distributed-memory machine, and we present a multilevel scheduling scheme that self-schedules parallel loops on distributed-memory machines with large numbers of processors, eliminating the bottleneck of a central scheduler.
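The sketch below shows, under assumed chunk sizes and message tags, how centralized self-scheduling might be expressed on a distributed-memory machine with MPI: rank 0 acts as the scheduler and hands out iteration ranges on request, while the loop data are assumed to be replicated on every worker so no data movement occurs during the computation. The central scheduler here is exactly the bottleneck a multilevel scheme would remove by interposing group-level schedulers; this is an illustrative sketch, not the dissertation's implementation.

```cpp
// Centralized self-scheduling on distributed memory: scheduler/worker with MPI.
#include <mpi.h>
#include <algorithm>
#include <cmath>
#include <cstdio>

constexpr int N = 1000, CHUNK = 16;        // assumed problem and chunk sizes
constexpr int TAG_REQ = 1, TAG_WORK = 2;   // assumed message tags

double body(int i) {                        // non-uniform iteration cost
    double s = 0.0;
    for (int k = 0; k < i; ++k) s += std::sin(k * 1e-3);
    return s;
}

int main(int argc, char** argv) {
    MPI_Init(&argc, &argv);
    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    if (rank == 0) {                        // central scheduler
        int next = 0, finished = 0;
        while (finished < size - 1) {
            int dummy;
            MPI_Status st;
            MPI_Recv(&dummy, 1, MPI_INT, MPI_ANY_SOURCE, TAG_REQ,
                     MPI_COMM_WORLD, &st);
            int range[2] = {next, std::min(next + CHUNK, N)};
            if (range[0] >= N) { range[0] = range[1] = -1; ++finished; }
            MPI_Send(range, 2, MPI_INT, st.MPI_SOURCE, TAG_WORK,
                     MPI_COMM_WORLD);
            next = std::min(next + CHUNK, N);
        }
    } else {                                // worker: request, compute, repeat
        double local = 0.0;
        for (;;) {
            int dummy = 0, range[2];
            MPI_Send(&dummy, 1, MPI_INT, 0, TAG_REQ, MPI_COMM_WORLD);
            MPI_Recv(range, 2, MPI_INT, 0, TAG_WORK, MPI_COMM_WORLD,
                     MPI_STATUS_IGNORE);
            if (range[0] < 0) break;        // no work left: terminate
            for (int i = range[0]; i < range[1]; ++i) local += body(i);
        }
        std::printf("rank %d partial sum %f\n", rank, local);
    }
    MPI_Finalize();
    return 0;
}
```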
We propose a method that uses abstractions to automate both self-scheduling and data distribution in parallel programming environments. The abstractions are tested in CHARM, a real parallel programming environment. We also develop methods to tolerate processor faults, caused either by physical failure or by reassignment of processors by the operating system, during the execution of a parallel loop.
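As one illustration of re-execution-based fault tolerance in a self-scheduled loop (not the dissertation's actual protocol or its CHARM-based abstractions), the sketch below tracks a status flag per chunk: chunks that were claimed by a processor that is subsequently lost are released back to the pool so surviving processors can redo them. The chunk count and the simulated failure are assumptions for the demonstration.

```cpp
// Re-execution sketch: orphaned (claimed but unfinished) chunks are re-issued.
#include <atomic>
#include <cstdio>
#include <thread>
#include <vector>

constexpr int N_CHUNKS = 64;                 // assumed number of iteration chunks
enum Status { FREE = 0, CLAIMED = 1, DONE = 2 };
std::vector<std::atomic<int>> status(N_CHUNKS);

void worker(int id, bool dies_early) {
    for (int c = 0; c < N_CHUNKS; ++c) {
        int expected = FREE;
        if (!status[c].compare_exchange_strong(expected, CLAIMED)) continue;
        if (dies_early) return;              // simulate losing this processor
                                             // after it claims, before it finishes
        /* ... execute the iterations of chunk c ... */
        status[c].store(DONE);
    }
}

int main() {
    for (auto& s : status) s.store(FREE);

    std::thread t1(worker, 0, true), t2(worker, 1, false);
    t1.join(); t2.join();

    // Recovery: release chunks the lost worker claimed but never finished,
    // then let a surviving worker sweep the loop again.
    for (auto& s : status)
        if (s.load() == CLAIMED) s.store(FREE);
    std::thread t3(worker, 2, false);
    t3.join();

    int done = 0;
    for (auto& s : status) done += (s.load() == DONE);
    std::printf("%d of %d chunks completed\n", done, N_CHUNKS);
    return 0;
}
```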
We evaluated the techniques discussed above using both simulations and real applications, and obtained good results on both shared-memory and distributed-memory parallel computers.
Index Terms
- Scheduling non-uniform parallel loops on MIMD computers