Loop tiling for parallelism: | Guide books

Loop tiling for parallelismAugust 2000

Author:
Jingling Xue
Univ. of New South Wales, Sydney, Australia

Publisher:

Kluwer Academic Publishers
101 Philip Drive Assinippi Park Norwell, MA
United States

ISBN:978-0-7923-7933-1

Published:01 August 2000

Pages:

256

Available at Amazon

Bibliometrics

Abstract

No abstract available.

Cited By

Contributors

Jingling Xue
UNSW Sydney
- Publication Years1992 - 2024
- Publication counts198
- Citation count2,884
- Available for Download120
- Downloads (cumulative)57,262
- Downloads (12 months)11,729
- Downloads (6 weeks)1,457
- Average Downloads per Article477
- Average Citation per Article15
View Full Profile

Index Terms

Loop tiling for parallelism
1. Software and its engineering
  1. Software notations and tools
    1. Compilers
      1. Source code generation

Reviews

Reviewer: Hans J. Schneider

A technique to restructure nested loops for parallel machines is discussed. This monograph focuses on the use of loop tiling for minimizing synchronization and communication cost and maximizing parallelism. The author does not consider other kinds of loop transformation or optimizing for cache locality. The book is organized into three parts. The first part introduces the mathematical background as well as the theory of nonsingular loop transformations. The second deals with rectangular tiling and then addresses the general case of parallelepiped tiling. The last part focuses on minimizing the execution time of a loop nest on a distributed memory machine. Chapter 1 presents some mathematical concepts necessary to understand the subject. The author emphasizes convex cones, which are used to analyze data dependency, loop permutability, legality, and selection of tile size and shape. The next chapter formally introduces the basic concepts of loop transformations. It defines perfectly nested loops, dependence vectors, and their polyhedra, and it treats fully permutable loop nests. Nonsingular transformations, closely related to tiling, are briefly reviewed. Rectangular tiling, covered in chapter 3, uses squares or rectangles of the same size and shape to partition the iteration space; formally, a concrete partitioning is characterized by a tile size vector and a tile offset. In general, the legality of a rectangular tiling can be checked by testing the existence of integer solutions to a system of inequalities; the author also describes a simpler test that is sufficient in practical applications. Finally, he discusses several related transformations, such as strip-mining, loop coalescing, and loop skewing. Chapter 4 extends the investigations to parallelepiped tiling, which offers more opportunities for exposing parallelism, improving locality, and reducing communication overhead. After illustrating this technique with an example and giving the formal definition, the author again discusses how to use integer programming to test legality exactly and then describes a practical test. Finally, he presents an alternative formal model that can clarify the duality between loop tiling and loop partitioning. The final chapters consider implementation on a distributed memory machine. Chapter 5 describes a suite of compiler techniques to generate a single-program, multiple data (SPMD) program to execute a tiled iteration space. Tiles are assigned to processors as atomic units of computation and then data distribution is derived using the computer-owns rule, by which a processor owns the data it computes. A detailed formal discussion of message-passing code generation, local memory management, and global-to-local address translation follows. Some experimental results round out the chapter. Chapter 6 addresses the problem of determining the best tile shape to minimize inter-tile communication if the tile size is given; this analysis assumes constant distance vectors. Conversely, chapter 7 deals with the more difficult problem of finding the best tile size once the shape is known, but restricting discussion to a two-dimensional iteration space. The author writes well and, at the beginning of each chapter, states what that chapter will cover. The book is clearly organized, and “Further Reading” sections are helpful for readers who are entering the field. Overall, the author succeeds in giving a consistent presentation of the state of the art in a specialized topic. The book can be used as a reference by professionals in compiler techniques, but they should be very familiar with the notation and terminology of linear algebra.

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Tiling imperfectly-nested loop nests
SC '00: Proceedings of the 2000 ACM/IEEE conference on Supercomputing

Tiling is one of the more important transformations for enhancing loca lity of reference in programs. Intuitively, tiling a set of loops achieves the effect of interleaving iterations of these loops. Tiling of perfectly-nested loop nests (which are loop ...
Tiling Imperfectly-nested Loop Nests (REVISED)
Parameterized loop tiling

Loop tiling is a widely used program optimization that improves data locality and enables coarse-grained parallelism. Parameterized tiled loops, where the tile sizes remain symbolic parameters until runtime, are quite useful for iterative compilers and ...

Browse Books

Sections

Cited By

Index Terms

Reviews

Access critical reviews of Computing literature here

Tiling imperfectly-nested loop nests

Tiling Imperfectly-nested Loop Nests (REVISED)

Parameterized loop tiling

Save to Binder

Sections

Cited By

Save to Binder

Index Terms

Reviews

Access critical reviews of Computing literature here

Recommendations

Tiling imperfectly-nested loop nests

Tiling Imperfectly-nested Loop Nests (REVISED)

Parameterized loop tiling