[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article
Free access

Optimal pipelining in supercomputers

Published: 01 May 1986 Publication History

Abstract

This paper examines the relationship between the degree of central processor pipelining and performance. This relationship is studied in the context of modern supercomputers. Limitations due to instruction dependencies are studied via simulations of the CRAY-1S. Both scalar and vector code are studied. This study shows that instruction dependencies severely limit performance for scalar code as well as overall performance.
The effects of latch overhead are then considered. The primary cause of latch overhead is the difference between maximum and minimum gate propagation delays. This causes both the skewing of data as it passes along the data path, and unintentional clock skewing due to clock fanout logic. Latch overhead is studied analytically in order to lower bound the clock period that may be used in a pipelined system. This analysis also touches on other points related to latch clocking. This analysis shows that for short pipeline segments both the Earle latch and polarity hold latch give the same clock period bound for both single-phase and multi-phase clocks. Overhead due to data skew and unintentional clock skew are each added to the CRAY-1S simulation model. Simulation results with realistic assumptions show that eight to ten gate levels per pipeline segment lead to optimal overall performance. The results also show that for short pipeline segments data skew and clock skew contribute about equally to the degradation in performance.

References

[1]
D. W. Anderson, F. J. Sparacio, and R. M. Tomasulo, "The System/360 Model 91" Machine Philosophy and Instruction Handling," IBM Journal, Vol. 11, No. 34, pp. 8-24, January 1967.
[2]
Applied Micro Circuits Corp., Q1500 Series Design Guide, 1985.
[3]
L. W. Cotten, "Circuit Implementation of High-Speed Pipeline Systems", AFIPS Fall Joint Computer Conference, pp. 489-504, 1965.
[4]
CRAY-1 Computer Systems, Hardware Reference Manual, Cray Research, Inc., Chippewa Falls, WI, 1979.
[5]
J. G. Earle, "Latched Carry-Save Adder", IBM Technical Disclosure Bull., Vol. 7, pp. 909-910, March 1965.
[6]
Fairchild, Fast Fairchild Advanced Schottky 1TL, 1984.
[7]
Fairchild, FGE Series Design Manual, Vol. II, 1985.
[8]
B. K. Fawcett, "Maximal Clocking Rates for Pipelined Digital Systems", M.S. Thesis, Dept. Elec. Eng., University of Illinois at Urbana-Champaign, 1975
[9]
M. J. Flynn, "Very High-Speed Computing Systems," Proceedings of the IEEE, Vol. 54, No. 12, pp. 1901-1909, December 1966.
[10]
C.C. Foster and E. M. Riseman, "Percolation of Code to Enhance Parallel Dispatching and Execution", IEEE Trans. Computers, Vol. C-21, No. 12, pp. 1411-1415, December 1972.
[11]
T.G. Hallin and M. J. Flynn, "Pipelining of Arithmetic Functions", IEEE Trans. Computers, Vol. C-21, No. 8, pp. 880-886, August 1972.
[12]
D. J. Kuck, The Structure of Computers and Computations, vol. 1, John Wiley and Sons, New York, 1978.
[13]
F. H. McMahon, "FORTRAN CPU Performance Analysis", Law,nee Livermore Laboratories, 1972.
[14]
Motorola, Inc., MECL Device Data, 1982.
[15]
A. Nicolau, and J. A. Fischer, "Measuring the Parallelism Available for Very Long Instruction Word Architectures," IEEETrans. Vol. C-33, No. 11, November 1984, pp. 968- 976.
[16]
N. Pang and J. E. Smith, "CRAY-1 Simulation Tools", Tech. Report ECE-83-11, University of Wisconsin-Madison, Dec. 1983.
[17]
E. M. Riseman and C. C. Foster, "The inhibition of Potential Parallelism by Conditional Jumps", IEEE Trans. Computers, Vol. C-21, No. 12, pp. 1405-1411, December 1972.
[18]
H. D. Shapiro, "A Comparison of Various Methods for Detecting and Utilizing Parallelism in a Single Instruction Stream", 1977 International Conference on Parallel Processing, pp. 67-76, Aug. 1977.
[19]
J. E. Thornton, Design of a Computer - The Control Data 6600, Scott, Foresman and Co., Glenview, IL, 1970.
[20]
G. S. Tjaden and M. J. Flynn, "Detection and Parallel Execution of Independent Instructions", IEEE Trans. Computers, Vol. C-19, No. 10, pp. 889-895, October 1970.
[21]
J. Worlton, "The Philosophy Behind the Machines," Computer World, Nov. 9. 1981.
[22]
J. Worlton, "Understanding Supercomputer Benchmarks", Datamation, Vol. 30, No. 9, pp. 121-130, September 1, 1984.

Cited By

View all
  • (2021)Reflections on Computer Pipeline Technology from an Analytical PerspectiveProceedings of the 4th International Conference on Computer Science and Software Engineering10.1145/3494885.3494911(140-145)Online publication date: 22-Oct-2021
  • (2021)Superconducting Computing with Alternating Logic Elements2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture (ISCA)10.1109/ISCA52012.2021.00057(651-664)Online publication date: Jun-2021
  • (2015)An energy-delay product study on chip multi-processors for variable stage pipeliningHuman-centric Computing and Information Sciences10.1186/s13673-015-0046-x5:1Online publication date: 21-Sep-2015
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM SIGARCH Computer Architecture News
ACM SIGARCH Computer Architecture News  Volume 14, Issue 2
Special Issue: Proceedings of the 13th annual international symposium on Computer architecture (ISCA '86)
May 1986
429 pages
ISSN:0163-5964
DOI:10.1145/17356
Issue’s Table of Contents
  • cover image ACM Conferences
    ISCA '86: Proceedings of the 13th annual international symposium on Computer architecture
    June 1986
    454 pages
    ISBN:081860719X

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 May 1986
Published in SIGARCH Volume 14, Issue 2

Check for updates

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)234
  • Downloads (Last 6 weeks)38
Reflects downloads up to 18 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Reflections on Computer Pipeline Technology from an Analytical PerspectiveProceedings of the 4th International Conference on Computer Science and Software Engineering10.1145/3494885.3494911(140-145)Online publication date: 22-Oct-2021
  • (2021)Superconducting Computing with Alternating Logic Elements2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture (ISCA)10.1109/ISCA52012.2021.00057(651-664)Online publication date: Jun-2021
  • (2015)An energy-delay product study on chip multi-processors for variable stage pipeliningHuman-centric Computing and Information Sciences10.1186/s13673-015-0046-x5:1Online publication date: 21-Sep-2015
  • (2014)A comparative simulation study on the power---performance of multi-core architectureThe Journal of Supercomputing10.1007/s11227-014-1263-170:1(465-487)Online publication date: 1-Oct-2014
  • (2013)Performance analysis of multi-threaded multi-core CPUsProceedings of the First International Workshop on Many-core Embedded Systems10.1145/2489068.2489076(49-53)Online publication date: 24-Jun-2013
  • (2013)Power-performance of multi-threaded multi-core processor: Analysis, optimization and simulation2013 International Conference on High Performance Computing & Simulation (HPCS)10.1109/HPCSim.2013.6641492(674-677)Online publication date: Jul-2013
  • (2008)Optimal Power/Performance Pipeline Depth for SMT in Scaled TechnologiesIEEE Transactions on Computers10.1109/TC.2007.7077157:1(69-81)Online publication date: 1-Jan-2008
  • (2007)Computer ArchitectureWiley Encyclopedia of Computer Science and Engineering10.1002/9780470050118.ecse071Online publication date: 14-Dec-2007
  • (2006)Long and short covering edges in combination logic circuitsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/43.627699:12(1245-1253)Online publication date: 1-Nov-2006
  • (2006)Synchronization of pipelinesIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/43.23860612:8(1132-1146)Online publication date: 1-Nov-2006
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media