Execution Latency Reduction via Variable Latency Pipeline and Instruction Reuse

Toshinori Sato⁶ &
Itsujiro Arita⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2150))

Included in the following conference series:

European Conference on Parallel Processing

744 Accesses
1 Citations

Abstract

Operand bypass logic might be one of the critical structures for future microprocessors to achieve high clock speed. The delay of the logic imposes the execution time budget to be reduced significantly, resulting in that the execution stage is divided into several stages. Variable latency pipeline (VLP) structure has the advantages of pipelining and pseudo-asynchronous design techniques. According to source operands delivered to arithmetic units, the VLP changes execution latency and thus it achieves both high speed and low latency for most of the operands. In this paper we evaluate the VLP on dynamically scheduled superscalar processors using a cycle-by-cycle simulator. Our experimental results show that the VLP is effective for reducing the effective execution time, and thus the constraints on the operand bypass logic is mitigated. We also evaluate instruction reuse technique in order to support the VLP.

Download to read the full chapter text

Chapter PDF

Dual-IS: Instruction Set Modality for Efficient Instruction Level Parallelism

Aligned Scheduling: Cache-Efficient Instruction Scheduling for VLIW Processors

An optimizing pipeline stall reduction algorithm for power and performance on multi-core CPUs

Article Open access 29 January 2015

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Austin T. M.: DIVA: a reliable substrate for deep submicron microarchitecture design. 32nd International Symposium on Microarchitecture (1999)
Google Scholar
Brooks D., Martonosi M.: Dynamically exploiting narrow width operands to improve processor power and performance. 5th International Symposium on High Performance Computer Architecture (1999)
Google Scholar
Burger D., Austin T. M.: The SimpleScalar tool set, version 2.0. ACM SIGARCH Computer Architecture News, 25(3) (1997)
Google Scholar
Hara T., Ando H., Nakanishi C., Nakata M.: Performance comparison of ILP machines with cycle time evaluation. 23rd International Symposium on Computer Architecture (1996)
Google Scholar
Kessler R. E., McLellan E. J., Webb D. A.: The Alpha 21264 microprocessor architecture. International Conference on Computer Design (1998)
Google Scholar
Kondo Y., Ikumi N., Ueno K., Mori J., Hirano M.: An early-completion-detecting ALU for a 1GHz 64b datapath. International Solid State Circuit Conference (1997)
Google Scholar
Oberman S. F., Flynn M. J.: A variable latency pipelined floating-point adder. International Euro-Par Conference (1996)
Google Scholar
Richardson S.E.: Exploiting trivial and redundant computation. 11th International Symposium on Computer Arithmetic (1993)
Google Scholar
Rotenberg E.: AR-SMT: a microarchitectural approach to fault tolerance in microprocessors. 29th Fault-Tolerant Computing Symposium (1999)
Google Scholar
Sodani A., Sohi G. S.: Dynamic instruction reuse. 24th International Symposium on Computer Architecture (1997)
Google Scholar
Yeager K.C.: The MIPS R10000 superscalar microprocessor. IEEE Micro, April (1996)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Artificial Intelligence, USA
Toshinori Sato
Center for Microelectronic Systems, Kyushu Institute of Technology, Japan
Itsujiro Arita

Authors

Toshinori Sato
View author publications
You can also search for this author in PubMed Google Scholar
Itsujiro Arita
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of Manchester, Oxford Road, Manchester, M13 9PL, UK
Rizos Sakellariou , John Gurd & Len Freeman , &
Department of Computation, UMIST, P.O. Box 88, Manchester, M60 1QD, UK
John Keane

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sato, T., Arita, I. (2001). Execution Latency Reduction via Variable Latency Pipeline and Instruction Reuse. In: Sakellariou, R., Gurd, J., Freeman, L., Keane, J. (eds) Euro-Par 2001 Parallel Processing. Euro-Par 2001. Lecture Notes in Computer Science, vol 2150. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44681-8_62

Download citation

DOI: https://doi.org/10.1007/3-540-44681-8_62
Published: 17 August 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42495-6
Online ISBN: 978-3-540-44681-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Execution Latency Reduction via Variable Latency Pipeline and Instruction Reuse

Abstract

Chapter PDF

Similar content being viewed by others

Dual-IS: Instruction Set Modality for Efficient Instruction Level Parallelism

Aligned Scheduling: Cache-Efficient Instruction Scheduling for VLIW Processors

An optimizing pipeline stall reduction algorithm for power and performance on multi-core CPUs

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Execution Latency Reduction via Variable Latency Pipeline and Instruction Reuse

Abstract

Chapter PDF

Similar content being viewed by others

Dual-IS: Instruction Set Modality for Efficient Instruction Level Parallelism

Aligned Scheduling: Cache-Efficient Instruction Scheduling for VLIW Processors

An optimizing pipeline stall reduction algorithm for power and performance on multi-core CPUs

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation