DOI: 10.1145/3377049.3377088
research-article

Relaxed Reorder Buffer Commit with Batch Context Switch

Published: 20 March 2020

Abstract

Most modern processors achieve instruction-level parallelism by executing instructions out of their original program order. Although instructions may execute out of order, they must commit in order once execution ends. Traditional out-of-order processors typically use a Reorder Buffer (ROB) to enforce in-order commit, but the ROB can stall the entire processor when the instruction at its head is blocked. In this paper, we present an alternative mechanism that lets the reorder buffer commit instructions out of order. The proposed mechanism extends the ROB with a Stalled Instruction Zone (SI-Zone), which holds batches of dependent instructions and allows out-of-order commit from the ROB. The instruction that causes the initial stall and the instructions that depend on it are moved to the SI-Zone, making way for the remaining instructions to commit from the ROB. Once the stall clears, the SI-Zone commits its instructions in batch mode. The result is an efficient ROB-based processor that increases both the effective size of the ROB and the number of physical registers.
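
The commit policy described in the abstract can be illustrated with a small toy model. This is only a sketch under stated assumptions, not the paper's implementation: the `Instr` record, the `stalls` flag, and the functions `split_rob` and `commit_order` are hypothetical names introduced here, dependences are tracked through register names only, and memory dependences, exceptions, and register renaming are ignored.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Instr:
    """Hypothetical ROB entry used only for this illustration."""
    name: str
    dest: Optional[str]          # register written, or None
    srcs: Tuple[str, ...] = ()   # registers read
    stalls: bool = False         # assumed: a long-latency event blocks this entry


def split_rob(rob: List[Instr]) -> Tuple[List[Instr], List[Instr]]:
    """Walk the ROB in program order and separate early commits from the SI-Zone batch."""
    committed: List[Instr] = []
    si_zone: List[Instr] = []
    pending = set()  # registers whose values wait on a stalled instruction
    for ins in rob:
        if ins.stalls or any(reg in pending for reg in ins.srcs):
            # The stalling instruction and its dependents move to the SI-Zone.
            si_zone.append(ins)
            if ins.dest is not None:
                pending.add(ins.dest)
        else:
            # Independent instructions commit from the ROB without waiting.
            committed.append(ins)
            # A fresh write supersedes the pending value for later readers.
            pending.discard(ins.dest)
    return committed, si_zone


def commit_order(rob: List[Instr]) -> List[str]:
    """Return instruction names in the order they would commit in this toy model."""
    committed, si_zone = split_rob(rob)
    # Once the stall clears, the SI-Zone commits its batch in program order.
    return [i.name for i in committed] + [i.name for i in si_zone]


rob = [
    Instr("load r1", dest="r1", stalls=True),
    Instr("add r2", dest="r2", srcs=("r1",)),
    Instr("mul r3", dest="r3", srcs=("r4", "r5")),
    Instr("sub r6", dest="r6", srcs=("r3",)),
    Instr("or r7", dest="r7", srcs=("r2", "r6")),
]

print(commit_order(rob))
```

In this example the stalled load and the two instructions that depend on it (directly or transitively) are set aside, so `mul r3` and `sub r6` commit first and the SI-Zone batch follows once the stall clears.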

Information & Contributors

Information

Published In

ICCA 2020: Proceedings of the International Conference on Computing Advancements
January 2020
517 pages
ISBN:9781450377782
DOI:10.1145/3377049
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. batch processing
  2. context switch
  3. reorder buffer

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICCA 2020

Article Metrics

  • Total Citations: 0
  • Total Downloads: 84
  • Downloads (Last 12 months): 9
  • Downloads (Last 6 weeks): 1
Reflects downloads up to 11 Dec 2024
