[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1165573.1165584acmconferencesArticle/Chapter ViewAbstractPublication PagesislpedConference Proceedingsconference-collections
Article

Selective writeback: exploiting transient values for energy-efficiency and performance

Published: 04 October 2006 Publication History

Abstract

Today's superscalar microprocessors use large, heavily-ported physical register files (RFs) to increase the instruction throughput. The high complexity and power dissipation of such RFs mainly stem from the need to maintain each and every result for a large number of cycles after the result generation. We observed that a significant fraction (about 45%) of the result values are delivered to their consumers via the bypass network (consumed "on-the-fly") and are never read out from the destination registers. In this paper, we first formulate conditions for identifying such transient values and describe their micro-architectural implementation; then we propose a technique to avoid the writeback of such transient values into the RF. With 64-entry integer and floating point register files, our technique achieves an 11% performance improvement and 29% reduction in the RF energy consumption compared to the baseline machine with the same number of registers. Furthermore, for the same performance target, the Selective Writeback scheme results in a 38% reduction in the energy consumption of the RF compared to the baseline machine.

References

[1]
Aggarwal, A., Franklin, M., "Asymmetrically-ported Register Files", in Proc. of ICCD 2003.
[2]
Azevedo, A., et.al., "Profile-based Dynamic Voltage Scheduling using Program Checkpoints in COPPER Framework", in Proc. of DATE, 2002.
[3]
Balakrishnan, S., Sohi, G., "Exploiting Value Locality in Physical Register Files", in Proc. of MICRO-36, 2003.
[4]
Balasubramonian, R.,et. al., "Reducing the Complexity of the Register File in Dynamic Superscalar Processor", in Proc. of MICRO-34, 2001.
[5]
Borch, E. et al, "Loose Loops Sink Chips", in Proc. of HPCA-8, 2002.
[6]
Burger, D. and Austin, T. M., "The SimpleScalar tool set: Version 2.0", Tech. Report, Dept. of CS, Univ. of Wisconsin-Madison, June 1997.
[7]
Chrysos G., J.Emer, "Memory Dependence Prediction using Store Sets", in Proc. of ISCA-25, 1998.
[8]
Cruz, J-L. et. al., "Multiple-Banked Register File Architecture", in Proc. of ISCA-27, 2000.
[9]
Ergin O., et.al., "Increasing Processor Performance through Early Register Release", in Proc. of ICCD, 2004.
[10]
Ergin O., et.al., "Register Packing: Exploiting Narrow-Width Operands for Reducing Register File Pressure", in Proc. of .MICRO-37, 2004.
[11]
Gonzalez, A., Gonzalez, J., Valero, M., "Virtual-Physical Registers", in Proc. of HPCA-4, 1998.
[12]
Gonzalez, R. et. al. "A content Aware Register File Organization", in Proc. of ISCA-31, 2004.
[13]
Jones T. et al., "Compiler Directed Early Register Release", in Proc. of PACT 2005.
[14]
Jourdan, S.,et. al. "A Novel Renaming Scheme to Exploit Value Temporal Locality through Physical Register Reuse and Unification", in Proc. of MICRO-31, 1998.
[15]
Kim, N., Mudge, T., "Reducing Register Ports Using Delayed Write-Back Queues and Operand Pre-Fetch", in Proc. of ICS-17, 2003.
[16]
Kondo M. and Nakamura H. "A Small, Fast and Low-Power Register File by Bit-Partitioning", in Proc.of HPCA-11, 2005.
[17]
Lipasti, M., et.al., "Physical Register Inlining", in Proc. of ISCA-31, 2004.
[18]
Lozano, G. and Gao, G., "Exploiting Short-Lived Variables in Superscalar Processors", in Proc. MICRO-28, 1995.
[19]
Martin, M., Roth, A., Fischer, C., "Exploiting Dead Value Information", in Proc. of MICRO-30, 1997.
[20]
Martinez, J., et. al., "Cherry: Checkpointed Early Resource Recycling in Out-of-order Microprocessors", in Proc. of MICRO-35, 2002.
[21]
Monreal, T., Vinals, V., Gonzalez, A., Valero, M. "Hardware Schemes for Early Register Release", in Proc. of ICPP-02, 2002.
[22]
Park, I., Powell, M., Vijaykumar, T., "Reducing Register Ports for Higher Speed and Lower Energy", in Proc. of MICRO-35, 2002.
[23]
Ponomarev, D., et. al."Reducing Datapath Energy Through the Isolation of Short-Lived Operands", in Proc. of PACT-12, 2003.
[24]
Tseng, J., Asanovic, K., "Banked Multiported Register Files for High Frequency Superscalar Microprocessors", in Proc. of ISCA-30, 2003.

Cited By

View all
  • (2018)Selective writebackIEEE Transactions on Very Large Scale Integration (VLSI) Systems10.1109/TVLSI.2008.200024316:6(650-661)Online publication date: 29-Dec-2018
  • (2007)An L2-miss-driven early register deallocation for SMT processorsProceedings of the 21st annual international conference on Supercomputing10.1145/1274971.1274992(138-147)Online publication date: 17-Jun-2007

Index Terms

  1. Selective writeback: exploiting transient values for energy-efficiency and performance

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ISLPED '06: Proceedings of the 2006 international symposium on Low power electronics and design
    October 2006
    446 pages
    ISBN:1595934626
    DOI:10.1145/1165573
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 04 October 2006

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. energy-efficiency
    2. register files

    Qualifiers

    • Article

    Conference

    ISLPED06
    Sponsor:
    ISLPED06: International Symposium on Low Power Electronics and Design
    October 4 - 6, 2006
    Bavaria, Tegernsee, Germany

    Acceptance Rates

    Overall Acceptance Rate 398 of 1,159 submissions, 34%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2018)Selective writebackIEEE Transactions on Very Large Scale Integration (VLSI) Systems10.1109/TVLSI.2008.200024316:6(650-661)Online publication date: 29-Dec-2018
    • (2007)An L2-miss-driven early register deallocation for SMT processorsProceedings of the 21st annual international conference on Supercomputing10.1145/1274971.1274992(138-147)Online publication date: 17-Jun-2007

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media