[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

On the trade-off between power and flexibility of FPGA clock networks

Published: 19 September 2008 Publication History

Abstract

FPGA clock networks consume a significant amount of power, since they toggle every clock cycle and must be flexible enough to implement the clocks for a wide range of different applications. The efficiency of FPGA clock networks can be improved by reducing this flexibility; however, reducing the flexibility introduces stricter constraints during the clustering and placement stages of the FPGA CAD flow. These constraints can reduce the overall efficiency of the final implementation. This article examines the trade-off between the power consumption and flexibility of FPGA clock networks.
Specifically, this article makes three contributions. First, it presents a new parameterized clock-network framework for describing and comparing FPGA clock networks. Second, it describes new clock-aware placement techniques that are needed to find a legal placement satisfying the constraints imposed by the clock network. Finally, it performs an empirical study to examine the trade-off between the power consumption of the clock network and the impact of the CAD constraints for a number of different clock networks with varying amounts of flexibility.
The results show that the techniques used to produce a legal placement can have a significant influence on power and the ability of the placer to find a legal solution. On average, circuits placed using the most effective techniques dissipate 5% less overall energy and are significantly more likely to be legal than circuits placed using other techniques. Moreover, the results show that the architecture of the clock network is also important. On average, FPGAs with an efficient clock network are up to 14.6% more energy efficient compared to other FPGAs.

References

[1]
ACTEL. 2007. ProASIC3 flash family FPGAs datasheet: Device architecture (Jan.).
[2]
ALTERA. 2005. Stratix II Device Handbook 1. Chapter 2 (Mar.).
[3]
ALTERA. 2006. Stratix III Device Handbook 1. Chapter 6 (Nov.).
[4]
Bakolgu, H. B. 1990. Machine organization and rent's rule. In Circuits, Interconnections, and Packaging for VLSI. Addison-Wesley, Reading, MA.
[5]
Betz, V., Rose, J., and Marquardt, A. 1999. Architecture and CAD for Deep-Submicron FPGAs. Kluwer Academic.
[6]
Brynjolfson, I. and Zilic, Z. 2000. Dynamic clock management for low-power applications in FPGAs. In Proceedings of the IEEE Custom Integrated Circuits Conference (CICC), 139--142.
[7]
Edahiro, M. and Lipton, R. J. 1996. Clock buffer placement algorithm for wire-delay-dominated timing model. In Proceedings of the Great Lakes Symposium on VLSI, 143--147.
[8]
Friedman, E. G. 2001. Clock distribution networks in synchronous digital integrated circuits. Proc. IEEE 89, 5 (May), 665--692.
[9]
George, V., Zang, H., and Rabaey, J. 1999. The design of a low-energy FPGAs. In Proceedings of the International Symposium on Low-Power Electronic Design (ISLPED), 188--193.
[10]
Lamoureux, J., and Wilton, S. J. E. 2005. On the interaction between power-aware computer-aided design algorithms for field-programmable gate arrays. J. Low Power Electron. 1, 2, 119--132.
[11]
Lamoureux, J. and Wilton, S. J. E. 2006. FPGA clock network architecture: Flexibility vs. area and power. In Proceedings of the ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA), 101--108.
[12]
Lamoureux, J. and Wilton, S. J. E. 2007. Clock-Aware placement for FPGAs. In Proceedings of the International Conference on Field-Programmable Logic and Applications (FPL).
[13]
Landman, B. S. and Russo, R. L. 1971. On a pin versus block relationship for partitions of logic graphs. IEEE Trans. Comput. C-20, 12, 1469--1479.
[14]
Li, F., Chen, D., He, L., and Cong, J. 2003. Architecture evaluation for power-efficient FPGAs. In Proceedings of the International Symposium on Field-Programmable Gate Arrays (FPGA), 175--184.
[15]
Nakamura, S. and Masson, G. M. 1982. Lower bounds on crosspoints in concentrators. IEEE Trans. Comput. 31, 12, 1173--1178.
[16]
Poon, K. K. W. and Wilton, S. J. E. 2005. A detailed power model for field-programmable gate arrays. ACM Trans. Des. Autom. Electron. Syst.10, 2 (Apr.), 279--302.
[17]
Natesan, V. and Bhatia, D. 1996. Clock-Skew constrained cell placement. In Proceedings of the International Conference on VLSI Design, 146--149.
[18]
Schabas, K. and Brown, S. D. 2003. Using logic duplication to improve performance in FPGAs. In Proceedings of the ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA), 136--142.
[19]
Tuan, T., Kao, S., Rahman, A., Das, S., and Trimberger, S. 2006. A 90nm low-power FPGA for battery-powered applications. In Proceedings of the ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA), 3--11.
[20]
XILINX. 2007. Virtex-5 User Guide. Chapter 2 (Feb).
[21]
Zhu, K. and Wong, D. F. 1997. Clock skew minimization during FPGA placement. IEEE Trans. Comput. Aided Des. Integr. Circ. 16, 4 (Apr.), 376--385.

Cited By

View all
  • (2019)Simultaneous Placement and Clock Tree Construction for Modern FPGAsProceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays10.1145/3289602.3293897(132-141)Online publication date: 20-Feb-2019
  • (2018)UTPlaceF 2.0ACM Transactions on Design Automation of Electronic Systems10.1145/317484923:4(1-23)Online publication date: 9-May-2018
  • (2016)Stratix™ 10 High Performance Routable Clock NetworksProceedings of the 2016 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays10.1145/2847263.2847279(64-73)Online publication date: 21-Feb-2016
  • Show More Cited By

Index Terms

  1. On the trade-off between power and flexibility of FPGA clock networks

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM Transactions on Reconfigurable Technology and Systems
        ACM Transactions on Reconfigurable Technology and Systems  Volume 1, Issue 3
        September 2008
        135 pages
        ISSN:1936-7406
        EISSN:1936-7414
        DOI:10.1145/1391732
        Issue’s Table of Contents
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 19 September 2008
        Accepted: 01 April 2008
        Revised: 01 March 2008
        Received: 01 July 2007
        Published in TRETS Volume 1, Issue 3

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. FPGA
        2. clock distribution networks
        3. clock-aware placement
        4. low-power design

        Qualifiers

        • Research-article
        • Research
        • Refereed

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)6
        • Downloads (Last 6 weeks)1
        Reflects downloads up to 18 Dec 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2019)Simultaneous Placement and Clock Tree Construction for Modern FPGAsProceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays10.1145/3289602.3293897(132-141)Online publication date: 20-Feb-2019
        • (2018)UTPlaceF 2.0ACM Transactions on Design Automation of Electronic Systems10.1145/317484923:4(1-23)Online publication date: 9-May-2018
        • (2016)Stratix™ 10 High Performance Routable Clock NetworksProceedings of the 2016 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays10.1145/2847263.2847279(64-73)Online publication date: 21-Feb-2016
        • (2012)A complete dynamic power estimation model for data-paths in FPGA DSP designsIntegration, the VLSI Journal10.1016/j.vlsi.2011.09.00245:2(172-185)Online publication date: 1-Mar-2012
        • (2011)Net-length-based routability-driven power-aware clusteringACM Transactions on Reconfigurable Technology and Systems10.1145/2068716.20687244:4(1-16)Online publication date: 28-Dec-2011
        • (2010)Power characterisation for fine-grain reconfigurable fabricsInternational Journal of Reconfigurable Computing10.1155/2010/7874052010(2-2)Online publication date: 1-Jan-2010
        • (2010)Energy-Aware Optimisation for Run-Time ReconfigurationProceedings of the 2010 18th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines10.1109/FCCM.2010.17(55-62)Online publication date: 2-May-2010
        • (2009)Power-aware FPGA packing algorithm2009 IEEE 8th International Conference on ASIC10.1109/ASICON.2009.5351571(817-819)Online publication date: Oct-2009

        View Options

        Login options

        Full Access

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media