[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article
Open access

Thermal monitoring mechanisms for chip multiprocessors

Published: 03 September 2008 Publication History

Abstract

With large-scale integration and increasing power densities, thermal management has become an important tool to maintain performance and reliability in modern process technologies. In the core of dynamic thermal management schemes lies accurate reading of on-die temperatures. Therefore, careful planning and embedding of thermal monitoring mechanisms into high-performance systems becomes crucial. In this paper, we propose three techniques to create sensor infrastructures for monitoring the maximum temperature on a multicore system. Initially, we extend a nonuniform sensor placement methodology proposed in the literature to handle chip multiprocessors (CMPs) and show its limitations. We then analyze a grid-based approach where the sensors are placed on a static grid covering each core and show that the sensor readings can differ from the actual maximum core temperature by as much as 12.6°C when using 16 sensors per core. Also, as large as 10.6% of the thermal emergencies are not captured using the same number of sensors. Based on this observation, we first develop an interpolation scheme, which estimates the maximum core temperature through interpolation of the readings collected at the static grid points. We show that the interpolation scheme improves the measurement accuracy and emergency coverage compared to grid-based placement when using the same number of sensors. Second, we present a dynamic scheme where only a subset of the sensor readings is collected to predict the maximum temperature of each core. Our results indicate that, we can reduce the number of active sensors by as much as 50%, while maintaining similar measurement accuracy and emergency coverage compared to the case where the entire sensor set on the grid is sampled at all times.

References

[1]
Binkert, N. L. D. et al. 2006. The M5 simulator: Modeling networked systems. IEEE Micro, 26, 4, 52--60.
[2]
Blackburn, D. L. 2004. Temperature measurements of semiconductor devices -- A Review. In Semiconductor Thermal Measurement, Modeling and Management Symposium.
[3]
Borkar, S. 1999. Design challenges of technology scaling. IEEE Micro (July-Aug.) 19, 4, 23--29.
[4]
Brooks, D., Tiwari, V., and Martonosi, M. 2000. Wattch: A framework for architectural-level power analysis and optimizations. In International Symposium on Computer Architecture.
[5]
Borkar, S., Karnik, T., Narendra, S., Tschanz, J., Keshavarzi, A., and De, V. 2003. Parameter variations and impact on circuits and microarchitecture. In Design Automation Conference. Anaheim, CA.
[6]
Borkar, S. et al. 2005. Platform 2015: Intel processor and platform evolution for the next decade. Whitepaper.
[7]
Bratek, P. and Kos, A. 2001. Temperature sensors placement strategy for fault diagnosis in integrated circuits. In Symposium on Semiconductor Thermal Measurement and Management.
[8]
Cong, J., Wei, J., and Zhang, Y. 2004. A thermal-driven floorplanning algorithm for 3D ICs. In International Conference on Computer-Aided Design.
[9]
Dorsey, J. et al. 2007. An integrated quad-core opteron processor. In International Solid-State Circuits Conference.
[10]
Gunther, S. et al. 2001. Managing the impact of increasing microprocessor power consumption. Intel Tech. J.
[11]
http://www.capgo.com/Resources/Temperature/Semiconductor/Semi.html. Introduction to Semiconductor Temperature Sensors.
[12]
http://download.intel.com/design/Pentium4/datashts/29864312.pdf. Intel Pentium 4 Processor with 512-KB L2 Cache on 0.13 Micron Process Thermal Design Guidelines. 2002.
[13]
http://www.maxim-ic.com/appnotes.cfm?an_pk=689. IC Temperature Sensor Find the Hot Spots. 2001.
[14]
Huang, W. et al. 2004. Compact thermal modeling for temperature-aware design. In Design Automation Conference.
[15]
Huang, W., et al. 2000. A framework for dynamic energy efficiency and temperature management. In International Symposium on Microarchitecture.
[16]
Juang, P. et al. 2005. Coordinated, distributed, formal energy management of chip multiprocessors. In International Symposium on Low Power Electronics and Systems.
[17]
Kaxiras, S. and Xekalakis, P. 2004. 4T-decay sensors: A new class of small, fast, robust, and low-power, temperature/leakage sensors. In International Symposium on Low Power Electronics and Design.
[18]
Kongetira, P., Aingaran, K., and Olukotun, K. 2005. Niagara: A 32-way multithreaded sparc processor. IEEE Micro, 25, 2, 21--29.
[19]
Krinitsin, V. Pentium 4 and Athlon XP: Thermal Conditions. Available from http://www.digit-life.com/articles/pentium4athlonxpthermalmanagement/.
[20]
Kumar, R., Zyuban, V., and Tullsen, D. M. 2005. Interconnections in multi-core architectures: understanding mechanisms, overheads and scaling. In International Symposium on Computer Architecture.
[21]
Lee, K.-J., Skadron, K., and Huang, W. 2005. Analytical model for sensor placement on microprocessors. In International Conference on Computer Design.
[22]
Li, Y., Brooks, D., Brooks, Hu, Z., Skadron, K. 2005. Performance, energy, and thermal considerations for SMT and CMP architectures. In Proceedings of the 11th International Symposium on High-Performance Computer Architecture. San Francisco, CA.
[23]
Liu, Y. et al. 2007. Accurate Temperature-dependent integrated circuit leakage power estimation is easy. In Design, Automation and Test in Europe.
[24]
Lopez-Buedo, S., Garrido, J., and Boemo, E. I. 2002. Dynamically inserting, operating, and eliminating thermal sensors of FPGA-based systems. IEEE Trans. Components Packaging Tech. 25, 4, 561--566.
[25]
Mondal, S., Mukherjee, R., and Memik, S. O. 2006. Fine-grain thermal profiling and sensor insertion for FPGAs. In IEEE International Symposium on Circuits and Systems.
[26]
Mukherjee, R. and Memik O. S. 2006. Systematic temperature sensor allocation and placement for microprocessors. In IEEE/ACM Design Automation Conference (DAC). San Francisco, CA.
[27]
Mukherjee, R., Mondal, S., and Memik, S. O. 2006. Thermal sensor allocation and placement for reconfigurable systems. In IEEE/ACM International Conference on Computer-Aided Design (ICCAD). San Jose, CA.
[28]
Pham, D. et al. 2005. The design and implementation of a first generation cell processor. In International Solid-State Circuits Conference.
[29]
Powell, M. D., Gomma, M., and Vijaykumar, T. N. 2004. Heat-and-Run: Leveraging SMT and CMP to manage power density through the operating system. In International Conference on Architectural Support for Programming Languages and Operating Systems.
[30]
Poirier, C. et al. 2005. Power and temperature control on a 90nm itanium-family processor. In International Solid-State Circuits Conference.
[31]
Quenot, G. M., Paris, N., and Zavidovique, B. 1991. A Temperature and voltage measurement cell for VLSI circuits. In Euro ASIC Conference.
[32]
Rattner, J. R. 2005. Keynote at the intel developer conference. Available from http://www.intel.com/technology/techresearch/idf/platform-2015-keynote.htm.
[33]
Rotem, E. et al. 2004. Analysis of thermal monitor features of the Intel Pentium M processor. In Workshop on Temperature-aware Computer Systems.
[34]
Rotem, E. et al. 2006. Temperature measurement in the Intel core duo processor. In International Workshop on Thermal Investigations of ICs.
[35]
Sankaranarayanan, K. et al. 2005. A case for thermal-aware floorplanning at the microarchitectural level. J. Instruction-Level Parallelism. 7, 1--16.
[36]
Sherwood, T., Perelman, E., and Calder, B. 2001. Basic block distribution analysis to find periodic behavior and simulation points in applications. In International Conference on Parallel Architectures and Compilation Techniques.
[37]
Skadron, K. et al. 2003. Temperature-aware microarchitecture. In International Symposium on Computer Architecture.
[38]
SPEC-CPU2000. 2000. Standard Performance Evaluation Council, Performance Evaluation in the New Millennium, Version 1.1.
[39]
Srinivasan, J. et al. 2004. The case for lifetime reliability-aware microprocessors. In International Symposium on Computer Architecture.
[40]
Tuthill, M. 1998. A switched-current, switched-capacitor temperature sensor in 0.6 um CMOS. IEEE J. Solid-State Circuits. 33, 7, 1117--1122.
[41]
Tsai, J., Chen, C. C., Chen, G., Goplen, B., Qian, H., Zhan, Y., Kang, S., Wong, M. D. F., and Sapatnekar, S. S. 2006. Temperature-aware placement for SOCs. In Proceedings of the IEEE. 94, 8 (Aug.), 1502--1518.
[42]
Velusamy, S. et al. 2005. Monitoring temperature in FPGA based SoCs. In International Conference on Computer Design.
[43]
Wang, N., Zhang, S., and Zhou, R. 2003. A novel built-in CMOS sensor for on-line thermal monitoring of VLSI circuits. In International Conference on ASIC.

Cited By

View all
  • (2024)FPGA-Based Sensors for Distributed Digital Manufacturing Systems: A State-of-the-Art ReviewSensors10.3390/s2423770924:23(7709)Online publication date: 2-Dec-2024
  • (2023)Accurate On-Chip Thermal Peak Detection Based on Heuristic Algorithms and Embedded Temperature SensorsElectronics10.3390/electronics1213297812:13(2978)Online publication date: 6-Jul-2023
  • (2022)Data ConvectionProceedings of the ACM on Measurement and Analysis of Computing Systems10.1145/35080276:1(1-25)Online publication date: 28-Feb-2022
  • Show More Cited By

Index Terms

  1. Thermal monitoring mechanisms for chip multiprocessors

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Architecture and Code Optimization
    ACM Transactions on Architecture and Code Optimization  Volume 5, Issue 2
    August 2008
    124 pages
    ISSN:1544-3566
    EISSN:1544-3973
    DOI:10.1145/1400112
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 03 September 2008
    Revised: 01 August 2007
    Received: 01 March 2007
    Accepted: 01 January 2007
    Published in TACO Volume 5, Issue 2

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Thermal sensor allocation
    2. nonuniform and uniform sensor placement

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)114
    • Downloads (Last 6 weeks)17
    Reflects downloads up to 01 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)FPGA-Based Sensors for Distributed Digital Manufacturing Systems: A State-of-the-Art ReviewSensors10.3390/s2423770924:23(7709)Online publication date: 2-Dec-2024
    • (2023)Accurate On-Chip Thermal Peak Detection Based on Heuristic Algorithms and Embedded Temperature SensorsElectronics10.3390/electronics1213297812:13(2978)Online publication date: 6-Jul-2023
    • (2022)Data ConvectionProceedings of the ACM on Measurement and Analysis of Computing Systems10.1145/35080276:1(1-25)Online publication date: 28-Feb-2022
    • (2022)Combating Stealthy Thermal Covert Channel Attack With Its Thermal Signal Transmitted in Direct Sequence Spread SpectrumIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2022.319734441:11(4064-4075)Online publication date: Nov-2022
    • (2022)Thermal Sensor Placement for Multicore Systems Based on Low-Complex Compressive Sensing TheoryIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2022.314347641:11(5100-5111)Online publication date: 1-Nov-2022
    • (2022)A 0.6V–1.8V Compact Temperature Sensor With 0.24 °C Resolution, ±1.4 °C Inaccuracy and 1.06nJ per ConversionIEEE Sensors Journal10.1109/JSEN.2022.317110622:12(11480-11488)Online publication date: 15-Jun-2022
    • (2022)Entropy-based Thermal Sensor Allocation for Temperature-aware Multi-core Platforms2022 IEEE International Symposium on Circuits and Systems (ISCAS)10.1109/ISCAS48785.2022.9937996(2534-2537)Online publication date: 28-May-2022
    • (2021)Influence of Selected Factors on Thermal Parameters of the Components of Forced Cooling Systems of Electronic DevicesElectronics10.3390/electronics1003034010:3(340)Online publication date: 1-Feb-2021
    • (2021)Hardware Accelerator for Runtime Temperature Estimation in Many-CoresIEEE Design & Test10.1109/MDAT.2021.306891438:4(62-69)Online publication date: Aug-2021
    • (2021)Methodology for Accurate Hotspot Prediction by Upscale Spatiotemporal Resolution of Temperature Sensing using a Power Meter and Resistor-Capacitor Thermal Model2021 20th IEEE Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems (iTherm)10.1109/ITherm51669.2021.9503200(130-138)Online publication date: 1-Jun-2021
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Login options

    Full Access

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media