[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article
Open access

Performance-aware thermal management via task scheduling

Published: 07 May 2010 Publication History

Abstract

High on-chip temperature impairs the processor's reliability and reduces its lifetime. Hardware-level dynamic thermal management (DTM) techniques can effectively constrain the chip temperature, but degrades the performance. We propose an OS-level technique that performs thermal-aware job scheduling to reduce DTMs. The algorithm is based on the observation that hot and cool jobs executed in a different order can make a difference in resulting temperature. Real-system implementation in Linux shows that our scheduler can remove 10.5% to 73.6% of the hardware DTMs in a medium thermal environment. The CPU throughput is improved by up to 7.6% (4.1%, on average) in a severe thermal environment.

References

[1]
Bansal, N., Kimbrel, T., and Pruhs, K. 2004. Dynamic speed scaling to manage energy and temperature. In Proceedings of the 45th Annual Symposium on Foundations of Computer Science. IEEE, Los Alamitos, CA, 520--529.
[2]
Bansal, N. and Pruhs, K. 2005. Speed scaling to manage temperature. In Proceedings of the Symposium on Theoretical Aspects of Computer Science. Springer, Berlin, 460--471.
[3]
Bellosa, F. 2000. The benefits of event-driven energy accounting in power-sensitive systems. In Proceedings of the 9th European Workshop. ACM, New York.
[4]
Bellosa, F.,Weissel, A.,Waitz, M., and Kellner, S. 2003. Event-driven energy accounting for dynamic thermal management. In Proceedings of the Workshop on Compilers and Operating Systems for Low-Power. IEEE, Los Alamitos, CA.
[5]
Bovet, D. and Cesati, M. 1984. Understanding the Linux Kernel 3rd Ed. O'Reilly, Sebastopol, CA.
[6]
Brooks, D. and Martonosi, M. 2001. Dynamic thermal management for high-performance microprocessors. In Proceedings of the 7th International Symposium on High-Performance Computer Architecture. IEEE, Los Alamitos, CA, 171--180.
[7]
Choi, J., Cher, C.-Y., Franke, H., Hamann, H., and Bose, A. W. P. 2007. Thermal-aware task scheduling at the system software level. In Proceedings of International Symposium on Low-Power Electronics and Design. ACM, New York, 213--218.
[8]
Choi, J., Kim, Y., Sivasubramaniam, A., Srebric, J., Wang, Q., and Lee, J. 2007. Modeling and managing thermal profiles of rack-mounted servers with thermostat. In Proceedings of 13th International Symposium on High-Performance Computer Architecture. IEEE, Los Alamitos, CA.
[9]
Chrobak, M., Durr, C., Hurand, M., and Robert, J. 2008. Algorithms for temperature-aware task scheduling in microprocessor systems. In Proceedings of the 4th International Conference on Algorithmic Aspects in Information and Management. Springer, Berlin, 120--130.
[10]
Coskun, A., Rosing, T., and Gross, K. 2008. Proactive temperature balancing for low- cost thermal management in mpsocs. In Proceedings of the International Conference on Computer-Aided Design. IEEE, Los Alamitos, CA, 250--257.
[11]
Coskun, A., Rosing, T., Whisnant, K., and Gross, K. 2008. Static and dynamic temperature-aware scheduling for multiprocessor socs. IEEE Trans. VLSI Syst. 16, 9, 1127--1140.
[12]
Donald, J. and Martonosi, M. 2006. Techniques for multicore thermal management: Classification and new exploration. In Proceedings of the 33rd International Symposium on Computer Architecture. ACM, New York, 78--88.
[13]
Gunther, S. H., Binns, F., Carmean, D. M., and Hall, J. C. 2001. Managing the impact of increasing microprocessor power consumption. Intel Tech. J. ftp://download.intel.com/technology/itj/q12001/pdf/art_4.pdf
[14]
Han, Y., Koren, I., and Krishna, C. M. 2006. Temptor: A lightweight runtime temperature monitoring tool using performance counters. In Proceedings of the 3rd Workshop on Temperature-Aware Computer Systems. IEEE, Los Alamitos, CA.
[15]
Hanson, H., Keckler, S., Ghiasi, S., Rajamani, K., Rawson, F., and Rubio, J. 2007. Thermal response to dvfs: Analysis with an Intel Pentium m. In Proceedings of the International Symposium on Low-Power Electronics and Design. ACM, New York, 219--224.
[16]
Hanumaiah, V., Vrudhula, S., and Chatha, K. 2009. Performance optimal speed control of multi-core processors under thermal constraints. In Proceedings of the Design, Automation and Test Conference in Europe. ACM, New York, 1548--1551.
[17]
He, L., Liao, W., and Stan, M. R. 2004. System level leakage reduction considering the interdependence of temperature and leakage. In Proceedings of the Design Automation Conference. ACM, New York, 12--17.
[18]
Heath, T., Centeno, A. P., George, P., Ramos, L., Jaluria, Y., and Bianchini, R. 2006. Mercury and freon: Temperature emulation and management for server systems. In Proceedings of the International Conference on Architectural Support for Programming Language and Operating Systems. ACM, New York, 106--116.
[19]
Heo, S., Barr, K., and Asanovic, K. 2003. Reducing power density through activity migration. In Proceedings of the International Symposium on Low-Power Electronics and Design. ACM, New York, 217--222.
[20]
Huang, W., Humenay, E., Skadron, K., and Stan, M. R. 2005. The need for a full-chip and package thermal model for thermally optimized ic designs. In Proceedings of the International Symposium on Low-Power Electronics and Design. ACM, New York, 245--250.
[21]
Huang, W., Stan, M. R., Skadron, K., Sankaranarayanan, K., Ghosh, S., and Velusamy, S. 2004. Compact thermal modeling for temperature-aware design. In Proceedings of the 41st Annual Conference on Design Automation. ACM, New York, 878--883.
[22]
Intel. 2002. Intel Pentium 4 processor in the 478-pin package thermal design guidelines. http://www.intel.com/design/Pentium4/guides/249889.htm.
[23]
Isci, C. and Martonosi, M. 2003. Runtime power monitoring in high-end processors: Methodology and empirical data. In Proceedings of the 36th Annual International Symposium on Microarchitecture. IEEE, Los Alamitos, CA, 93--104.
[24]
Jayaseelan, R. and Mitra, T. 2008. Temperature aware task sequencing and voltage scaling. In Proceedings of the International Conference on Computer-Aided Design. ACM, New York, 618--623.
[25]
Joseph, R. and Martonosi, M. 2001. Run-time power estimation in high-performance microprocessors. In Proceedings of the International Symposium on Low-Power Electronics and Design. ACM, New York, 135--140.
[26]
Jung, H., Rong, P., and Pedram, M. 2008. Stochastic modeling of a thermally-managed multi-core system. In Proceedings of the 45th Design Automation Conference. ACM, New York, 728--733.
[27]
Khan, O. and Kundu, S. 2008. A framework for predictive dynamic temperature management of microprocessor systems. In Proceedings of the International Conference on Computer-Aided Design. ACM, New York, 258--263.
[28]
Krum, A. 2000. Thermal Management. CRC Press, Boca Raton, FL.
[29]
Kumar, A., Shang, L., Peh, L.-S., and Jha, N. 2006. Hybdtm: A coordinated hardware-software approach for dynamic thermal management. In Proceedings of the 43rd Design Automation Conference. ACM, New York, 548--553.
[30]
Kursun, E. and Cher, C. Y. 2008. Variation-aware thermal characterization and management of multi-core architectures. In Proceedings of the 26th International Conference on Computer Design. ACM, New York, 280--285.
[31]
Kursun, E., Cher, C.-Y., Buyuktosunoglu, A., and Bose, P. 2006. Investigating the effects of task scheduling on thermal behavior. In Proceedings of the 3rd Workshop on Temperature-Aware Computer Systems. IEEE, Los Alamitos, CA.
[32]
Li, Y., Brooks, D., Hu, Z., and Skadron, K. 2005. Performance, energy, and thermal considerations for smt and cmp architectures. In Proceedings of the 11th International Symposium on High-Performance Computer Architecture. IEEE, Los Alamitos, CA, 71--82.
[33]
Li, Y., Lee, B., Brooks, D., Hu, Z., and Skadron, K. 2006. Cmp design space exploration subject to physical constraints. In Proceedings of the 12th International Symposium on High-Performance Computer Architecture. IEEE, Los Alamitos, CA.
[34]
Lu, Z., Lach, J., Stan, M. R., and Skadron, K. 2005. Improved thermal management with reliability banking. IEEE Micro. 25, 6, 40--49.
[35]
Mcgowen, R. 2005. Adaptive designs for power and thermal optimization. In Proceedings of the International Conference on Computer-Aided Design. ACM, New York, 118--121.
[36]
Monferrer, P. C., Magklis, G., González, J., and González, A. 2005. Distributing the frontend for temperature reduction. In Proceedings of the 11th International Symposium on High-Performance Computer Architecture. IEEE, Los Alamitos, CA, 61--70.
[37]
Moore, J., Chase, J., Ranganathan, P., and Sharma, R. 2005. Making scheduling “cool”: Temperature-aware workload placement in data centers. In Proceedings of the Annual Technical Conference. USENIX, Berkeley, CA, 61--75.
[38]
NIMO. 2007. Predictive technology model. http://ptm.asu.edu/.
[39]
Pillai, P. and Shin, K. G. 2001. Real-time dynamic voltage scaling for low-power embedded operating systems. In Proceedings of the 18th Symposium on Operating Systems Principles. ACM, New York, 89--102.
[40]
Powell, M. D., Gomaa, M., and Vijaykumar, T. N. 2004. Heat-and-run: Leveraging smt and cmp to manage power density through the operating system. In Proceedings of the 11th International Conference on Architectural Support for Programming Languages and Operating Systems. ACM, New York, 260--270.
[41]
Rohou, E. and Smith, M. D. 1999. Dynamically managing processor temperature and power. In Proceedings of the 2nd Workshop on Feedback-Directed Optimization.
[42]
Samson, E., Machiroutu, S., Chang, J.-Y., Santos, I., Hermerding, J., Dani, A., Prasher, R., and Song, D. W. 2005. Interface material selection and a thermal management technique in second-generation platforms built on Intel Centrino mobile technology. Intel Tech. J. http://download. intel.com/technology/itj/2005/volume09issue01/art06_interface_materials/vol09_art06.pdf.
[43]
Skadron, K., Abdelzaher, T., and Stan, M. R. 2002. Control-theoretic techniques and thermal-rc modeling for accurate and localized dynamic thermal management. In Proceedings of the 8th International Symposium on High-Performance Computer Architecture. IEEE, Los Alamitos, CA, 17--28.
[44]
Skadron, K., Stan, M. R., Huang, W., Velusamy, S., Sankaranarayanan, K., and Tarjan, D. 2003. Temperature-aware microarchitecture. In Proceedings of the 30th International Symposium on Computer Architecture. ACM, New York, 2--13.
[45]
Sprunt, B. 2002. Brink and abyss Pentium 4 performance counter tools for Linux. Tech. rep., Bucknell University, Lewisburg, PA.
[46]
Srinivasan, J. and Adve, S. V. 2003. Predictive dynamic thermal management for multimedia applications. In Proceedings of the 17th Annual International Conference on Supercomputing. ACM, New York, 109--120.
[47]
Stoer, J. and Bulirsch, R. 1991. Introduction to Numerical Analysis 2nd Ed. Springer, Berlin.
[48]
Wang, S. and Bettati, R. 2006a. Delay analysis in temperature-constrained hard real-time systems with general task arrivals. In Proceedings of the Real-Time Systems Symposium. IEEE, Los Alamitos, CA, 323--334.
[49]
Wang, S. and Bettati, R. 2006b. Reactive speed control in temperature-constrained real-time systems. In Proceedings of the 18th Conference on Real-Time Systems. IEEE, Los Alamitos, CA.
[50]
Yang, J., Zhou, X., Chrobak, M., Zhang, Y., and Jin, L. 2008. Dynamic thermal management through task scheduling. In Proceedings of the International Symposium on Performance Analysis of Systems and Software. IEEE, Los Alamitos, CA, 191--201.
[51]
Yeo, I., Liu, C. C., and Kim, E. J. 2008. Predictive dynamic thermal management for multicore systems. In Proceedings of the 45th Design Automation Conference. ACM, New York, 734--739.
[52]
Yuan, W. and Nahrstedt, K. 2003. Energy-efficient soft real-time cpu scheduling for mobile multimedia systems. In Proceedings of the 19th Symposium on Operating Systems Principles. ACM, New York, 149--163.
[53]
Zeldovich, N. and Chandra, R. 2005. Interactive performance measurement with vncplay. In Proceedings of the FREENIX Track: USENIX Annual Technical Conference. USENIX, Berkeley, CA.

Cited By

View all
  • (2024)Aquilo: Temperature-aware scheduler for millimeter-wave devices and networksHigh-Confidence Computing10.1016/j.hcc.2024.1002234:4(100223)Online publication date: Dec-2024
  • (2022)Temperature Estimation in Multi-Core Processors Using Statistical Approach for Task SchedulingCommunication and Intelligent Systems10.1007/978-981-19-2130-8_75(963-972)Online publication date: 19-Aug-2022
  • (2021)Ant Colony Optimization-Based Thermal-Aware Adaptive Routing Mechanism for Optical NoCsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2020.302913240:9(1836-1849)Online publication date: Sep-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Architecture and Code Optimization
ACM Transactions on Architecture and Code Optimization  Volume 7, Issue 1
April 2010
151 pages
ISSN:1544-3566
EISSN:1544-3973
DOI:10.1145/1736065
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 May 2010
Accepted: 01 February 2010
Revised: 01 December 2009
Received: 01 May 2009
Published in TACO Volume 7, Issue 1

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Thermal management
  2. task scheduling

Qualifiers

  • Research-article
  • Research
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)53
  • Downloads (Last 6 weeks)10
Reflects downloads up to 22 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Aquilo: Temperature-aware scheduler for millimeter-wave devices and networksHigh-Confidence Computing10.1016/j.hcc.2024.1002234:4(100223)Online publication date: Dec-2024
  • (2022)Temperature Estimation in Multi-Core Processors Using Statistical Approach for Task SchedulingCommunication and Intelligent Systems10.1007/978-981-19-2130-8_75(963-972)Online publication date: 19-Aug-2022
  • (2021)Ant Colony Optimization-Based Thermal-Aware Adaptive Routing Mechanism for Optical NoCsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2020.302913240:9(1836-1849)Online publication date: Sep-2021
  • (2020)A Case for Temperature-Aware Scheduler for Millimeter-Wave Devices and Networks2020 IEEE 28th International Conference on Network Protocols (ICNP)10.1109/ICNP49622.2020.9259381(1-12)Online publication date: 13-Oct-2020
  • (2018)TheSPoT: Thermal Stress-Aware Power and Temperature Management for Multiprocessor Systems-on-ChipIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2017.276841737:8(1532-1545)Online publication date: Aug-2018
  • (2018)Joint Cooling and Server Control in Data Centers: A Cross-Layer Framework for Holistic Energy MinimizationIEEE Systems Journal10.1109/JSYST.2017.270086312:3(2461-2472)Online publication date: Sep-2018
  • (2018)Task Migration Algorithm to Reduce Temperature Imbalance Amongst Cores in Linux Based Multi-Core Processor Systems2018 3rd International Conference for Convergence in Technology (I2CT)10.1109/I2CT.2018.8529564(1-7)Online publication date: Apr-2018
  • (2017)Dynamic thermal and IT resource management strategies for data center energy minimizationJournal of Cloud Computing: Advances, Systems and Applications10.1186/s13677-017-0094-z6:1(1-16)Online publication date: 1-Dec-2017
  • (2016)Towards Joint Optimization Over ICT and Cooling Systems in Data Centre: A SurveyIEEE Communications Surveys & Tutorials10.1109/COMST.2016.254510918:3(1596-1616)Online publication date: Nov-2017
  • (2015)A Fine-Grained Power Gating Control on Linux Monitoring Power Consumption of Processor Functional UnitsIEICE Transactions on Electronics10.1587/transele.E98.C.559E98.C:7(559-568)Online publication date: 2015
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Full Access

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media