[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3330345.3330372acmconferencesArticle/Chapter ViewAbstractPublication PagesicsConference Proceedingsconference-collections
research-article

Power efficient job scheduling by predicting the impact of processor manufacturing variability

Published: 26 June 2019 Publication History

Abstract

Modern CPUs suffer from performance and power consumption variability due to the manufacturing process. As a result, systems that do not consider such variability caused by manufacturing issues lead to performance degradations and wasted power. In order to avoid such negative impact, users and system administrators must actively counteract any manufacturing variability.
In this work we show that parallel systems benefit from taking into account the consequences of manufacturing variability when making scheduling decisions at the job scheduler level. We also show that it is possible to predict the impact of this variability on specific applications by using variability-aware power prediction models. Based on these power models, we propose two job scheduling policies that consider the effects of manufacturing variability for each application and that ensure that power consumption stays under a system-wide power budget. We evaluate our policies under different power budgets and traffic scenarios, consisting of both single- and multi-node parallel applications, utilizing up to 4096 cores in total. We demonstrate that they decrease job turnaround time, compared to contemporary scheduling policies used on production clusters, up to 31% while saving up to 5.5% energy.

References

[1]
2017. The Green500 List. http://www.green500.org. (June 2017).
[2]
Axel Auweter, Arndt Bode, Matthias Brehm, Luigi Brochard, Nicolay Hammer, Herbert Huber, Raj Panda, Francois Thomas, and Torsten Wilde. 2014. A Case Study of Energy Aware Scheduling on SuperMUC. In Supercomputing, Julian Martin Kunkel, Thomas Ludwig, and Hans Werner Meuer (Eds.). 394--409.
[3]
Eduard Ayguadé, Alejandro Duran, Jay Hoeflinger, Federico Massaioli, and Xavier Teruel. 2007. An Experimental Evaluation of the New OpenMP Tasking Model.
[4]
Peter E. Bailey, Aniruddha Marathe, David K. Lowenthal, Barry Rountree, and Martin Schulz. 2015. Finding the Limits of Power-constrained Application Performance. Article 79, 12 pages.
[5]
Frank Bellosa. 2000. The Benefits of Event: Driven Energy Accounting in Power-sensitive Systems (EW 9). 6.
[6]
Ramon Bertran, Alper Buyuktosunoglu, Meeta S. Gupta, Marc Gonzalez, and Pradip Bose. 2012. Systematic Energy Characterization of CMP/SMT Processor Systems via Automated Micro-Benchmarks (MICRO-45). 13.
[7]
Ramon Bertran, Marc Gonzalez, Xavier Martorell, Nacho Navarro, and Eduard Ayguade. 2010. Decomposable and Responsive Power Models for Multicore Processors Using Performance Counters (ICS '10). 12.
[8]
Christian Bienia, Sanjeev Kumar, Jaswinder Pal Singh, and Kai Li. 2008. The PARSEC Benchmark Suite: Characterization and Architectural Implications. 10.
[9]
W. L. Bircher and L. K. John. 2007. Complete System Power Estimation: A Trickle-Down Approach Based on Performance Events. 158--168.
[10]
William Lloyd Bircher and Lizy K. John. 2012. Complete System Power Estimation Using Processor Performance Events. IEEE Trans. Comput. 61, 4 (April 2012), 15.
[11]
W. L. Bircher, M. Valluri, J. Law, and L. K. John. 2005. Runtime Identification of Microprocessor Energy Saving Opportunities (ISLPED '05). 6.
[12]
T. Cao, Y. He, and M. Kondo. 2016. Demand-Aware Power Management for Power-Constrained HPC Systems. 21--31.
[13]
Dimitrios Chasapis, Marc Casas, Miquel Moretó, Martin Schulz, Eduard Ayguadé, Jesus Labarta, and Mateo Valero. 2016. Runtime-Guided Mitigation of Manufacturing Variability in Power-Constrained Multi-Socket NUMA Nodes (ICS '16). Article 5, 12 pages.
[14]
Dimitrios Chasapis, Marc Casas, Miquel Moretó, Raul Vidal, Eduard Ayguadé, Jesús Labarta, and Mateo Valero. 2015. PARSECSs: Evaluating the Impact of Task Parallelism in the PARSEC Benchmark Suite. ACM Trans. Archit. Code Optim. 12, 4, Article 41 (Dec. 2015), 22 pages.
[15]
Spencer Desrochers, Chad Paradis, and Vincent M. Weaver. 2016. A Validation of DRAM RAPL Power Measurements. In Proceedings of the Second International Symposium on Memory Systems (MEMSYS '16). 455--470.
[16]
Daniel A. Ellsworth, Allen D. Malony, Barry Rountree, and Martin Schulz. 2015. Dynamic Power Sharing for Higher Job Throughput (SC '15). Article 80, 11 pages.
[17]
Maja Etinski, Julita Corbalan, Jesus Labarta, and Mateo Valero. 2010. Utilization driven power-aware parallel job scheduling. Computer Science - Research and Development 25, 3 (01 Sep 2010), 207--216.
[18]
M. Etinski, J. Corbalan, J. Labarta, and M. Valero. 2011. Linear programming based parallel job scheduling for power constrained systems.
[19]
M. Etinski, J. Corbalan, J. Labarta, and M. Valero. 2012. Parallel job scheduling for power constrained {HPC} systems. Parallel Comput. 38, 12 (2012), 615 -- 630.
[20]
Dror G. Feitelson, Dan Tsafrir, and David Krakov. 2014. Experience with using the Parallel Workloads Archive. J. Parallel and Distrib. Comput. 74, 10 (2014), 2967 -- 2982.
[21]
F. Fraternali, A. Bartolini, C. Cavazzoni, and L. Benini. 2018. Quantifying the Impact of Variability and Heterogeneity on the Energy Efficiency for a Next-Generation Ultra-Green Supercomputer. IEEE Transactions on Parallel and Distributed Systems 29, 7 (July 2018), 1575--1588.
[22]
Neha Gholkar, Frank Mueller, and Barry Rountree. 2016. Power Tuning HPC Jobs on Power-Constrained Systems (PACT '16). 13.
[23]
Bhavishya Goel, Sally A. McKee, Roberto Gioiosa, Karan Singh, Major Bhadauria, and Marco Cesati. 2010. Portable, Scalable, Per-core Power Estimation for Intelligent Resource Management (GREENCOMP '10). 12.
[24]
D. Hackenberg, T. Ilsche, R. Schöne, D. Molka, M. Schmidt, and W. E. Nagel. 2013. Power measurement techniques on standard compute nodes: A quantitative comparison. In 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). 194--204.
[25]
D. Hackenberg, R. Schöne, T. Ilsche, D. Molka, J. Schuchart, and R. Geyer. 2015. An Energy Efficiency Feature Survey of the Intel Haswell Processor. In 2015 IEEE International Parallel and Distributed Processing Symposium Workshop. 896--904.
[26]
Chung-hsing Hsu and Wu chun Feng. 2005. A Power-Aware Run-Time System for High-Performance Computing. In Proceedings of the 2005 ACM/IEEE Conference on Supercomputing (SC'05). 1--1.
[27]
Yuichi Inadomi, Tapasya Patki, Koji Inoue, Mutsumi Aoyagi, Barry Rountree, Martin Schulz, David Lowenthal, Yasutaka Wada, Keiichiro Fukazawa, Masatsugu Ueda, Masaaki Kondo, and Ikuo Miyoshi. 2015. Analyzing and Mitigating the Impact of Manufacturing Variability in Power-constrained Supercomputing (SC '15). Article 78, 12 pages.
[28]
Intel 2010. Intel® 64 and IA-32 Architectures Software Developer's Manual Volume 3A: System Programming Guide, Part 1. Intel.
[29]
Canturk Isci and Margaret Martonosi. 2003. Runtime Power Monitoring in High-End Processors: Methodology and Empirical Data (MICRO 36).
[30]
Morris A. Jette, Andy B. Yoo, and Mark Grondona. 2002. SLURM: Simple Linux Utility for Resource Management.
[31]
Haoqiang Jin and Rob F. Van der Wijngaart. 2006. Performance Characteristics of the Multi-zone NAS Parallel Benchmarks. J. Parallel Distrib. Comput. 66, 5 (May 2006), 12.
[32]
Russ Joseph and Margaret Martonosi. 2001. Run-time Power Estimation in High Performance Microprocessors (ISLPED '01). 6.
[33]
Kashif Nizam Khan, Mikael Hirki, Tapio Niemi, Jukka K. Nurminen, and Zhonghong Ou. 2018. RAPL in Action: Experiences in Using RAPL for Power Measurements. ACM Trans. Model. Perform. Eval. Comput. Syst. 3, 2, Article 9 (March 2018), 26 pages.
[34]
Bhavesh Khemka, Ryan Friese, Sudeep Pasricha, Anthony A. Maciejewski, Howard Jay Siegel, Gregory A. Koenig, Sarah Powers, Marcia Hilton, Rajendra Rambharos, and Steve Poole. 2015. Utility maximizing dynamic resource management in an oversubscribed energy-constrained heterogeneous computing system. Sustainable Computing: Informatics and Systems 5 (2015), 14 -- 30.
[35]
Katia Leal. 2016. Energy efficient scheduling strategies in Federated Grids. Sustainable Computing: Informatics and Systems 9 (2016), 33 -- 41.
[36]
Tao Li and Lizy Kurian John. 2003. Run-time Modeling and Estimation of Operating System Power Consumption (SIGMETRICS '03). 12.
[37]
Livermore Computing. 2014. The Catalyst Supercomputer. http://computation.llnl.gov/computers/catalyst. (2014). http://computation.llnl.gov/computers/catalyst
[38]
M. Maiterth, G. Koenig, K. Pedretti, S. Jana, N. Bates, A. Borghesi, D. Montoya, A. Bartolini, and M. Puzovic. 2018. Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- Initial Analysis. In 2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). 685--693.
[39]
Aniruddha Marathe, Yijia Zhang, Grayson Blanks, Nirmal Kumbhare, Ghaleb Abdulla, and Barry Rountree. 2017. An Empirical Survey of Performance and Energy Efficiency Variation on Intel Processors. In Proceedings of the 5th International Workshop on Energy Efficient Supercomputing (E2SC'17). ACM, New York, NY, USA, Article 9, 8 pages.
[40]
A. W. Mu'alem and D. G. Feitelson. 2001. Utilization, predictability, workloads, and user runtime estimates in scheduling the IBM SP2 with backfilling. IEEE Transactions on Parallel and Distributed Systems 12, 6 (June 2001), 529--543.
[41]
Dan Nagle. 2005. MPI - The Complete Reference, Vol. 1, The MPI Core, 2Nd Ed., Scientific and Engineering Computation Series, by Marc Snir, Steve Otto, Steven Huss-Lederman, David Walker and Jack Dongarra. Sci. Program. 13, 1 (Jan. 2005), 7.
[42]
OpenMP Architecture Review Board. 2013. OpenMP Application Program Interface Version 4.0. (2013). http://www.openmp.org/mp-documents/OpenMP4.0.0.pdf
[43]
Tapasya Patki, David K. Lowenthal, Barry Rountree, Martin Schulz, and Bronis R. de Supinski. 2013. Exploring Hardware Overprovisioning in Power-constrained, High Performance Computing. 10.
[44]
Tapasya Patki, David K. Lowenthal, Anjana Sasidharan, Matthias Maiterth, Barry L. Rountree, Martin Schulz, and Bronis R. de Supinski. 2015. Practical Resource Management in Power-Constrained, High Performance Computing. In Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing (HPDC '15). ACM, New York, NY, USA, 12.
[45]
N. Rajovic, P.M. Carpenter, I. Gelado, N. Puzovic, A. Ramirez, and M. Valero. 2013. Supercomputing with commodity CPUs: Are mobile SoCs ready for HPC?
[46]
Barry Rountree, Dong H. Ahn, Bronis R. de Supinski, David K. Lowenthal, and Martin Schulz. 2012. Beyond DVFS: A First Look at Performance Under a Hardware-Enforced Power Bound (IPDPSW '12). 7.
[47]
Barry Rountree, David K. Lownenthal, Bronis R. de Supinski, Martin Schulz, Vincent W. Freeh, and Tyler Bletsch. 2009. Adagio: Making DVS Practical for Complex HPC Applications (ICS '09). 10.
[48]
Osman Sarood, Akhil Langer, Abhishek Gupta, and Laxmikant Kale. 2014. Maximizing Throughput of Overprovisioned HPC Data Centers Under a Strict Power Budget. 12.
[49]
K. Shoga, B. Rountree, and M. Schulz. 2014. Whitelisting MSRs with msr-safe. (November 2014). https://github.com/LLNL/msr-safe
[50]
Hayk Shoukourian. 2015. Adviser for Energy Consumption Management: Green Energy Conservation. Dissertation. Technische Universität München.
[51]
Karan Singh, Major Bhadauria, and Sally A. McKee. 2009. Real Time Power Estimation and Thread Scheduling via Performance Counters. SIGARCH Comput. Archit. News 37, 2 (July 2009), 10.
[52]
ASCAC Subcomittee. 2014. Top ten exascale research challenges. Technical Report. US Department of Energy.
[53]
Radu Teodorescu and Josep Torrellas. 2008. Variation-Aware Application Scheduling and Power Management for Chip Multiprocessors (ISCA '08). 12.
[54]
Ehsan Totoni, Akhil Langer, Josep Torrellas, and Laxmikant V.Kale. 2014. Scheduling for HPC Systems with Process Variation Heterogeneity. Technical Report. University of Illinois at Urbana-Champaign.

Cited By

View all
  • (2024)Enhancement of System Performance Using PeSche Scheduling Algorithm on MultiprocessorsMetaheuristics Algorithm and Optimization of Engineering and Complex Systems10.4018/979-8-3693-3314-3.ch015(274-301)Online publication date: 28-Jun-2024
  • (2024)sys-sage: A Unified Representation of Dynamic Topologies & Attributes on HPC SystemsProceedings of the 38th ACM International Conference on Supercomputing10.1145/3650200.3656627(363-375)Online publication date: 30-May-2024
  • (2024)Toward Sustainable HPC: In-Production Deployment of Incentive-Based Power Efficiency Mechanism on the Fugaku SupercomputerProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC41406.2024.00030(1-16)Online publication date: 17-Nov-2024
  • Show More Cited By

Index Terms

  1. Power efficient job scheduling by predicting the impact of processor manufacturing variability

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      ICS '19: Proceedings of the ACM International Conference on Supercomputing
      June 2019
      533 pages
      ISBN:9781450360791
      DOI:10.1145/3330345
      Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of the United States government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 26 June 2019

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. HPC
      2. energy efficient
      3. job scheduling
      4. manufacturing variability
      5. power prediction

      Qualifiers

      • Research-article

      Conference

      ICS '19
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 629 of 2,180 submissions, 29%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)34
      • Downloads (Last 6 weeks)7
      Reflects downloads up to 11 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Enhancement of System Performance Using PeSche Scheduling Algorithm on MultiprocessorsMetaheuristics Algorithm and Optimization of Engineering and Complex Systems10.4018/979-8-3693-3314-3.ch015(274-301)Online publication date: 28-Jun-2024
      • (2024)sys-sage: A Unified Representation of Dynamic Topologies & Attributes on HPC SystemsProceedings of the 38th ACM International Conference on Supercomputing10.1145/3650200.3656627(363-375)Online publication date: 30-May-2024
      • (2024)Toward Sustainable HPC: In-Production Deployment of Incentive-Based Power Efficiency Mechanism on the Fugaku SupercomputerProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC41406.2024.00030(1-16)Online publication date: 17-Nov-2024
      • (2024)Power-Efficiency Variation on A64FX Supercomputers and its Application to System Operation2024 IEEE International Conference on Cluster Computing Workshops (CLUSTER Workshops)10.1109/CLUSTERWorkshops61563.2024.00018(55-65)Online publication date: 24-Sep-2024
      • (2024)Light-Weight Prediction for Improving Energy Consumption in HPC PlatformsEuro-Par 2024: Parallel Processing10.1007/978-3-031-69577-3_11(152-165)Online publication date: 26-Aug-2024
      • (2023)Enhancement of system performance using PeSche scheduling algorithm on multiprocessorsi-manager's Journal on Power Systems Engineering10.26634/jps.11.3.2027711:3(37)Online publication date: 2023
      • (2023)AOA: Adaptive Overclocking Algorithm on CPU-GPU Heterogeneous PlatformsAlgorithms and Architectures for Parallel Processing10.1007/978-3-031-22677-9_14(253-272)Online publication date: 11-Jan-2023
      • (2022)Not all GPUs are created equalProceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis10.5555/3571885.3571971(1-15)Online publication date: 13-Nov-2022
      • (2022)Analyzing Performance and Power-Efficiency Variations among NVIDIA GPUsProceedings of the 51st International Conference on Parallel Processing10.1145/3545008.3545084(1-12)Online publication date: 29-Aug-2022
      • (2022)TD-NUCA: Runtime Driven Management of NUCA Caches in Task Dataflow Programming ModelsSC22: International Conference for High Performance Computing, Networking, Storage and Analysis10.1109/SC41404.2022.00085(1-15)Online publication date: Nov-2022
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media