More Web Proxy on the site http://driver.im/

research-article

Open access

Learning-Augmented Decentralized Online Convex Optimization in Networks

Authors:

Shaolei RenAuthors Info & Claims

Proceedings of the ACM on Measurement and Analysis of Computing Systems, Volume 8, Issue 3

Article No.: 38, Pages 1 - 42

https://doi.org/10.1145/3700420

Published: 13 December 2024 Publication History

Abstract

This paper studies learning-augmented decentralized online convex optimization in a networked multi-agent system, a challenging setting that has remained under-explored. We first consider a linear learning-augmented decentralized online algorithm (LADO-Lin) that combines a machine learning (ML) policy with a baseline expert policy in a linear manner. We show that, while LADO-Lin can exploit the potential of ML predictions to improve the average cost performance, it cannot have guaranteed worst-case performance. To address this limitation, we propose a novel online algorithm (LADO) that adaptively combines the ML policy and expert policy to safeguard the ML predictions to achieve strong competitiveness guarantees. We also prove the average cost bound for LADO, revealing the tradeoff between average performance and worst-case robustness and demonstrating the advantage of training the ML policy by explicitly considering the robustness requirement. Finally, we run an experiment on decentralized battery management. Our results highlight the potential of ML augmentation to improve the average performance as well as the guaranteed worst-case performance of LADO.

References

[1]

Hamidreza Shahbazi and Farid Karbalaei. Decentralized voltage control of power systems using multi-agent systems. Journal of Modern Power Systems and Clean Energy, 8(2):249--259, 2020.

[2]

Yuanyuan Shi, Guannan Qu, Steven Low, Anima Anandkumar, and Adam Wierman. Stability constrained reinforcement learning for real-time voltage control. arXiv preprint arXiv:2109.14854, 2021.

[3]

Saghar Hosseini, Airlie Chapman, and Mehran Mesbahi. Online distributed convex optimization on dynamic networks. IEEE Transactions on Automatic Control, 61(11):3545--3550, 2016.

[4]

Amal Feriani and Ekram Hossain. Single and multi-agent deep reinforcement learning for ai-enabled wireless networks: A tutorial. IEEE Communications Surveys & Tutorials, 23(2):1226--1252, 2021.

[5]

Yasar Sinan Nasir and Dongning Guo. Multi-agent deep reinforcement learning for dynamic power allocation in wireless networks. IEEE Journal on Selected Areas in Communications, 37(10):2239--2250, 2019.

[6]

Fuqiang Yao and Luliang Jia. A collaborative multi-agent reinforcement learning anti-jamming algorithm in wireless networks. IEEE wireless communications letters, 8(4):1024--1027, 2019.

[7]

Felipe Caro and Jérémie Gallien. Clearance pricing optimization for a fast-fashion retailer. Operations research, 60(6):1404--1422, 2012.

Digital Library

[8]

Ozan Candogan, Kostas Bimpikis, and Asuman Ozdaglar. Optimal pricing in networks with externalities. OperationsResearch, 60(4):883--905, 2012.

Digital Library

[9]

Kaixiang Lin, Shu Wang, and Jiayu Zhou. Collaborative deep reinforcement learning. arXiv preprint arXiv:1702.05796, 2017.

[10]

Xuanyu Cao and Tamer Baar. Decentralized online convex optimization with feedback delays. IEEE Transactions on Automatic Control, 67(6):2889--2904, 2021.

[11]

Nicolas Christianson, Tinashe Handina, and Adam Wierman. Chasing convex bodies and functions with black-box advice. In COLT, 2022.

[12]

Gautam Goel, Yiheng Lin, Haoyuan Sun, and Adam Wierman. Beyond online balanced descent: An optimal algorithm for smoothed online optimization. In NeurIPS, volume 32, 2019.

[13]

Mehrdad Mahdavi, Rong Jin, and Tianbao Yang. Trading regret for efficiency: Online convex optimization with long term constraints. J. Mach. Learn. Res., 13(1):2503--2528, sep 2012.

Digital Library

[14]

Gautam Goel and Adam Wierman. An online algorithm for smoothed online convex optimization. SIGMETRICS Perform. Eval. Rev., 47(2):6--8, December 2019.

Digital Library

[15]

Lijun Zhang, Wei Jiang, Shiyin Lu, and Tianbao Yang. Revisiting smoothed online learning. In A. Beygelzimer, Y. Dauphin, P. Liang, and J. Wortman Vaughan, editors, Advances in Neural Information Processing Systems, 2021.

[16]

Guanya Shi, Yiheng Lin, Soon-Jo Chung, Yisong Yue, and Adam Wierman. Online optimization with memory and competitive control. Advances in Neural Information Processing Systems, 33:20636--20647, 2020.

[17]

Weici Pan, Guanya Shi, Yiheng Lin, and Adam Wierman. Online optimization with feedback delay and nonlinear switching cost. Proc. ACM Meas. Anal. Comput. Syst., 6(1), Feb 2022.

Digital Library

[18]

Niangjun Chen, Gautam Goel, and Adam Wierman. Smoothed online convex optimization in high dimensions via online balanced descent. In COLT, 2018.

[19]

Alec Koppel, Felicia Y. Jakubiec, and Alejandro Ribeiro. A saddle point algorithm for networked online convex optimization. IEEE Transactions on Signal Processing, 63(19):5149--5164, 2015.

Digital Library

[20]

Xiuxian Li, Xinlei Yi, and Lihua Xie. Distributed online convex optimization with an aggregative variable. IEEE Transactions on Control of Network Systems, 2021.

[21]

Xuanyu Cao and Tamer Baar. Decentralized online convex optimization based on signs of relative states. Automatica, 129:109676, 2021.

Digital Library

[22]

Saghar Hosseini, Airlie Chapman, and Mehran Mesbahi. Online distributed convex optimization on dynamic networks. IEEE Transactions on Automatic Control, 61(11):3545--3550, 2016.

[23]

Weiwei Kong, Christopher Liaw, Aranyak Mehta, and D. Sivakumar. A new dog learns old tricks: RL finds classic optimization algorithms. In ICLR, 2019.

[24]

Thomas Barrett, William Clements, Jakob Foerster, and Alex Lvovsky. Exploratory combinatorial optimization with reinforcement learning. In AAAI, 2020.

[25]

Han Zhang, Wenzhong Li, Shaohua Gao, Xiaoliang Wang, and Baoliu Ye. Reles: A neural adaptive multipath scheduler based on deep reinforcement learning. In INFOCOM, 2019.

Digital Library

[26]

Zhihui Shao, Jianyi Yang, Cong Shen, and Shaolei Ren. Learning for robust combinatorial optimization: Algorithm and application. In INFOCOM, 2022.

Digital Library

[27]

Kaiqing Zhang, Zhuoran Yang, Han Liu, Tong Zhang, and Tamer Basar. Fully decentralized multi-agent reinforcement learning with networked agents. In International Conference on Machine Learning, pages 5872--5881. PMLR, 2018.

[28]

Kaiqing Zhang, Zhuoran Yang, and Tamer Baar. Multi-agent reinforcement learning: A selective overview of theories and algorithms. Handbook of Reinforcement Learning and Control, pages 321--384, 2021.

[29]

Afshin Oroojlooy and Davood Hajinezhad. A review of cooperative multi-agent deep reinforcement learning. Applied Intelligence, pages 1--46, 2022.

[30]

Pengfei Li, Jianyi Yang, and Shaolei Ren. Robustified learning for online optimization with memory costs. In INFOCOM, 2023.

[31]

Alexander Wei and Fred Zhang. Optimal robustness-consistency trade-offs for learning-augmented online algorithms. In NeurIPS, 2020.

[32]

Joan Boyar, Lene M. Favrholdt, Christian Kudahl, Kim S. Larsen, and Jesper W. Mikkelsen. Online algorithms with advice: A survey. SIGACT News, 47(3):93--129, August 2016.

Digital Library

[33]

Étienne Bamas, Andreas Maggiori, and Ola Svensson. The primal-dual method for learning augmented algorithms. Advances in Neural Information Processing Systems, 33:20083--20094, 2020.

[34]

Thodoris Lykouris and Sergei Vassilvitskii. Competitive caching with machine learned advice. J. ACM, 68(4), July 2021.

Digital Library

[35]

Daan Rutten, Nicolas Christianson, Debankur Mukherjee, and Adam Wierman. Smoothed online optimization with unreliable predictions. Proc. ACM Meas. Anal. Comput. Syst., 7(1), mar 2023.

Digital Library

[36]

Yiheng Lin, Judy Gan, Guannan Qu, Yash Kanoria, and Adam Wierman. Decentralized online convex optimization in networked systems. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvari, Gang Niu, and Sivan Sabato, editors, Proceedings of the 39th International Conference on Machine Learning, volume 162 of Proceedings of Machine Learning Research, pages 13356--13393. PMLR, 17--23 Jul 2022.

[37]

M. Lin, A. Wierman, L. L. H. Andrew, and E. Thereska. Dynamic right-sizing for power-proportional data centers. In INFOCOM, 2011.

[38]

Mohammad A. Islam, Kishwar Ahmed, Hong Xu, Nguyen H. Tran, Gang Quan, and Shaolei Ren. Exploiting spatiotemporal diversity for water saving in geo-distributed data centers. IEEE Transactions on Cloud Computing, 6(3):734--746, 2018.

[39]

Meta. Sustainability report. https://sustainability.fb.com/, 2021.

[40]

Gautam Goel, Yiheng Lin, Haoyuan Sun, and Adam Wierman. Beyond online balanced descent: An optimal algorithm for smoothed online optimization. Advances in Neural Information Processing Systems, 32, 2019.

[41]

Richard Cheng, Abhinav Verma, Gabor Orosz, Swarat Chaudhuri, Yisong Yue, and Joel Burdick. Control regularization for reduced variance reinforcement learning. In Kamalika Chaudhuri and Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, volume 97 of Proceedings of Machine Learning Research, pages 1141--1150. PMLR, 09--15 Jun 2019.

[42]

Hoang M. Le, Andrew Kang, Yisong Yue, and Peter Carr. Smooth imitation learning for online sequence prediction. In ICML, 2016.

[43]

Pengfei Li, Jianyi Yang, and Shaolei Ren. Learning for edge-weighted online bipartite matching with robustness guarantees. In ICML, 2023.

[44]

Yunchang Yang, Tianhao Wu, Han Zhong, Evrard Garcelon, Matteo Pirotta, Alessandro Lazaric, Liwei Wang, and Simon Shaolei Du. A reduction-based framework for conservative bandits and reinforcement learning. In International Conference on Learning Representations, 2022.

[45]

Jakub Chdowski, Adam Polak, Bartosz Szabucki, and Konrad Tomasz ona. Robust learning-augmented caching: An experimental study. In ICML, 2021.

[46]

Le Yi Wang, Caisheng Wang, George Yin, Feng Lin, Michael P. Polis, Caiping Zhang, and Jiuchun Jiang. Balanced control strategies for interconnected heterogeneous battery systems. IEEE Transactions on Sustainable Energy, 7(1):189--199, 2016.

[47]

Ross Koningstein, Ian Schneider, Bokan Chen, Alexandre Duarte, Binz Roy, Diyue Xiao, Maya Haridasan, Patrick Hung, Nick Care, Saurav Talukdar, Eric Mullen, Kendal Smith, MariEllen Cottman, and Walfredo Cirne. Carbon-aware computing for datacenters. IEEE Transactions on Power Systems, 38(2):1270--1280, 2023.

[48]

Pengfei Li, Jianyi Yang, Adam Wierman, and Shaolei Ren. Towards environmentally equitable AI via geographical load balancing. In e-Energy, 2024.

[49]

Amba Kak and Sarah Myers West. AI Now 2023 landscape: Confronting tech power. AI Now Institute, April 2023.

[50]

Alejandro Garofali Acosta, Shaun Riordan, and Mario Torres Jarrín. The environmental and ethical challenges of artificial intelligence. ThinkTwenty (T20) Policy Brief, July 2023.

[51]

UNESCO. Recommendation on the ethics of artificial intelligence. In Policy Recommendation, 2022.

[52]

Elad Hazan. Introduction to online convex optimization. Foundations and Trends® in Optimization, 2(3--4):157--325, 2016.

[53]

Keerti Anand, Rong Ge, Amit Kumar, and Debmalya Panigrahi. A regression approach to learning-augmented online algorithms. In A. Beygelzimer, Y. Dauphin, P. Liang, and J. Wortman Vaughan, editors, Advances in Neural Information Processing Systems, 2021.

[54]

Manish Purohit, Zoya Svitkina, and Ravi Kumar. Improving online algorithms via ml predictions. In NeurIPS, 2018.

[55]

Goran Zuzic, Di Wang, Aranyak Mehta, and D. Sivakumar. Learning robust algorithms for online allocation problems using adversarial training. In https://arxiv.org/abs/2010.08418, 2020.

[56]

Pengfei Li, Jianyi Yang, and Shaolei Ren. Expert-calibrated learning for online optimization with switching costs. In SIGMETRICS, 2022.

Digital Library

[57]

Antonios Antoniadis, Christian Coester, Marek Elias, Adam Polak, and Bertrand Simon. Online metric algorithms with untrusted predictions. In ICML, 2020.

[58]

Andrew A Chien, Liuzixuan Lin, Hai Nguyen, Varsha Rao, Tristan Sharma, and Rajini Wijayawardana. Reducing the carbon impact of generative AI inference (today and in 2035). In Proceedings of the 2nd Workshop on Sustainable Computer Systems, HotCarbon '23, New York, NY, USA, 2023. Association for Computing Machinery.

Digital Library

[59]

Eli Cortez, Anand Bonde, Alexandre Muzio, Mark Russinovich, Marcus Fontoura, and Ricardo Bianchini. Resource central: Understanding and predicting workloads for improved resource management in large cloud platforms. In Proceedings of the 26th Symposium on Operating Systems Principles, pages 153--167, 2017.

Digital Library

[60]

Manajit Sengupta, Yu Xie, Anthony Lopez, Aron Habte, Galen Maclaurin, and James Shelby. The national solar radiation data base (nsrdb). Renewable and sustainable energy reviews, 89:51--60, 2018.

[61]

Can Wan, Jian Zhao, Yonghua Song, Zhao Xu, Jin Lin, and Zechun Hu. Photovoltaic and solar power forecasting for smart grid energy management. CSEE Journal of Power and Energy Systems, 1(4):38--46, 2015.

[62]

Asis Sarkar and Dhiren Kumar Behera. Wind turbine blade efficiency and power calculation with electrical analogy. International Journal of Scientific and Research Publications, 2(2):1--5, 2012.

[63]

Zhipeng Tu, Xi Wang, Yiguang Hong, Lei Wang, Deming Yuan, and Guodong Shi. Distributed online convex optimization with compressed communication. In S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, and A. Oh, editors, Advances in Neural Information Processing Systems, volume 35, pages 34492--34504. Curran Associates, Inc., 2022.

[64]

Daan Rutten, Nico Christianson, Debankur Mukherjee, and Adam Wierman. Online optimization with untrusted predictions. CoRR, abs/2202.03519, 2022.

[65]

Yifan Wu, Roshan Shariff, Tor Lattimore, and Csaba Szepesvári. Conservative bandits. In International Conference on Machine Learning, pages 1254--1262. PMLR, 2016.

[66]

Yunchang Yang, Tianhao Wu, Han Zhong, Evrard Garcelon, Matteo Pirotta, Alessandro Lazaric, Liwei Wang, and Simon Shaolei Du. A reduction-based framework for conservative bandits and reinforcement learning. In International Conference on Learning Representations, 2021.

[67]

Evrard Garcelon, Mohammad Ghavamzadeh, Alessandro Lazaric, and Matteo Pirotta. Conservative exploration in reinforcement learning. In Silvia Chiappa and Roberto Calandra, editors, Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, volume 108 of Proceedings of Machine Learning Research, pages 1431--1441. PMLR, 26--28 Aug 2020.

[68]

Akshay Agrawal, Brandon Amos, Shane Barratt, Stephen Boyd, Steven Diamond, and J. Zico Kolter. Differentiable convex optimization layers. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., 2019.

[69]

Noelle Walsh. How Microsoft measures datacenter water and energy use to improve Azure Cloud sustainability. Microsoft Azure Blog, April 2022.

[70]

Tesla. Tesla powerwall 2 datasheet - North America. https://www.tesla.com/sites/default/files/pdfs/powerwall/Powerwall%202_AC_Datasheet_en_northamerica.pdf.

[71]

LG Electronics. LG electronics home series energy storage system datasheet. https://www.lg.com/us/ess/pdf/Resi_LGEUS_Home_8_Spec_0524.pdf.

[72]

SolarEdge. Solaredge energy bank datasheet. https://knowledge-center.solaredge.com/sites/kc/files/se-energy-bank-battery-datasheet-nam.pdf.

[73]

IQ Battery System. IQ battery 10t datasheet. https://www.switchsolarusa.com/wp-content/uploads/2023/02/IQ-Battery-10T-DS-EN-US-10--25--2021.pdf.

[74]

FranklinWH. Franklin home power datasheet, https://www.franklinwh.com/document/franklin-home-power-v11-datasheet.

[75]

Moritz Hardt and Max Simchowitz. Convex optimization and approximation. https://ee227c.github.io/notes/ee227c-notes.pdf, 2018.

Index Terms

Learning-Augmented Decentralized Online Convex Optimization in Networks
1. Computing methodologies
  1. Distributed computing methodologies
  2. Machine learning

Recommendations

Expert-Calibrated Learning for Online Optimization with Switching Costs
POMACS

We study online convex optimization with switching costs, a practically important but also extremely challenging problem due to the lack of complete offline information. By tapping into the power of machine learning (ML) based optimizers, ML-augmented ...
Decentralized online convex optimization based on signs of relative states
Abstract
In this paper, we study a class of decentralized online convex optimization problems with time-varying loss functions over multi-agent networks. We propose a decentralized online subgradient method by using only the signs of the ...
Online Unit Covering in Euclidean Space
Combinatorial Optimization and Applications
Abstract
We revisit the online Unit Covering problem in higher dimensions: Given a set of n points in , that arrive one by one, cover the points by balls of unit radius, so as to minimize the number of balls used. In this paper, we work in using Euclidean ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Proceedings of the ACM on Measurement and Analysis of Computing Systems

Proceedings of the ACM on Measurement and Analysis of Computing Systems Volume 8, Issue 3

POMACS

December 2024

588 pages

EISSN:2476-1249

DOI:10.1145/3708555

Editors:
John C.S. Lui
The Chinese University of Hong Kong, Hong Kong
,
Leana Golubchik
University of Southern California, United States
,
Zhi-Li Zhang
University of Minnesota, United States

Issue’s Table of Contents

Copyright © 2024 Owner/Author.

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives International 4.0 License.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 December 2024

Published in POMACS Volume 8, Issue 3

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
129
Total Downloads

Downloads (Last 12 months)129
Downloads (Last 6 weeks)129

Reflects downloads up to 17 Jan 2025

Other Metrics

View Author Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Media

Figures

Other

Tables

View Issue’s Table of Contents