DOI: 10.5555/3535850.3535949

Efficient Algorithms for Finite Horizon and Streaming Restless Multi-Armed Bandit Problems

Published: 09 May 2022

Abstract

We propose Streaming Bandits, a Restless Multi-Armed Bandit (RMAB) framework in which heterogeneous arms may arrive and leave the system after staying on for a finite lifetime. Streaming Bandits naturally capture the health-intervention planning problem, where health workers must manage the health outcomes of a patient cohort while new patients join and existing patients leave the cohort each day. Our contributions are as follows: (1) We derive conditions under which our problem satisfies indexability, a pre-condition that guarantees the existence and asymptotic optimality of the Whittle Index solution for RMABs. We establish the conditions using a polytime reduction of the Streaming Bandit setup to regular RMABs. (2) We further prove a phenomenon that we call index decay - whereby the Whittle index values are low for short residual lifetimes - driving the intuition underpinning our algorithm. (3) We propose a novel and efficient algorithm to compute the index-based solution for Streaming Bandits. Unlike previous methods, our algorithm does not rely on solving the costly finite horizon problem on each arm of the RMAB, thereby lowering the computational complexity compared to existing methods. (4) Finally, we evaluate our approach via simulations run on real-world data sets from a tuberculosis patient monitoring task and an intervention planning task for improving maternal healthcare, in addition to other synthetic domains. Across the board, our algorithm achieves a 2-orders-of-magnitude speed-up over existing methods while maintaining the same solution quality. The full paper is available at: https://arxiv.org/pdf/2103.04730.pdf
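To make the index-decay phenomenon concrete, the following is a minimal sketch (not the paper's algorithm, whose point is precisely to avoid the costly per-arm finite-horizon solve) of the naive baseline: computing a finite-horizon Whittle index for a single two-state arm by backward induction plus binary search on the passivity subsidy. The transition matrices and rewards are hypothetical example parameters, not taken from the paper.

```python
import numpy as np

def finite_horizon_q(P, R, lam, h):
    """Q-values at the first of h remaining steps for one two-action arm.
    The passive action (a=0) earns an extra subsidy lam; undiscounted."""
    assert h >= 1
    V = np.zeros(len(R))
    for _ in range(h):                # backward induction over the horizon
        Q0 = R + lam + P[0] @ V      # passive: reward + subsidy + future value
        Q1 = R + P[1] @ V            # active: reward + future value
        V = np.maximum(Q0, Q1)
    return Q0, Q1

def whittle_index(P, R, s, h, lo=-10.0, hi=10.0, iters=60):
    """Binary-search the subsidy at which passive and active are equally
    attractive in state s with h steps of residual lifetime."""
    for _ in range(iters):
        lam = 0.5 * (lo + hi)
        Q0, Q1 = finite_horizon_q(P, R, lam, h)
        if Q1[s] > Q0[s]:
            lo = lam                 # acting still preferred: raise subsidy
        else:
            hi = lam
    return 0.5 * (lo + hi)

# Hypothetical two-state arm: state 1 is the "good" state (reward 1);
# acting raises the chance of reaching and staying in it.
P = np.array([[[0.95, 0.05],   # passive transitions from states 0, 1
               [0.40, 0.60]],
              [[0.50, 0.50],   # active transitions from states 0, 1
               [0.10, 0.90]]])
R = np.array([0.0, 1.0])

indices = [whittle_index(P, R, s=0, h=h) for h in (1, 2, 5, 10)]
```

On this example arm, the index in the bad state is 0 at residual lifetime h = 1 (acting cannot pay off without a future step), 0.45 at h = 2, and grows with longer residual lifetimes; read in reverse, the index decays as an arm approaches the end of its lifetime, which is the intuition the paper's algorithm exploits.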

Supplementary Material

ZIP file (fp717aux.pdf.zip): the full paper PDF, including the appendix.


Cited By

  • (2024) Towards Zero Shot Learning in Restless Multi-armed Bandits. Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems. 10.5555/3635637.3663246. 2618--2620. Online publication date: 6 May 2024.

Published In

AAMAS '22: Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems
May 2022, 1990 pages
ISBN: 9781450392136

Publisher

International Foundation for Autonomous Agents and Multiagent Systems, Richland, SC

Author Tags

  1. finite horizon
  2. intervention planning
  3. restless multi-armed bandits
  4. Whittle index

Qualifiers

  • Research article

Funding Sources

  • Army Research Office

Conference

AAMAS '22

Acceptance Rates

Overall acceptance rate: 1,155 of 5,036 submissions, 23%
