Computer Science > Machine Learning

arXiv:2305.19158 (cs)

[Submitted on 30 May 2023 (v1), last revised 4 Aug 2023 (this version, v2)]

Title:Competing for Shareable Arms in Multi-Player Multi-Armed Bandits

Authors:Renzhe Xu, Haotian Wang, Xingxuan Zhang, Bo Li, Peng Cui

View PDF

Abstract:Competitions for shareable and limited resources have long been studied with strategic agents. In reality, agents often have to learn and maximize the rewards of the resources at the same time. To design an individualized competing policy, we model the competition between agents in a novel multi-player multi-armed bandit (MPMAB) setting where players are selfish and aim to maximize their own rewards. In addition, when several players pull the same arm, we assume that these players averagely share the arms' rewards by expectation. Under this setting, we first analyze the Nash equilibrium when arms' rewards are known. Subsequently, we propose a novel Selfish MPMAB with Averaging Allocation (SMAA) approach based on the equilibrium. We theoretically demonstrate that SMAA could achieve a good regret guarantee for each player when all players follow the algorithm. Additionally, we establish that no single selfish player can significantly increase their rewards through deviation, nor can they detrimentally affect other players' rewards without incurring substantial losses for themselves. We finally validate the effectiveness of the method in extensive synthetic experiments.

Comments:	ICML 2023
Subjects:	Machine Learning (cs.LG); Computers and Society (cs.CY); Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA)
Cite as:	arXiv:2305.19158 [cs.LG]
	(or arXiv:2305.19158v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2305.19158

Submission history

From: Renzhe Xu [view email]
[v1] Tue, 30 May 2023 15:59:56 UTC (625 KB)
[v2] Fri, 4 Aug 2023 06:29:20 UTC (626 KB)

Computer Science > Machine Learning

Title:Competing for Shareable Arms in Multi-Player Multi-Armed Bandits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Competing for Shareable Arms in Multi-Player Multi-Armed Bandits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators