On the optimality of the Gittins index rule for multi-armed bandits with multiple plays

Mathematical Methods of Operations Research

Abstract.

We investigate the general multi-armed bandit problem with multiple servers. We determine a condition on the reward processes sufficient to guarantee the optimality of the strategy that operates at each instant of time the projects with the highest Gittins indices. We call this strategy the Gittins index rule for multi-armed bandits with multiple plays, or briefly the Gittins index rule. We show by examples that: (i) the aforementioned sufficient condition is not necessary for the optimality of the Gittins index rule; and (ii) when the sufficient condition is relaxed the Gittins index rule is not necessarily optimal. Finally, we present an application of the general results to the multiserver scheduling of parallel queues without arrivals.
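The selection step of the rule described above can be sketched in a few lines of Python. This is only an illustration, not the authors' formulation: it assumes the Gittins index of every project has already been computed, and the function name `gittins_index_rule`, the parameter `servers`, and the tie-breaking by lowest project id are all hypothetical choices made for the example.

```python
import heapq
from typing import List, Sequence


def gittins_index_rule(indices: Sequence[float], servers: int) -> List[int]:
    """Return the ids of the projects to operate at the current instant.

    `indices[i]` is assumed to hold the current Gittins index of project i
    (computing the index itself is outside the scope of this sketch).
    The `servers` projects with the highest indices are selected; ties are
    broken in favour of the lower project id, which is an arbitrary choice.
    """
    if servers < 0:
        raise ValueError("servers must be non-negative")
    # Rank projects by descending index, then ascending id, and keep the top ones.
    return heapq.nsmallest(
        servers,
        range(len(indices)),
        key=lambda i: (-indices[i], i),
    )


if __name__ == "__main__":
    # Four projects, two servers: projects 1 and 3 have the highest indices.
    print(gittins_index_rule([0.4, 0.9, 0.1, 0.7], servers=2))
```

In a simulation, this function would be called at every decision instant with the updated index of each project; whether the resulting strategy is optimal depends on the condition on the reward processes that the paper establishes.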

Additional information

Manuscript received: March 1999; final version received: July 1999

About this article

Cite this article

Pandelis, D., Teneketzis, D. On the optimality of the Gittins index rule for multi-armed bandits with multiple plays. Mathematical Methods of OR 50, 449–461 (1999). https://doi.org/10.1007/s001860050080

  • DOI: https://doi.org/10.1007/s001860050080
