Owing to advances in artificial intelligence, computers can beat humans at increasingly complex games. Nevertheless, algorithms in this space are often developed with a single game in mind. More importantly, existing algorithms can play games of either perfect information, in which each player knows or can see the other players' moves (such as chess and Go), or imperfect information, in which some aspects of the game are unknown or hidden to some or all players (such as poker), but never both, because the two game types mostly require different strategies. In a recent study, Michael Bowling and colleagues introduce an algorithm, called Student of Games (SoG), that performs well with both perfect and imperfect information.
The search algorithm in SoG is based on counterfactual regret minimization (CFR), an iterative approach that converges to a Nash equilibrium in two-player zero-sum games and is often used to solve games of imperfect information. In a nutshell, SoG trains agents via self-play: each player uses a CFR-based search to generate a policy, a set of probability distributions over game actions, for the current game state, and then samples an action to take from that policy. The authors evaluated SoG on four games: two of perfect information (chess and Go) and two of imperfect information (poker and Scotland Yard). In chess and Go, SoG does not perform as well as a previously proposed algorithm (AlphaZero), but it still plays strongly and scales with increasing availability of computational resources. Notably, SoG beats state-of-the-art agents in poker and Scotland Yard. The ability of the proposed algorithm to perform well in fundamentally different game types will likely prove useful in the continued development of artificial intelligence and game theory.
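To give a flavour of the idea behind CFR, the sketch below implements plain regret matching, the building block on which CFR rests, for rock-paper-scissors. This toy game is not from the study, and the code is a minimal illustration rather than the SoG implementation: two self-playing agents accumulate regret for the actions they did not take, play in proportion to positive regret, and their time-averaged strategies drift toward the game's Nash equilibrium (uniform play at probability 1/3 each).

```python
import random

ACTIONS = 3  # 0 = rock, 1 = paper, 2 = scissors

def payoff(a, b):
    """Payoff for playing action a against action b: +1 win, -1 loss, 0 tie."""
    if a == b:
        return 0
    return 1 if (a - b) % 3 == 1 else -1

def strategy_from_regrets(regrets):
    """Regret matching: play actions in proportion to their positive regret."""
    positives = [max(r, 0.0) for r in regrets]
    total = sum(positives)
    if total > 0:
        return [p / total for p in positives]
    return [1.0 / ACTIONS] * ACTIONS  # no positive regret yet: play uniformly

def train(iterations=20000, seed=0):
    rng = random.Random(seed)
    regrets = [[0.0] * ACTIONS for _ in range(2)]
    strategy_sums = [[0.0] * ACTIONS for _ in range(2)]
    for _ in range(iterations):
        strategies = [strategy_from_regrets(r) for r in regrets]
        actions = [rng.choices(range(ACTIONS), weights=s)[0] for s in strategies]
        for p in range(2):
            opp = actions[1 - p]
            earned = payoff(actions[p], opp)
            for a in range(ACTIONS):
                # Regret = what action a would have earned minus what we earned.
                regrets[p][a] += payoff(a, opp) - earned
                strategy_sums[p][a] += strategies[p][a]
    # The *average* strategy over all iterations is what converges to equilibrium.
    return [[s / iterations for s in sums] for sums in strategy_sums]

avg = train()
print(avg[0])  # each probability should be near 1/3
```

SoG goes well beyond this sketch, combining CFR-style search with learned value functions so that the same machinery also handles perfect-information games, but the regret-driven update above is the core loop that makes the imperfect-information side work.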