More Web Proxy on the site http://driver.im/

Article

An algorithm for constructing and solving imperfect recall abstractions of large extensive-form games

Authors:

Jiří Čermák,

Branislav Bošansky,

Viliam LisyAuthors Info & Claims

IJCAI'17: Proceedings of the 26th International Joint Conference on Artificial Intelligence

Pages 936 - 942

Published: 19 August 2017 Publication History

Abstract

We solve large two-player zero-sum extensive-form games with perfect recall. We propose a new algorithm based on fictitious play that significantly reduces memory requirements for storing average strategies. The key feature is exploiting imperfect recall abstractions while preserving the convergence rate and guarantees of fictitious play applied directly to the perfect recall game. The algorithm creates a coarse imperfect recall abstraction of the perfect recall game and automatically refines its information set structure only where the imperfect recall might cause problems. Experimental evaluation shows that our novel algorithm is able to solve a simplified poker game with 7 ċ 10⁵ information sets using an abstracted game with only 1:8% of information sets of the original game. Additional experiments on poker and randomly generated games suggest that the relative size of the abstraction decreases as the size of the solved games increases.

References

[1]

Branislav Bošansky, Albert Xin Jiang, Milind Tambe, and Christopher Kiekintveld. Combining compact representation and incremental generation in large games with sequential strategies. In AAAI , pages 812-818, 2015.

Digital Library

[2]

Branislav Bošansky, Viliam Lisy, Marc Lanctot, Jiří Čermák, and Mark H.M. Winands. Algorithms for Computing Strategies in Two-Player Simultaneous Move Games. Artificial Intelligence , 237:1-40, 2016.

Digital Library

[3]

Branislav Bošansky, Christopher Kiekintveld, Viliam Lisy, and Michal Pechouček. An Exact Double-Oracle Algorithm for Zero-Sum Extensive-Form Games with Imperfect Information. Journal of Artificial Intelligence Research , 51:829-866, 2014.

Digital Library

[4]

Noam Brown and Tuomas W Sandholm. Simultaneous abstraction and equilibrium finding in games. In AAAI , 2015.

Digital Library

[5]

George W Brown. Some notes on computation of games solutions. Technical report, DTIC Document, 1949.

[6]

Jiři Čermák, Branislav Bošansky, and Michal Pechouček. Combining incremental strategy generation and branch and bound search for computing maxmin strategies in imperfect recall games. In AAMAS , pages 902-910, 2017.

Digital Library

[7]

Katherine Chen and Michael Bowling. Tractable objectives for robust policy optimization. In NIPS , pages 2078-2086, 2012.

Digital Library

[8]

Constantinos Daskalakis and Qinxuan Pan. A counter-example to karlin's strong conjecture for fictitious play. In Annual Symposium on Foundations of Computer Science , pages 11-20. IEEE, 2014.

Digital Library

[9]

Andrew Gilpin and Tuomas Sandholm. Lossless Abstraction of Imperfect Information Games. Journal of the ACM , 54(5), 2007.

Digital Library

[10]

Andrew Gilpin, Tuomas Sandholm, and Troels Bjerre Sørensen. Potential-aware automated abstraction of sequential games, and holistic equilibrium analysis of texas hold'em poker. In AAAI , pages 50-57, 2007.

Digital Library

[11]

Johannes Heinrich, Marc Lanctot, and David Silver. Fictitious self-play in extensive-form games. In ICML , pages 805-813, 2015.

Digital Library

[12]

Michael Johanson, Kevin Waugh, Michael Bowling, and Martin Zinkevich. Accelerating best response calculation in large extensive games. In IJCAI , volume 11, pages 258-265, 2011.

Digital Library

[13]

Samuel Karlin. Mathematical methods and theory in games, programming, and economics , volume 2. Courier Corporation, 2003.

[14]

Christian Kroer and Tuomas Sandholm. Extensive-Form Game Abstraction with Bounds. In EC , pages 621-638. ACM, 2014.

Digital Library

[15]

Christian Kroer and Tuomas Sandholm. Imperfect-Recall Abstractions with Bounds in Games. In EC , pages 459-476. ACM, 2016.

Digital Library

[16]

Harold W. Kuhn. Extensive Games and the Problem of Information. Contributions to the Theory of Games , II:193-216, 1953.

[17]

Marc Lanctot, Richard Gibson, Neil Burch, Martin Zinkevich, and Michael Bowling. No-Regret Learning in Extensive-Form Games with Imperfect Recall. In ICML , pages 1-21, 2012.

Digital Library

[18]

Viliam Lisy, Trevor Davis, and Michael Bowling. Counterfactual Regret Minimization in Sequential Security Games. In AAAI , 2016.

[19]

Matej Moravčík, Martin Schmid, Neil Burch, Viliam Lisy, Dustin Morrill, Nolan Bard, Trevor Davis, Kevin Waugh, Michael Johanson, and Michael Bowling. Deepstack: Expert-level artificial intelligence in heads-up no-limit poker. Science , 2017.

[20]

Julia Robinson. An iterative method of solving a game. Annals of mathematics , pages 296-301, 1951.

[21]

Martin Schmid, Matej Moravcik, and Milan Hladik. Bounding the support size in extensive form games with imperfect information. In AAAI , pages 784-790, 2014.

Digital Library

[22]

Bernhard von Stengel. Efficient Computation of Behavior Strategies. Games and Economic Behavior , 14:220-246, 1996.

[23]

Kevin Waugh, Martin Zinkevich, Michael Johanson, Morgan Kan, David Schnizlein, and Michael H Bowling. A Practical Use of Imperfect Recall. In Symposium on Abstraction, Reformulation and Approximation (SARA) , 2009.

[24]

M. Zinkevich, M. Johanson, M. Bowling, and C. Piccione. Regret Minimization in Games with Incomplete Information. In NIPS , pages 1729-1736, 2008.

Digital Library

Cited By

Farina GCelli AGatti NSandholm T(2018)Ex ante coordination and collusion in zero-sum multi-player extensive-form gamesProceedings of the 32nd International Conference on Neural Information Processing Systems10.5555/3327546.3327633(9661-9671)Online publication date: 3-Dec-2018
https://dl.acm.org/doi/10.5555/3327546.3327633
Kroer CSandholm T(2018)A unified framework for extensive-form game abstraction with boundsProceedings of the 32nd International Conference on Neural Information Processing Systems10.5555/3326943.3327000(613-624)Online publication date: 3-Dec-2018
https://dl.acm.org/doi/10.5555/3326943.3327000

An algorithm for constructing and solving imperfect recall abstractions of large extensive-form games
1. Information systems
  1. Information systems applications
    1. Multimedia information systems
2. Software and its engineering
  1. Software organization and properties
    1. Contextual software domains
      1. Virtual worlds software

Recommendations

Imperfect-Recall Abstractions with Bounds in Games
EC '16: Proceedings of the 2016 ACM Conference on Economics and Computation

Imperfect-recall abstraction has emerged as the leading paradigm for practical large-scale equilibrium computation in imperfect-information games. However, imperfect-recall abstractions are poorly understood, and only weak algorithm-specific guarantees ...
Solving Imperfect Recall Games
AAMAS '17: Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems

Imperfect recall games remain an unexplored part of the game theory, even though recent results in abstraction algorithms show that the imperfect recall might be the key to solving the immense games found in the real world efficiently. Our research ...
Solving Extensive-Form Games Using Imperfect Recall Abstraction / Řešení Her v Extensivní Formě Pomocí Imperfect Recall Abstrakci

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings

IJCAI'17: Proceedings of the 26th International Joint Conference on Artificial Intelligence

August 2017

5253 pages

ISBN:9780999241103

Editor:
Carles Sierra
IIIA-CSIC

Sponsors

Australian Comp Soc: Australian Computer Society
NSF: National Science Foundation
Griffith University
University of Technology Sydney
AI Journal: AI Journal

Publisher

AAAI Press

Publication History

Published: 19 August 2017

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 23 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Farina GCelli AGatti NSandholm T(2018)Ex ante coordination and collusion in zero-sum multi-player extensive-form gamesProceedings of the 32nd International Conference on Neural Information Processing Systems10.5555/3327546.3327633(9661-9671)Online publication date: 3-Dec-2018
https://dl.acm.org/doi/10.5555/3327546.3327633
Kroer CSandholm T(2018)A unified framework for extensive-form game abstraction with boundsProceedings of the 32nd International Conference on Neural Information Processing Systems10.5555/3326943.3327000(613-624)Online publication date: 3-Dec-2018
https://dl.acm.org/doi/10.5555/3326943.3327000

View Options

View options

Media

Figures

Other

Tables

View Table of Contents