Do Artificial Agents Reproduce Human Strategies in the Advisers’ Game?

Maximilian Moll,
Jurgis Karpus &
Bahador Bahrami

Part of the book series: Lecture Notes in Operations Research (LNOR)

Included in the following conference series:

International Conference on Operations Research

Abstract

Game theory has recently been used to study optimal advice-giving strategies in settings where multiple advisers compete for a single client’s attention. In the advisers’ game, a client chooses between two well-informed advisers to place bets under uncertainty. Experiments have shown that human advisers can learn to play strategically rather than honestly in order to exploit client behavior. Here, we analyze the conditions under which agents trained with Q-learning adopt similar strategies. To this end, the agent is trained against different heuristics and against itself.
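
As a rough illustration of the learning rule named in the abstract, the following is a minimal sketch of tabular Q-learning with epsilon-greedy exploration. It is not the authors' implementation: the state label `s0`, the action labels `a0` and `a1`, the reward table `toy_reward`, and all hyperparameter values are hypothetical placeholders, and the toy environment is a single-state problem rather than the advisers’ game.

```python
import random
from collections import defaultdict


def q_update(q, state, action, reward, next_state, actions, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step.

    Moves Q(state, action) toward reward + gamma * max over a' of Q(next_state, a').
    """
    best_next = max(q[(next_state, a)] for a in actions)
    target = reward + gamma * best_next
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]


def epsilon_greedy(q, state, actions, epsilon, rng):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q[(state, a)])


def train_toy(episodes=2000, seed=0):
    """Train on a hypothetical single-state toy problem (NOT the advisers' game)."""
    rng = random.Random(seed)
    actions = ["a0", "a1"]                 # hypothetical action labels
    toy_reward = {"a0": 0.0, "a1": 1.0}    # hypothetical fixed rewards
    q = defaultdict(float)                 # unseen (state, action) pairs start at 0
    state = "s0"                           # single hypothetical state
    for _ in range(episodes):
        action = epsilon_greedy(q, state, actions, epsilon=0.1, rng=rng)
        # gamma=0.0 makes each Q-value track the immediate reward in this toy.
        q_update(q, state, action, toy_reward[action], state, actions,
                 alpha=0.1, gamma=0.0)
    return q
```

Training against a heuristic or in self-play, as described above, would replace the fixed `toy_reward` table with rewards produced by the opponent's and the client's behavior; those details are not given here, so they are left out of the sketch.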

Acknowledgements

Jurgis Karpus was supported by LMUexcellent, funded by the Federal Ministry of Education and Research (BMBF) and the Free State of Bavaria under the Excellence Strategy of the Federal Government and the Länder. B. B. was supported by the Humboldt Foundation and the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (819040-acronym: rid-O).

Author information

Authors and Affiliations

Universität der Bundeswehr München, Werner-Heisenberg-Weg 39, 85577, Neubiberg, Germany
Maximilian Moll
Ludwig-Maximilians-Universität München, Geschwister-Scholl-Platz 1, 80539, München, Germany
Jurgis Karpus & Bahador Bahrami

Corresponding author

Correspondence to Maximilian Moll.

Editor information

Editors and Affiliations

Institute for Operations Research, Karlsruhe Institute of Technology, Karlsruhe, Germany
Oliver Grothe
Institute for Operations Research, Karlsruhe Institute of Technology, Karlsruhe, Germany
Stefan Nickel
Institute for Operations Research, Karlsruhe Institute of Technology, Karlsruhe, Germany
Steffen Rebennack
Institute for Operations Research, Karlsruhe Institute of Technology, Karlsruhe, Germany
Oliver Stein

About this paper

Cite this paper

Moll, M., Karpus, J., Bahrami, B. (2023). Do Artificial Agents Reproduce Human Strategies in the Advisers’ Game? In: Grothe, O., Nickel, S., Rebennack, S., Stein, O. (eds) Operations Research Proceedings 2022. OR 2022. Lecture Notes in Operations Research. Springer, Cham. https://doi.org/10.1007/978-3-031-24907-5_72

DOI: https://doi.org/10.1007/978-3-031-24907-5_72
Published: 30 August 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-24906-8
Online ISBN: 978-3-031-24907-5
eBook Packages: Business and Management (R0)
