Abstract
Game theory has been recently used to study optimal advice-giving strategies in settings where multiple advisers compete for a single client’s attention. In the advisers’ game, a client chooses between two well informed advisers to place bets under uncertainty. Experiments have shown that human advisers can learn to play strategically instead of honestly to exploit client behavior. Here, we analyze under which conditions agents trained with Q-learning can adopt similar strategies. To this end, the agent is trained against different heuristics and itself.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Hertz, U., Palminteri, S., Brunetti, S., Olesen, C., Frith, C. D., & Bahrami, B. (2017). Neural computations underpinning the strategic management of influence in advice giving. Nature Communications, 8, 2191. https://doi.org/10.1038/s41467-017-02314-5
Kruvers, R. H. J. M., Hertz, U., Karpus, J., Balode, M., Jayles, B., Binmore, K., & Bahrami, B. (2021). Strategic disinformation outperforms honesty in competition for social influence. Nature Communications, 24(12), 103505 (2021). https://doi.org/10.1016/j.isci.2021.103505
Lanctot, M., Lockhart, E., Lespiau, J. B., Zambaldi, V., Upadhyay, S., Pérolat, J., ... , & Ryan-Davis, J. (2019). OpenSpiel: A framework for reinforcement learning in games. arXiv preprint arXiv:1908.09453
Mukhopadhyay, S.,Tilak, O., & Chakrabarti, S. (2018). Reinforcement learning algorithms for uncertain, dynamic, zero-sum games. In 17th IEEE International Conference on Machine Learning and Applications (ICMLA) (pp. 48–54). https://doi.org/10.1109/ICMLA.2018.00015
Ni, Z., & Paul, S. (2019). A multistage game in smart grid security: A reinforcement learning solution. IEEE Transactions on Neural Networks and Learning Systems, 30(9), 2684–2695. https://doi.org/10.1109/TNNLS.2018.2885530
Sutton, R. S., & Barto, A. G. (2018). Reinforcement learning: An introduction. Cambridge: MIT Press.
Vinyals, O., Babuschkin, I., Czarnecki, W. M., Mathieu, M., Dudzik, A., Chung, J., ..., & Silver, D. (2019). Grandmaster level in StarCraft II using multi-agent reinforcement learning. Nature, 575(7782), 350–354. https://doi.org/10.1038/s41586-019-1724-z
Watkins, C. J. C. H. (1989). Learning from delayed rewards (Ph.D. thesis). University of Cambridge, England.
Acknowledgements
Jurgis Karpus was supported by LMUexcellent, funded by the Federal Ministry of Education and Research (BMBF) and the Free State of Bavaria under the Excellence Strategy of the Federal Government and the Länder. B. B. was supported by the Humboldt Foundation and the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (819040-acronym: rid-O).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Moll, M., Karpus, J., Bahrami, B. (2023). Do Artificial Agents Reproduce Human Strategies in the Advisers’ Game?. In: Grothe, O., Nickel, S., Rebennack, S., Stein, O. (eds) Operations Research Proceedings 2022. OR 2022. Lecture Notes in Operations Research. Springer, Cham. https://doi.org/10.1007/978-3-031-24907-5_72
Download citation
DOI: https://doi.org/10.1007/978-3-031-24907-5_72
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-24906-8
Online ISBN: 978-3-031-24907-5
eBook Packages: Business and ManagementBusiness and Management (R0)