Abstract
Self-learning agents are increasingly used for dynamic pricing. Reinforcement learning has been shown to serve as a toolkit for efficiently developing pricing strategies in dynamic environments. In many real-world settings, multiple market participants can be expected to rely on such self-learning agents for their pricing decisions. From the perspective of a single agent, this violates the fundamental Markov property, which destabilizes the learning process. Past publications proposed relying on asymmetric information to achieve equilibria and usually focused on tabular solutions or solvers. We combine multi-agent learning and asymmetric information with function approximation tools for high-dimensional state spaces by exchanging policy information between multiple actors. We discuss possible problems and their solutions and propose a simulation environment for further evaluation of the developed system.
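The asymmetric-information idea can be illustrated with a minimal sketch: a leader commits to a price first, and the follower observes the leader's action before choosing its own, so the follower's value function is conditioned on the leader's move. This is a toy tabular Stackelberg pricing game with a hypothetical linear demand function, not the paper's function-approximation method; all price levels, the demand model, and the learning parameters are illustrative assumptions.

```python
import random

PRICES = [1, 2, 3]  # discrete price levels (illustrative)

def profit(p_own, p_other):
    # Hypothetical linear demand: a lower own price attracts more demand.
    demand = max(0, 10 - 2 * p_own + p_other)
    return p_own * demand

# Leader values depend only on its own action (it commits first).
q_leader = {p: 0.0 for p in PRICES}
# Follower values are conditioned on the leader's observed price:
# this conditioning is the asymmetric-information element.
q_follower = {(pl, pf): 0.0 for pl in PRICES for pf in PRICES}

alpha, eps = 0.1, 0.2  # learning rate and exploration rate
random.seed(0)

for episode in range(5000):
    # Leader acts first (epsilon-greedy).
    if random.random() < eps:
        p_l = random.choice(PRICES)
    else:
        p_l = max(PRICES, key=lambda p: q_leader[p])
    # Follower observes the leader's price before acting.
    if random.random() < eps:
        p_f = random.choice(PRICES)
    else:
        p_f = max(PRICES, key=lambda p: q_follower[(p_l, p)])
    # One-shot rewards; incremental averaging of observed profits.
    q_leader[p_l] += alpha * (profit(p_l, p_f) - q_leader[p_l])
    q_follower[(p_l, p_f)] += alpha * (profit(p_f, p_l) - q_follower[(p_l, p_f)])

best_leader = max(PRICES, key=lambda p: q_leader[p])
best_follower = max(PRICES, key=lambda p: q_follower[(best_leader, p)])
print(best_leader, best_follower)
```

Because the follower's table is indexed by the leader's action, the follower learns a best response to each leader price rather than to an unobserved, non-stationary opponent, which is how the asymmetric setup sidesteps the Markov-property violation in this simple case.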
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
Cite this paper
Kastius, A., Kiele, N., Schlosser, R. (2023). Multi-agent Dynamic Pricing Using Reinforcement Learning and Asymmetric Information. In: Grothe, O., Nickel, S., Rebennack, S., Stein, O. (eds) Operations Research Proceedings 2022. OR 2022. Lecture Notes in Operations Research. Springer, Cham. https://doi.org/10.1007/978-3-031-24907-5_66
Print ISBN: 978-3-031-24906-8
Online ISBN: 978-3-031-24907-5
eBook Packages: Business and Management (R0)