Joint chance-constrained Markov decision processes

Abstract

We consider a finite state-action uncertain constrained Markov decision process under discounted and average cost criteria. The running costs are defined by random variables and the transition probabilities are known. The uncertainties present in the objective function and the constraints are modelled using chance constraints. We assume that the random cost vectors follow multivariate elliptically symmetric distributions and that the dependence among the random constraints is driven by a Gumbel–Hougaard copula. We propose two second-order cone programming problems whose optimal values give lower and upper bounds on the optimal value of the uncertain constrained Markov decision process. As an application, we study a stochastic version of a service and admission control problem in a queueing system. The proposed approximation methods are illustrated on randomly generated instances of the queueing control problem as well as on a well-known class of Markov decision problems known as Garnets.
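As a rough illustration of two ingredients named in the abstract, the sketch below evaluates the Gumbel–Hougaard copula and the left-hand side of the textbook second-order cone reformulation of a single chance constraint. It is not the authors' formulation, which is not shown in this preview. It assumes the normal special case of an elliptical distribution and a probability level p of at least 0.5, and the function names `gumbel_hougaard` and `normal_chance_lhs` are hypothetical.

```python
import math
from statistics import NormalDist


def gumbel_hougaard(u, theta):
    """Gumbel-Hougaard copula: exp(-(sum_k (-ln u_k)^theta)^(1/theta)), theta >= 1."""
    if theta < 1:
        raise ValueError("theta must be >= 1")
    if any(not 0.0 < uk <= 1.0 for uk in u):
        raise ValueError("each u_k must lie in (0, 1]")
    s = sum((-math.log(uk)) ** theta for uk in u)
    return math.exp(-(s ** (1.0 / theta)))


def normal_chance_lhs(mu, cov, x, p):
    """Return mu^T x + Phi^{-1}(p) * sqrt(x^T cov x).

    Assumption (not from the paper): c ~ N(mu, cov) and p >= 0.5, so that
    P(c^T x <= b) >= p holds exactly when this value is <= b, which is a
    second-order cone constraint in x.
    """
    n = len(x)
    mean = sum(mu[i] * x[i] for i in range(n))
    var = sum(x[i] * cov[i][j] * x[j] for i in range(n) for j in range(n))
    return mean + NormalDist().inv_cdf(p) * math.sqrt(var)
```

With theta = 1 the copula reduces to the product of its arguments (independence), and as theta grows it approaches the minimum of its arguments (perfect positive dependence).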

Acknowledgements

The research of the first author was supported by CSIR, India. The research of the second and third authors was supported by DST/CEFIPRA Project No. IFC/4117/DST-CNRS-5th call/2017-18/2 and CNRS Project No. AR/SB:2018-07-440, respectively.

Author information

Authors and Affiliations

Department of Mathematics, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, 110016, India
V Varagapriya & Vikas Vikram Singh
CNRS, CentraleSupelec, Laboratoire des Signaux et Systemes, Universite Paris Saclay, Bat Breguet, 3 Rue Joliot Curie, 91190, Gif-sur-Yvette, France
Abdel Lisser

Corresponding author

Correspondence to V Varagapriya.

Ethics declarations

Conflict of interest

The authors declare that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Varagapriya, V., Singh, V.V. & Lisser, A. Joint chance-constrained Markov decision processes. Ann Oper Res 322, 1013–1035 (2023). https://doi.org/10.1007/s10479-022-05025-3

Accepted: 12 October 2022
Published: 27 October 2022
Issue Date: March 2023
DOI: https://doi.org/10.1007/s10479-022-05025-3
