Total Reward Variance in Discrete and Continuous Time Markov Chains

Karel Sladký² &
Nico M. van Dijk³

Part of the book series: Operations Research Proceedings ((ORP,volume 2004))

6752 Accesses
2 Citations

Abstract

This note studies the variance of total cumulative rewards for Markov reward chains in both discrete and continuous time. It is shown that parallel results can be obtained for both cases.

First, explicit formulae are presented for the variance within finite time. Next, the infinite time horizon is considered. Most notably, it is concluded that the variance has a linear growth rate. Explicit expressions are provided, related to the standard average reward case, to compute this growth rate.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 103.50; Price includes VAT (United Kingdom)

Softcover Book: GBP 129.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Long-Run Rewards for Markov Automata

Estimation for Discrete-time Semi-Markov Reward Processes: Analysis and Inference

Article 03 November 2015

On a reward rate estimation for the finite irreducible continuous-time Markov chain

Article 01 September 2017

References

Benito, F. (1982): Calculating the variance in Markov processes with random reward. Trabajos de Estadistica y de Investigacion Operativa, 33, 73–85
MATH MathSciNet Google Scholar
Mandl, P. (1971): On the variance in controlled Markov chains. Kybernetika, 7, 1–12
MATH MathSciNet Google Scholar
Puterman, M. L. (1994): Markov Decision Processes — Discrete Stochastic Dynamic Programming. Wiley, New York
Google Scholar
Ross, S.M. (1970): Applied Probability Models with Optimization Applications. Holden-Day, San Francisco, CA
Google Scholar
Sladký, K., Sitař, M. (2004): Optimal solutions for undiscounted variance penalized Markov decision chains. In: Dynamic Stochastic Optimization (Marti, K., Ermoliev, Y., Pflug, G., Eds.), LNEMS, Vol. 532, Springer, Berlin, pp. 43–66
Google Scholar
Sobel, M. J. (1982): The variance of discounted Markov decision processes. J. Appl. Probab., 19, 794–802
Article MATH MathSciNet Google Scholar
van Dijk, N. M., Sladky, K. (2004): On total reward variance in continuous-time Markov reward chains. Manuscript
Google Scholar
White, D. J. (1988): Mean, variance and probability criteria in finite Markov decision processes: A review. J. Optim. Theory Appl., 56, 1–29
Article MATH MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Information Theory and Automation, Academy of Sciences of the Czech Republic, Pod vodárenskou věží 4, 182 08, Praha 8, Czech Republic
Karel Sladký
Department of Economic Sciences and Econometrics, University of Amsterdam, Roetersstrat 11, 1018 WB, Amsterdam, The Netherlands
Nico M. van Dijk

Authors

Karel Sladký
View author publications
You can also search for this author in PubMed Google Scholar
Nico M. van Dijk
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Warandelaan 2, 5037 AB, Tilburg, The Netherlands
Hein Fleuren , Dick den Hertog & Peter Kort , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sladký, K., van Dijk, N.M. (2005). Total Reward Variance in Discrete and Continuous Time Markov Chains. In: Fleuren, H., den Hertog, D., Kort, P. (eds) Operations Research Proceedings 2004. Operations Research Proceedings, vol 2004. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-27679-3_40

Download citation

DOI: https://doi.org/10.1007/3-540-27679-3_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24274-1
Online ISBN: 978-3-540-27679-1
eBook Packages: Business and EconomicsBusiness and Management (R0)

Publish with us

Policies and ethics

Total Reward Variance in Discrete and Continuous Time Markov Chains

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Long-Run Rewards for Markov Automata

Estimation for Discrete-time Semi-Markov Reward Processes: Analysis and Inference

On a reward rate estimation for the finite irreducible continuous-time Markov chain

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Total Reward Variance in Discrete and Continuous Time Markov Chains

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Long-Run Rewards for Markov Automata

Estimation for Discrete-time Semi-Markov Reward Processes: Analysis and Inference

On a reward rate estimation for the finite irreducible continuous-time Markov chain

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation