Abstract
Without a model, the application of reinforcement learning to the control of a dynamic system can be hampered by several shortcomings. In robotic applications where data is gathered in real time, the trials needed to learn a good policy can be costly and time-consuming. In this paper we describe a variable resolution model-based reinforcement learning approach that distributes sample points in the state space in proportion to the effect of actions. In this way the base learner economises on storage while still approximating an effective model. Our approach is conducive to including background knowledge, and we show how different types of background knowledge can be used to speed up learning in this setting. In particular, we show good performance for a weak type of background knowledge by initially overgeneralising local experience.
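As a rough, hypothetical illustration of the idea summarised above (not the implementation described in the paper), the Python sketch below maintains a one-dimensional partition whose cell widths are refined to track the locally observed effect of actions, counts transitions between cells to build a discrete model, and runs value iteration over that model. The class name, the split rule, and the toy dynamics in the usage example are all assumptions made purely for illustration.

import bisect
import random
from collections import defaultdict


class VariableResolutionLearner:
    """Sketch of a 1-D variable-resolution model-based learner."""

    def __init__(self, low, high, actions, gamma=0.95):
        self.bounds = [low, high]          # cell boundaries, refined online
        self.actions = actions
        self.gamma = gamma
        self.n = defaultdict(int)          # (cell, action) -> visit count
        self.trans = defaultdict(lambda: defaultdict(int))  # (cell, action) -> {next cell: count}
        self.reward_sum = defaultdict(float)

    def _index(self, s):
        # Index of the cell containing state s, clamped to the partition.
        return max(0, min(bisect.bisect_right(self.bounds, s) - 1,
                          len(self.bounds) - 2))

    def cell(self, s):
        # A cell is identified by its lower boundary.
        return self.bounds[self._index(s)]

    def observe(self, s, a, r, s2):
        # Refine the partition so that cell width tracks the locally observed
        # effect of the action (a fuller implementation would also
        # redistribute the statistics of a cell when it is split).
        effect = abs(s2 - s)
        i = self._index(s)
        lo, hi = self.bounds[i], self.bounds[i + 1]
        if effect > 0 and hi - lo > 2 * effect:
            bisect.insort(self.bounds, (lo + hi) / 2.0)
        key = (self.cell(s), a)
        self.n[key] += 1
        self.trans[key][self.cell(s2)] += 1
        self.reward_sum[key] += r

    def plan(self, sweeps=100):
        # Value iteration over the discrete model implied by the counts.
        V = defaultdict(float)
        for _ in range(sweeps):
            new_V = defaultdict(float)
            for c in {c for (c, _) in self.n}:
                q = []
                for a in self.actions:
                    key = (c, a)
                    if key not in self.n:
                        continue
                    total = self.n[key]
                    expected_next = sum(cnt / total * V[c2]
                                        for c2, cnt in self.trans[key].items())
                    q.append(self.reward_sum[key] / total
                             + self.gamma * expected_next)
                new_V[c] = max(q)
            V = new_V
        return V


if __name__ == "__main__":
    # Toy dynamics, purely illustrative: the state drifts by the chosen
    # action plus noise, and reward is highest near the origin.
    learner = VariableResolutionLearner(low=-1.0, high=1.0, actions=(-0.1, 0.1))
    s = 0.5
    for _ in range(2000):
        a = random.choice(learner.actions)
        s2 = min(1.0, max(-1.0, s + a + random.gauss(0.0, 0.02)))
        learner.observe(s, a, -abs(s2), s2)
        s = s2
    V = learner.plan()
    print(len(learner.bounds) - 1, "cells learned")

In such a sketch, background knowledge could enter by seeding the transition counts or the initial partition before any real experience is gathered; that is one plausible reading of the abstract's suggestion of initially overgeneralising local experience, not a description of the paper's mechanism.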
Cite this paper
Hengst, B. (2012). On-Line Model-Based Continuous State Reinforcement Learning Using Background Knowledge. In: Thielscher, M., Zhang, D. (eds.) AI 2012: Advances in Artificial Intelligence. Lecture Notes in Computer Science, vol. 7691. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35101-3_72