Abstract
A novel quantum reinforcement learning is proposed through combining quantum theory and reinforcement learning. Inspired by state superposition principle, a framework of state value update algorithm is introduced. The state/action value is represented with quantum state and the probability of action eigenvalue is denoted by probability amplitude, which is updated according to rewards. This approach makes a good tradeoff between exploration and exploitation using probability and can speed up learning. The results of simulated experiment verified its effectiveness and superiority.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Sutton, R., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
Bertsekas, D.P., Tsitsiklis, J.N.: Neuro-Dynamic Programming. Athena Scientific, Belmont (1996)
Sutton, R.: Learning to Predict by the Methods of Temporal Difference. Mach. Learn. 3, 9–44 (1988)
Watkins, C., Dayan, P.: Q-learning. Mach. Learn. 8, 279–292 (1992)
Beom, H.R., Cho, H.S.: A Sensor-based Navigation for a Mobile Robot Using Fuzzy Logic and Reinforcement Learning. IEEE Trans. Syst. Man. Cyc. 25, 464–477 (1995)
Smart, W.D., Kaelbling, L.P.: Effective Reinforcement Learning for Mobile Robots. In: Proceedings of the IEEE Int. Conf. on Robotic. Autom. (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dong, D., Chen, C., Chen, Z. (2005). Quantum Reinforcement Learning. In: Wang, L., Chen, K., Ong, Y.S. (eds) Advances in Natural Computation. ICNC 2005. Lecture Notes in Computer Science, vol 3611. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11539117_97
Download citation
DOI: https://doi.org/10.1007/11539117_97
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28325-6
Online ISBN: 978-3-540-31858-3
eBook Packages: Computer ScienceComputer Science (R0)