Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks

Abstract
Approximate Policy Iteration (API) is a reinforcement learning paradigm capable of solving high-dimensional, continuous control problems. We propose to exploit API for the closed-loop learning of mappings from images to actions. This approach requires a family of function approximators that can represent real-valued functions of visual percepts. For this purpose, we use Regression Extra-Trees, a fast yet accurate and versatile machine learning algorithm. The inputs to the Extra-Trees consist of a set of visual features that summarize the informative patterns in the visual signal. We also show how to parallelize the Extra-Tree learning process, a reduction of the computational expense that is often essential in visual tasks. Experimental results on real-world images indicate that the combination of API with Extra-Trees is a promising framework for the interactive learning of visual tasks.
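To make the proposed combination concrete, the following is a minimal sketch, not the authors' implementation: an approximate policy iteration loop in which the state-action value function is fit by an ensemble of extremely randomized trees, with scikit-learn's ExtraTreesRegressor standing in for Regression Extra-Trees. The transition batch, the dimensionality of the visual feature vector, the discount factor, and the discrete action set are all hypothetical placeholders, and only a single evaluation backup is performed per iteration; the n_jobs parameter illustrates the kind of parallelism of tree construction that the paper exploits.

```python
# Sketch only: API with Extra-Trees as the Q-function approximator.
# All problem constants below are assumptions, not values from the paper.
import numpy as np
from sklearn.ensemble import ExtraTreesRegressor

GAMMA = 0.9        # discount factor (assumed)
N_ACTIONS = 4      # hypothetical discrete action set
N_FEATURES = 16    # dimensionality of the visual feature vector (assumed)
N = 500            # size of the hypothetical batch of transitions
rng = np.random.default_rng(0)

# Placeholder batch (phi(s), a, r, phi(s')), where phi maps an image
# to a vector of visual features; here it is random stand-in data.
S = rng.normal(size=(N, N_FEATURES))
A = rng.integers(0, N_ACTIONS, size=N)
R = rng.normal(size=N)
S2 = rng.normal(size=(N, N_FEATURES))

def encode(states, actions):
    """Stack state features with a one-hot encoding of the action."""
    onehot = np.eye(N_ACTIONS)[actions]
    return np.hstack([states, onehot])

policy = rng.integers(0, N_ACTIONS, size=N)  # initial (random) policy at s'

for iteration in range(10):
    # Policy evaluation: regress one-step targets r + gamma * Q(s', pi(s')).
    # n_jobs=-1 builds the trees of the ensemble in parallel.
    model = ExtraTreesRegressor(n_estimators=50, n_jobs=-1, random_state=0)
    if iteration == 0:
        targets = R  # no previous Q-estimate yet
    else:
        targets = R + GAMMA * model_prev.predict(encode(S2, policy))
    model.fit(encode(S, A), targets)

    # Policy improvement: act greedily with respect to the fitted Q.
    q_all = np.stack(
        [model.predict(encode(S2, np.full(N, a, dtype=int)))
         for a in range(N_ACTIONS)],
        axis=1,
    )
    policy = q_all.argmax(axis=1)
    model_prev = model
```

The one-hot action encoding is one simple design choice for handling a discrete action set with a single regressor; fitting one tree ensemble per action would be an equally plausible alternative under the same assumptions.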