Abstract
When learning abstract probabilistic decision making models for multi-modal service robots from human demonstrations, alternative courses of events may be missed by human teachers during demonstrations. We present an active model space exploration approach with generalization of observed action effect knowledge leading to interactive requests of new demonstrations to verify generalizations.
At first, the robot observes several user demonstrations of interacting humans, including dialog, object poses and human body movement. Discretization and analysis then lead to a symbolic-causal model of a demonstrated task in the form of a preliminary Partially observable Markov decision process. Based on the transition model generated from demonstrations, new hypotheses of unobserved action effects, generalized transitions, can be derived along with a generalization confidence estimate. To validate generalized transitions which have a strong impact on a decision policy, a request generator proposes further demonstrations to human teachers, used in turn to implicitly verify hypotheses.
The system has been evaluated on a multi-modal service robot with realistic tasks, including furniture manipulation and execution-time interacting humans.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Schmidt-Rohr, S.R., Lösch, M., Jäkel, R., Dillmann, R.: Programming by demonstration of probabilistic decision making on a multi-modal service robot. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, Taipeh, Taiwan (2010)
Cassandra, A.R., Kaelbling, L.P., Littman, M.L.: Acting optimally in partially observable stochastic domains. In: Proceedings of the Twelfth National Conference on Artificial Intelligence (1994)
Kurniawati, H., Hsu, D., Lee, W.: SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces. In: Proc. Robotics: Science and Systems (2008)
Schmidt-Rohr, S.R., Knoop, S., Lösch, M., Dillmann, R.: Bridging the gap of abstraction for probabilistic decision making on a multi-modal service robot. In: RSS, Zürich (2008)
Lösch, M., Schmidt-Rohr, S., Knoop, S., Vacek, S., Dillmann, R.: Feature set selection and optimal classifier for human activity recognition. In: RO-MAN (2007)
Jaekel, R., Schmidt-Rohr, S.R., Loesch, M., Dillmann, R.: Representation and constrained planning of manipulation strategies in the context of programming by demonstration. In: IEEE International Conference on Robotics and Automation, ICRA 2010 (2010)
Pardowitz, M., Knoop, S., Dillmann, R., Zollner, R.: Incremental learning of tasks from user demonstrations, past experiences, and vocal comments. IEEE Trans. on Systems, Man, and Cybernetics (2007)
Grollman, D., Jenkins, O.C.: Incremental learning of subtasks from unsegmented demonstration. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (2010)
Veeraraghavan, H., Veloso, M.: Learning task specific plans through sound and visually interpretable demonstrations. In: IROS (2008)
Ross, S., Chaib-draa, B., Pineau, J.: Bayes-adaptive pomdps. In: NIPS. MIT Press (2007)
Jaulmes, R., Pineau, J., Precup, D.: A formal framework for robot learning and control under model uncertainty. In: 2007 IEEE International Conference on Robotics and Automation (April 2007)
Shon, A.P., Storz, J.J., Rao, R.P.N.: Towards a real-time bayesian imitation system for a humanoid robot. In: 2007 IEEE International Conference on Robotics and Automation, pp. 2847–2852 (2007)
Tenorth, M., Beetz, M.: Priming Transformational Planning with Observations of Human Activities. In: IEEE International Conference on Robotics and Automation, ICRA (2010)
Chernova, S., Veloso, M.: Interactive policy learning through confidence-based autonomy. Journal of Artificial Intelligence Research 34 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Schmidt-Rohr, S.R., Romahn, F., Meissner, P., Jäkel, R., Dillmann, R. (2013). Learning Probabilistic Decision Making by a Service Robot with Generalization of User Demonstrations and Interactive Refinement. In: Lee, S., Yoon, KJ., Lee, J. (eds) Frontiers of Intelligent Autonomous Systems. Studies in Computational Intelligence, vol 466. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35485-4_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-35485-4_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35484-7
Online ISBN: 978-3-642-35485-4
eBook Packages: EngineeringEngineering (R0)