Learning Probabilistic Decision Making by a Service Robot with Generalization of User Demonstrations and Interactive Refinement

Sven R. Schmidt-Rohr⁴,
Fabian Romahn⁴,
Pascal Meissner⁴,
Rainer Jäkel⁴ &
Rüdiger Dillmann⁴

Part of the book series: Studies in Computational Intelligence (SCI, volume 466)

Abstract

When abstract probabilistic decision-making models for multi-modal service robots are learned from human demonstrations, human teachers may miss alternative courses of events. We present an active model-space exploration approach that generalizes observed knowledge about action effects and interactively requests new demonstrations to verify those generalizations.

First, the robot observes several user demonstrations by interacting humans, including dialog, object poses, and human body movement. Discretization and analysis then yield a symbolic-causal model of the demonstrated task in the form of a preliminary partially observable Markov decision process (POMDP). From the transition model generated from the demonstrations, new hypotheses about unobserved action effects, called generalized transitions, are derived together with a generalization confidence estimate. To validate generalized transitions that have a strong impact on the decision policy, a request generator proposes further demonstrations to the human teachers; these demonstrations in turn implicitly verify the hypotheses.
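
The abstract does not say how generalized transitions or their confidence are computed. Purely as an illustration, the following minimal Python sketch assumes that states are tuples of symbolic features, that an action effect is the set of features that changed, and that confidence is a smoothed relative frequency of the most common observed effect. The demonstration data, the function names, and the confidence heuristic are all hypothetical and are not taken from the chapter.

```python
from collections import Counter

# Hypothetical demonstrations: (state, action, next_state). Each state is a
# tuple of symbolic features (robot_location, cup_location).
demos = [
    (("kitchen", "table"), "pick_cup", ("kitchen", "gripper")),
    (("kitchen", "table"), "pick_cup", ("kitchen", "gripper")),
    (("kitchen", "table"), "pick_cup", ("kitchen", "table")),  # failed grasp
    (("kitchen", "gripper"), "goto_living", ("living", "gripper")),
]


def effect(state, next_state):
    """Describe an action effect as the changed features: ((index, value), ...)."""
    return tuple(
        (i, new) for i, (old, new) in enumerate(zip(state, next_state)) if old != new
    )


def apply_effect(state, eff):
    """Return the state obtained by overwriting the features listed in eff."""
    new_state = list(state)
    for i, value in eff:
        new_state[i] = value
    return tuple(new_state)


def generalize(demos, candidate_states):
    """Propose transitions for (state, action) pairs that were never demonstrated.

    Returns a list of (state, action, predicted_next_state, confidence).
    The confidence n / (total + 1) is an assumed heuristic, not the chapter's.
    """
    observed = {(s, a) for s, a, _ in demos}
    effects = {}  # action -> Counter of observed effects
    for s, a, s_next in demos:
        effects.setdefault(a, Counter())[effect(s, s_next)] += 1

    hypotheses = []
    for a, counter in effects.items():
        total = sum(counter.values())
        eff, n = counter.most_common(1)[0]
        for s in candidate_states:
            if (s, a) not in observed:
                hypotheses.append((s, a, apply_effect(s, eff), n / (total + 1)))
    return hypotheses


# A simple request generator: ask for demonstrations of the least certain
# hypotheses first (policy impact, which the abstract mentions, is ignored here).
hypotheses = generalize(demos, candidate_states=[("living", "table")])
for state, action, predicted, confidence in sorted(hypotheses, key=lambda h: h[3]):
    print(f"Please demonstrate {action!r} in {state}: expect {predicted} "
          f"(confidence {confidence:.2f})")
```

Ranking purely by confidence is a simplification: the approach described above prioritizes hypotheses that strongly affect the decision policy, which would require solving the POMDP with and without each hypothesis.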

The system has been evaluated on a multi-modal service robot with realistic tasks, including furniture manipulation and humans interacting at execution time.

Author information

Authors and Affiliations

Institute for Anthropomatics (IFA), Karlsruhe Institute of Technology, Karlsruhe, Germany
Sven R. Schmidt-Rohr, Fabian Romahn, Pascal Meissner, Rainer Jäkel & Rüdiger Dillmann

Corresponding author

Correspondence to Sven R. Schmidt-Rohr.

Editor information

Editors and Affiliations

School of Information and Communication Engineering, Sungkyunkwan University, 300 Cheoncheon-dong, Gyeonggi-do, 440-746, Republic of Korea
Sukhan Lee
Department of Aerospace Information Engineering, Konkuk University, 1 Hwayang-dong, Seoul, 143-701, Republic of Korea
Kwang-Joon Yoon
School of Electrical Engineering, Pusan National University, San 40, Jangcheon-dong, Pusan, 609-735, Republic of Korea
Jangmyung Lee

About this chapter

Cite this chapter

Schmidt-Rohr, S.R., Romahn, F., Meissner, P., Jäkel, R., Dillmann, R. (2013). Learning Probabilistic Decision Making by a Service Robot with Generalization of User Demonstrations and Interactive Refinement. In: Lee, S., Yoon, KJ., Lee, J. (eds) Frontiers of Intelligent Autonomous Systems. Studies in Computational Intelligence, vol 466. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35485-4_26

DOI: https://doi.org/10.1007/978-3-642-35485-4_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35484-7
Online ISBN: 978-3-642-35485-4
eBook Packages: Engineering, Engineering (R0)
