poster

Monte carlo bayesian hierarchical reinforcement learning

Authors:

Vien Anh Ngo,

Hung Ngo,

Ertel Wolfgang

AAMAS '14: Proceedings of the 2014 international conference on Autonomous agents and multi-agent systems

Pages 1551 - 1552

Published: 05 May 2014

Abstract

In this paper, we propose to use hierarchical action decomposition to make Bayesian model-based reinforcement learning more efficient and feasible in practice. We formulate Bayesian hierarchical reinforcement learning as a partially observable semi-Markov decision process (POSMDP). The main POSMDP task is partitioned into a hierarchy of POSMDP subtasks; lower-level subtasks are solved first, then higher-level ones. For each POSMDP, we sample from a prior belief to build an approximate model and then solve it with the Monte Carlo Value Iteration with Macro-Actions solver. Experimental results show that our algorithm performs significantly better than flat Bayesian reinforcement learning (BRL) in terms of both reward and, especially, solving time, which improves by at least one order of magnitude.
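
The two structural ideas in the abstract, sampling candidate models from a prior and solving subtasks before their parents, can be illustrated with a minimal sketch. This is not the authors' implementation: the Dirichlet prior over transitions, the task names (`root`, `get`, `put`, `navigate`, `pickup`, `putdown`), and both helper functions are assumptions made only for illustration, and the actual POSMDP solver (Monte Carlo Value Iteration with Macro-Actions) is not implemented here.

```python
import random


def sample_transition_model(pseudo_counts, rng):
    """Draw one transition model from independent Dirichlet priors (assumed prior).

    pseudo_counts[s][a] is a list of positive Dirichlet parameters, one per
    next state. A Dirichlet sample is obtained by normalizing Gamma draws.
    """
    model = {}
    for state, by_action in pseudo_counts.items():
        model[state] = {}
        for action, alpha in by_action.items():
            draws = [rng.gammavariate(a, 1.0) for a in alpha]
            total = sum(draws)
            model[state][action] = [d / total for d in draws]
    return model


def bottom_up_order(hierarchy, root):
    """Post-order traversal: every subtask is listed before its parent."""
    order, seen = [], set()

    def visit(task):
        if task in seen:
            return
        seen.add(task)
        for child in hierarchy.get(task, []):
            visit(child)
        order.append(task)

    visit(root)
    return order


# Hypothetical task hierarchy; a shared subtask ("navigate") is visited once.
hierarchy = {
    "root": ["get", "put"],
    "get": ["navigate", "pickup"],
    "put": ["navigate", "putdown"],
}
order = bottom_up_order(hierarchy, "root")

# Hypothetical uniform prior: 2 states, 1 action, 2 possible next states.
prior = {0: {"a": [1.0, 1.0]}, 1: {"a": [1.0, 1.0]}}
rng = random.Random(0)
models = [sample_transition_model(prior, rng) for _ in range(3)]

for task in order:
    # A real system would call a POSMDP solver here on the sampled models.
    print("solve subtask:", task)
```

In this sketch each subtask would be handed the same set of sampled models; how samples are shared or redrawn across subtasks is a design choice the abstract does not specify.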

References

[1]

H. Bai, D. Hsu, W. S. Lee, and N. A. Vien. Monte Carlo value iteration for continuous-state POMDPs. In WAFR, pages 175--191, 2010.

[2]

T. G. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. J. Artif. Intell. Res. (JAIR), 13:227--303, 2000.

[3]

M. Duff. Optimal learning: Computational procedures for Bayes-adaptive Markov decision processes. PhD thesis, University of Massachusetts Amherst, 2002.

[4]

A. Guez, D. Silver, and P. Dayan. Efficient Bayes-adaptive reinforcement learning using sample-based search. In NIPS, pages 1034--1042, 2012.

[5]

Z. W. Lim, D. Hsu, and W. S. Lee. Monte Carlo value iteration with macro-actions. In NIPS, pages 1287--1295, 2011.

[6]

J. Pineau. Tractable Planning Under Uncertainty: Exploiting Structure. PhD thesis, Robotics Institute, Carnegie Mellon University, 2004.

[7]

W. H. Turkett. Robust Multiagent Plan Generation and Execution with Decision Theoretic Planners. PhD thesis, Department of Computer Science and Engineering, University of South Carolina, 1998.

[8]

N. A. Vien and W. Ertel. Monte Carlo tree search for Bayesian reinforcement learning. In ICMLA (1), pages 138--143, 2012.

[9]

Y. Wang, K. S. Won, D. Hsu, and W. S. Lee. Monte Carlo Bayesian reinforcement learning. In ICML, 2012.

Cited By

Li Z, Narayan A, Leong T, Singh S, Markovitch S (2017). An efficient approach to model-based hierarchical reinforcement learning. Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, pages 3583-3589. DOI: 10.5555/3298023.3298089. Online publication date: 4-Feb-2017.
https://dl.acm.org/doi/10.5555/3298023.3298089

Vien N, Ngo H, Lee S, Chung T (2014). Approximate planning for Bayesian hierarchical reinforcement learning. Applied Intelligence, 41:3, pages 808-819. DOI: 10.1007/s10489-014-0565-6. Online publication date: 1-Oct-2014.
https://dl.acm.org/doi/10.1007/s10489-014-0565-6

Index Terms

Monte carlo bayesian hierarchical reinforcement learning
1. Computing methodologies

Published In

AAMAS '14: Proceedings of the 2014 international conference on Autonomous agents and multi-agent systems

May 2014

1774 pages

ISBN:9781450327381

General Chairs: Ana Bazzan (UFRGS, Brazil), Michael Huhns (University of South Carolina, USA)

Program Chairs: Alessio Lomuscio (Imperial College London, UK), Paul Scerri (Carnegie Mellon University, USA)

In-Cooperation

SIGAI: ACM Special Interest Group on Artificial Intelligence

Publisher

International Foundation for Autonomous Agents and Multiagent Systems

Richland, SC

Qualifiers

Poster

Conference

AAMAS '14

AAMAS '14: International conference on Autonomous Agents and Multi-Agent Systems

May 5 - 9, 2014

Paris, France

Acceptance Rates

AAMAS '14 Paper Acceptance Rate 169 of 709 submissions, 24%;

Overall Acceptance Rate 1,155 of 5,036 submissions, 23%

Bibliometrics

Total Citations: 2
Total Downloads: 101
Downloads (Last 12 months): 1
Downloads (Last 6 weeks): 0

Reflects downloads up to 03 Mar 2025
