An Off-Policy Trust Region Policy Optimization Method With Monotonic Improvement Guarantee for Deep Reinforcement Learning

An Off-Policy Trust Region Policy Optimization Method With Monotonic Improvement Guarantee for Deep Reinforcement Learning | IEEE Journals & Magazine | IEEE Xplore

More Web Proxy on the site http://driver.im/

IEEE Account

Purchase Details

Profile Information

Need Help?