Cited By
View all- Asadi KLittman M(2017)An alternative softmax operator for reinforcement learningProceedings of the 34th International Conference on Machine Learning - Volume 7010.5555/3305381.3305407(243-252)Online publication date: 6-Aug-2017
- Krishnamurthy AAgarwal ALangford J(2016)PAC reinforcement learning with rich observationsProceedings of the 30th International Conference on Neural Information Processing Systems10.5555/3157096.3157303(1848-1856)Online publication date: 5-Dec-2016
- Wagner P(2013)Optimistic policy iteration and natural actor-criticProceedings of the 27th International Conference on Neural Information Processing Systems - Volume 110.5555/2999611.2999789(1592-1600)Online publication date: 5-Dec-2013