Cited By
View all- Matloff NPooch UPegden CBlais GSolomon S(1984)Use of lattice structures for reduction of simulation run timeProceedings of the 16th conference on Winter simulation10.5555/800013.809461(202-205)Online publication date: 1-Jan-1984
Markov chain Monte Carlo (MCMC) has been widely used to approximate the expectation of the statistic of a given probability measure $\pi$ on a finite set, and the asymptotic variance is a typical approach to evaluating the performance of MCMC methods. In ...
Policy gradient, which makes use of Monte Carlo method to get an unbiased estimation of the parameter gradients, has been widely used in reinforcement learning. One key issue in policy gradient is reducing the variance of the estimation. From the ...
IEEE Press
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in