Variance reduction techniques for gradient estimates in reinforcement learning
Evan Greensmith, Peter L. Bartlett, Jonathan Baxter
Research output: Contribution to journal › Article › peer-review
361
Citations
(Scopus)