Variance reduction techniques for gradient estimates in reinforcement learning

Evan Greensmith, Peter L. Bartlett, Jonathan Baxter

    Research output: Contribution to journal › Article › peer-review

    363 Citations (SciVal)

    Fingerprint


    Mathematics