Variance reduction techniques for gradient estimates in reinforcement learning

Evan Greensmith, Peter L. Bartlett, Jonathan Baxter

    Research output: Contribution to journal › Article › peer-review

    361 Citations (Scopus)
