Variance reduction techniques for gradient estimates in reinforcement learning

Evan Greensmith, Peter Bartlett, Jon Baxter

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    5 Citations (Scopus)
    Original languageEnglish
    Title of host publicationAdvances in Neural Information Processing Systems 14
    EditorsTG Dietterich, S Becker Z Ghahramani
    Place of PublicationCambridge
    PublisherMIT Press
    Pages1507-1514
    EditionPeer Reviewed
    ISBN (Print)0262042088
    Publication statusPublished - 2002
    EventConference on Advances in Neural Information Processing Systems (NIPS 2002) - Cambridge USA, United States
    Duration: 1 Jan 2002 → …

    Conference

    ConferenceConference on Advances in Neural Information Processing Systems (NIPS 2002)
    Country/TerritoryUnited States
    Period1/01/02 → …
    OtherSeptember 2 2002

    Cite this