Sparse Kernel-SARSA(λ) with an Eligibility Trace

Matthew Robards, Peter Sunehag, Scott Sanner, Bhaskara Marthi

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    We introduce the first online kernelized version of SARSA(?) to permit sparsifi- cation for arbitrary ? for 0 = ? = 1; this is possible via a novel kernelization of the eligibility trace that is maintained separately from the kernelized value function. This separation is crucial for preserving the functional structure of the eligibility trace when using sparse kernel projection techniques that are essential for memory efficiency and capacity control. The result is a simple and practical Kernel-SARSA(?) algorithm for general 0 = ? = 1 that is memory-efficient in comparison to standard SARSA(?) (using various basis functions) on a range of domains including a real robotics task running on a Willow Garage PR2 robot.
    Original languageEnglish
    Title of host publicationProceedings of Machine Learning and Knowledge Discovery in Databases - European Conference (ECML PKDD 2011)
    EditorsDimitrios Gunopulos
    Place of PublicationBerlin, Heidelberg
    PublisherSpringer
    Pages16
    EditionPeer Reviewed
    ISBN (Print)9783642237799
    DOIs
    Publication statusPublished - 2011
    EventMachine Learning and Knowledge Discovery in Databases European Conference (ECML PKDD 2011) - Athens Greece
    Duration: 1 Jan 2011 → …

    Conference

    ConferenceMachine Learning and Knowledge Discovery in Databases European Conference (ECML PKDD 2011)
    Period1/01/11 → …
    OtherSeptember 5-9 2011

    Fingerprint

    Dive into the research topics of 'Sparse Kernel-SARSA(λ) with an Eligibility Trace'. Together they form a unique fingerprint.

    Cite this