Self-bounded Prediction suffix tree via approximate string matching

Dongwoo Kim*, Christian Walder

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    Prediction suffix trees (PST) provide an effective tool for sequence modelling and prediction. Current prediction techniques for PSTs rely on exact matching between the suffix of the current sequence and the previously observed sequence. We present a provably correct algorithm for learning a PST with approximate suffix matching by relaxing the exact matching condition. We then present a self-bounded enhancement of our algorithm where the depth of suffix tree grows automatically in response to the model performance on a training sequence. Through experiments on synthetic datasets as well as three real-world datasets, we show that the approximate matching PST results in better predictive performance than the other variants of PST.

    Original languageEnglish
    Title of host publication35th International Conference on Machine Learning, ICML 2018
    EditorsJennifer Dy, Andreas Krause
    PublisherInternational Machine Learning Society (IMLS)
    Pages4172-4185
    Number of pages14
    ISBN (Electronic)9781510867963
    Publication statusPublished - 2018
    Event35th International Conference on Machine Learning, ICML 2018 - Stockholm, Sweden
    Duration: 10 Jul 201815 Jul 2018

    Publication series

    Name35th International Conference on Machine Learning, ICML 2018
    Volume6

    Conference

    Conference35th International Conference on Machine Learning, ICML 2018
    Country/TerritorySweden
    CityStockholm
    Period10/07/1815/07/18

    Fingerprint

    Dive into the research topics of 'Self-bounded Prediction suffix tree via approximate string matching'. Together they form a unique fingerprint.

    Cite this