Human Pose Forecasting via Deep Markov Models

Sam Toyer, Anoop Cherian, Tengda Han, Stephen Gould

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    30 Citations (Scopus)

    Abstract

    Human pose forecasting is an important problem in computer vision with applications to human-robot interaction, visual surveillance, and autonomous driving. Usually, forecasting algorithms use 3D skeleton sequences and are trained to forecast for a few milliseconds into the future. Long-range forecasting is challenging due to the difficulty of estimating how long a person continues an activity. To this end, our contributions are threefold: (i) we propose a generative framework for poses using variational autoencoders based on Deep Markov Models (DMMs); (ii) we evaluate our pose forecasts using a pose-based action classifier, which we argue better reflects the subjective quality of pose forecasts than distance in coordinate space; (iii) last, for evaluation of the new model, we introduce a 480,000-frame video dataset called Ikea Furniture Assembly (Ikea FA), which depicts humans repeatedly assembling and disassembling furniture. We demonstrate promising results for our approach on both Ikea FA and the existing NTU RGB+D dataset.

    Original languageEnglish
    Title of host publicationDICTA 2017 - 2017 International Conference on Digital Image Computing
    Subtitle of host publicationTechniques and Applications
    EditorsYi Guo, Manzur Murshed, Zhiyong Wang, David Dagan Feng, Hongdong Li, Weidong Tom Cai, Junbin Gao
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages1-8
    Number of pages8
    ISBN (Electronic)9781538628393
    DOIs
    Publication statusPublished - 19 Dec 2017
    Event2017 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2017 - Sydney, Australia
    Duration: 29 Nov 20171 Dec 2017

    Publication series

    NameDICTA 2017 - 2017 International Conference on Digital Image Computing: Techniques and Applications
    Volume2017-December

    Conference

    Conference2017 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2017
    Country/TerritoryAustralia
    CitySydney
    Period29/11/171/12/17

    Fingerprint

    Dive into the research topics of 'Human Pose Forecasting via Deep Markov Models'. Together they form a unique fingerprint.

    Cite this