A stochastic conditioning scheme for diverse human motion prediction

Sadegh Aliakbarian, Fatemeh Sadat Saleh, Mathieu Salzmann, Lars Petersson, Stephen Gould

    Research output: Contribution to journalConference articlepeer-review

    86 Citations (Scopus)

    Abstract

    Human motion prediction, the task of predicting future 3D human poses given a sequence of observed ones, has been mostly treated as a deterministic problem. However, human motion is a stochastic process: Given an observed sequence of poses, multiple future motions are plausible. Existing approaches to modeling this stochasticity typically combine a random noise vector with information about the previous poses. This combination, however, is done in a deterministic manner, which gives the network the flexibility to learn to ignore the random noise. Alternatively, in this paper, we propose to stochastically combine the root of variations with previous pose information, so as to force the model to take the noise into account. We exploit this idea for motion prediction by incorporating it into a recurrent encoder-decoder network with a conditional variational autoencoder block that learns to exploit the perturbations. Our experiments on two large-scale motion prediction datasets demonstrate that our model yields high-quality pose sequences that are much more diverse than those from state-of-the-art stochastic motion prediction techniques.

    Original languageEnglish
    Article number9157240
    Pages (from-to)5222-5231
    Number of pages10
    JournalProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
    DOIs
    Publication statusPublished - 2020
    Event2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020 - Virtual, Online, United States
    Duration: 14 Jun 202019 Jun 2020

    Fingerprint

    Dive into the research topics of 'A stochastic conditioning scheme for diverse human motion prediction'. Together they form a unique fingerprint.

    Cite this