Temporal-Viewpoint Transportation Plan for Skeletal Few-Shot Action Recognition

Lei Wang, Piotr Koniusz*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2 Citations (Scopus)

Abstract

We propose a Few-shot Learning pipeline for 3D skeleton-based action recognition by Joint tEmporal and cAmera viewpoiNt alIgnmEnt (JEANIE). To factor out misalignment between query and support sequences of 3D body joints, we propose an advanced variant of Dynamic Time Warping which jointly models each smooth path between the query and support frames, simultaneously achieving the best alignment in the temporal and simulated camera viewpoint spaces for end-to-end learning under limited few-shot training data. Sequences are encoded with a temporal block encoder based on Simple Spectral Graph Convolution, a lightweight linear Graph Neural Network backbone. We also include a setting with a transformer. Finally, we propose a similarity-based loss which encourages the alignment of sequences of the same class while preventing the alignment of unrelated sequences. We show state-of-the-art results on NTU-60, NTU-120, Kinetics-skeleton and UWA3D Multiview Activity II.
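The idea of aligning jointly over time and simulated viewpoint can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a single viewpoint axis, a precomputed distance tensor `dist[v, i, j]` (query frame `i` under simulated viewpoint `v` versus support frame `j`), a hard minimum rather than a differentiable soft minimum, and a smoothness rule under which the viewpoint index changes by at most one per step. The function name and all parameters are hypothetical.

```python
import numpy as np


def joint_temporal_viewpoint_dtw(dist):
    """Minimum cost of a path aligning query and support frames while
    moving smoothly across simulated viewpoints (illustrative only).

    dist: array of shape (n_views, n_query, n_support).
    """
    n_views, n_query, n_support = dist.shape
    acc = np.full((n_views, n_query, n_support), np.inf)
    # Assumption: a path may start at any simulated viewpoint.
    acc[:, 0, 0] = dist[:, 0, 0]

    for i in range(n_query):
        for j in range(n_support):
            if i == 0 and j == 0:
                continue
            for v in range(n_views):
                best = np.inf
                # Smoothness assumption: viewpoint index shifts by at most 1.
                for pv in range(max(0, v - 1), min(n_views, v + 2)):
                    if i > 0:
                        best = min(best, acc[pv, i - 1, j])
                    if j > 0:
                        best = min(best, acc[pv, i, j - 1])
                    if i > 0 and j > 0:
                        best = min(best, acc[pv, i - 1, j - 1])
                acc[v, i, j] = dist[v, i, j] + best

    # Assumption: a path may also end at any simulated viewpoint.
    return float(acc[:, -1, -1].min())


# Toy usage with 1D "frames" and a single viewpoint (classic DTW case).
query = np.array([0.0, 1.0, 2.0])
support = np.array([0.0, 1.0, 1.0, 2.0])
toy_dist = np.abs(query[:, None] - support[None, :])[None]
toy_cost = joint_temporal_viewpoint_dtw(toy_dist)
```

With one viewpoint the recursion reduces to ordinary Dynamic Time Warping. Adding viewpoints can only lower the cost, because staying on a single viewpoint remains an admissible path.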

Original language: English
Title of host publication: Computer Vision – ACCV 2022 - 16th Asian Conference on Computer Vision, Proceedings
Editors: Lei Wang, Juergen Gall, Tat-Jun Chin, Imari Sato, Rama Chellappa
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 307-326
Number of pages: 20
Volume: 13844
ISBN (Print): 9783031263156
Publication status: Published - 2 Mar 2023
Event: 16th Asian Conference on Computer Vision, ACCV 2022 - Macao, China
Duration: 4 Dec 2022 – 8 Dec 2022

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 13844 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 16th Asian Conference on Computer Vision, ACCV 2022
Country/Territory: China
City: Macao
Period: 4/12/22 – 8/12/22

Open Access