Discovering Motion Primitives for Unsupervised Grouping and One-Shot Learning of Human Actions, Gestures, and Expressions

被引:86
|
作者
Yang, Yang [1 ]
Saleemi, Imran [1 ]
Shah, Mubarak [1 ]
机构
[1] Univ Cent Florida, Dept Elect Engn & Comp Sci EECS, Comp Vis Lab, Orlando, FL 32816 USA
关键词
Human actions; one-shot learning; unsupervised clustering; gestures; facial expressions; action representation; action recognition; motion primitives; motion patterns; histogram of motion primitives; motion primitives strings; Hidden Markov model; RECOGNITION;
D O I
10.1109/TPAMI.2012.253
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel representation of articulated human actions and gestures and facial expressions. The main goals of the proposed approach are: 1) to enable recognition using very few examples, i.e., one or k-shot learning, and 2) meaningful organization of unlabeled datasets by unsupervised clustering. Our proposed representation is obtained by automatically discovering high-level subactions or motion primitives, by hierarchical clustering of observed optical flow in four-dimensional, spatial, and motion flow space. The completely unsupervised proposed method, in contrast to state-of-the-art representations like bag of video words, provides a meaningful representation conducive to visual interpretation and textual labeling. Each primitive action depicts an atomic subaction, like directional motion of limb or torso, and is represented by a mixture of four-dimensional Gaussian distributions. For one-shot and k-shot learning, the sequence of primitive labels discovered in a test video are labeled using KL divergence, and can then be represented as a string and matched against similar strings of training videos. The same sequence can also be collapsed into a histogram of primitives or be used to learn a Hidden Markov model to represent classes. We have performed extensive experiments on recognition by one and k-shot learning as well as unsupervised action clustering on six human actions and gesture datasets, a composite dataset, and a database of facial expressions. These experiments confirm the validity and discriminative nature of the proposed representation.
引用
收藏
页码:1635 / 1648
页数:14
相关论文
共 32 条
  • [11] One-Shot Learning for Deformable Medical Image Registration and Periodic Motion Tracking
    Fechter, Tobias
    Baltas, Dimos
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (07) : 2506 - 2517
  • [12] Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent
    Fu, Yuqian
    Wang, Chengrong
    Fu, Yanwei
    Wang, Yu-Xiong
    Bai, Cong
    Xue, Xiangyang
    Jiang, Yu-Gang
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 411 - 419
  • [13] Improved GLOH Approach for One-Shot Learning Human Gesture Recognition
    Karn, Nabin Kumar
    Jiang, Feng
    BIOMETRIC RECOGNITION, 2016, 9967 : 441 - 452
  • [14] One-shot learning of human–robot handovers with triadic interaction meshes
    David Vogt
    Simon Stepputtis
    Bernhard Jung
    Heni Ben Amor
    Autonomous Robots, 2018, 42 : 1053 - 1065
  • [15] A methodology for gestural interaction relying on user-defined gestures sets following a one-shot learning approach
    Cespedes-Hernandez, David
    Manuel Gonzalez-Calleros, Juan
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (05) : 5001 - 5010
  • [16] Self-supervision & meta-learning for one-shot unsupervised cross-domain detection
    Borlino, Francesco Cappio
    Polizzotto, Salvatore
    Caputo, Barbara
    Tommasi, Tatiana
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 223
  • [17] One-shot learning of human-robot handovers with triadic interaction meshes
    Vogt, David
    Stepputtis, Simon
    Jung, Bernhard
    Ben Amor, Heni
    AUTONOMOUS ROBOTS, 2018, 42 (05) : 1053 - 1065
  • [18] One shot learning human actions recognition using key poses
    Zou, W. H.
    Li, S. G.
    Lei, Z.
    Dai, N.
    INFORMATION SYSTEMS AND COMPUTING TECHNOLOGY, 2013, : 15 - 24
  • [19] Unsupervised hyperspectral stimulated Raman microscopy image enhancement: denoising and segmentation via one-shot deep learning
    Abdolghader, Pedram
    Ridsdale, Andrew
    Grammatikopoulos, Tassos
    Resch, Gavin
    Legare, Francois
    Stolow, Albert
    Pegoraro, Adrian F.
    Tamblyn, Isaac
    OPTICS EXPRESS, 2021, 29 (21) : 34205 - 34219
  • [20] One-Shot Learning of Human Activity With an MAP Adapted GMM and Simplex-HMM
    Rodriguez, Mario
    Orrite, Carlos
    Medrano, Carlos
    Makris, Dimitrios
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (07) : 1769 - 1780