Unsupervised Hierarchical Dynamic Parsing and Encoding for Action Recognition

被引:16
|
作者
Su, Bing [1 ]
Zhou, Jiahuan [2 ]
Ding, Xiaoqing [3 ]
Wu, Ying [2 ]
机构
[1] Chinese Acad Sci, Inst Software, Sci & Technol Integrated Informat Syst Lab, Beijing 100190, Peoples R China
[2] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL 60208 USA
[3] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Action recognition; temporal clustering; hierarchical modeling; dynamic encoding; ENSEMBLE; VECTOR; MODELS; PARTS;
D O I
10.1109/TIP.2017.2745212
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generally, the evolution of an action is not uniform across the video, but exhibits quite complex rhythms and non-stationary dynamics. To model such non-uniform temporal dynamics, in this paper, we describe a novel hierarchical dynamic parsing and encoding method to capture both the locally smooth dynamics and globally drastic dynamic changes. It parses the dynamics of an action into different layers and encodes such multi-layer temporal information into a joint representation for action recognition. At the first layer, the action sequence is parsed in an unsupervised manner into several smooth-changing stages corresponding to different key poses or temporal structures by temporal clustering. The dynamics within each stage are encoded by mean-pooling or rank-pooling. At the second layer, the temporal information of the ordered dynamics extracted from the previous layer is encoded again by rank-pooling to form the overall representation. Extensive experiments on a gesture action data set (Chalearn Gesture) and three generic action data sets (Olympic Sports, Hollywood2, and UCF101) have demonstrated the effectiveness of the proposed method.
引用
收藏
页码:5784 / 5799
页数:16
相关论文
共 50 条
  • [31] UnsuParse: Unsupervised Parsing with unsupervised Part of Speech tagging
    Haenig, Christian
    Bordag, Stefan
    Quasthoff, Uwe
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1109 - 1114
  • [32] Hierarchical linear dynamical systems for unsupervised musical note recognition
    Cinar, Goktug T.
    Sequeira, Pedro M. N.
    Principe, Jose C.
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2018, 355 (04): : 1638 - 1662
  • [33] Deep Temporal Feature Encoding for Action Recognition
    Li, Lin
    Zhang, Zhaoxiang
    Huang, Yan
    Wang, Liang
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1109 - 1114
  • [34] STM: SpatioTemporal and Motion Encoding for Action Recognition
    Jiang, Boyuan
    Wang, MengMeng
    Gan, Weihao
    Wu, Wei
    Yan, Junjie
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2000 - 2009
  • [35] Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation
    Sarfraz, M. Saquib
    Murray, Naila
    Sharma, Vivek
    Diba, Ali
    van Gool, Luc
    Stiefelhagen, Rainer
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11220 - 11229
  • [36] Vertex Feature Encoding and Hierarchical Temporal Modeling in a Spatio-Temporal Graph Convolutional Network for Action Recognition
    Papadopoulos, Konstantinos
    Ghorbel, Enjie
    Aouada, Djamila
    Ottersten, Bjoern
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 452 - 458
  • [37] Using a Product Manifold distance for unsupervised action recognition
    O'Hara, Stephen
    Lui, Yui Man
    Draper, Bruce A.
    IMAGE AND VISION COMPUTING, 2012, 30 (03) : 206 - 216
  • [38] UNSUPERVISED MOTION REPRESENTATION ENHANCED NETWORK FOR ACTION RECOGNITION
    Yang, Xiaohang
    Kong, Lingtong
    Yang, Jie
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2445 - 2449
  • [39] Unsupervised open-world human action recognition
    Matheus Gutoski
    André Eugenio Lazzaretti
    Heitor Silvério Lopes
    Pattern Analysis and Applications, 2023, 26 : 1753 - 1770
  • [40] Unsupervised open-world human action recognition
    Gutoski, Matheus
    Lazzaretti, Andre Eugenio
    Lopes, Heitor Silverio
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (04) : 1753 - 1770