Cross-View Action Recognition Based on Hierarchical View-Shared Dictionary Learning

被引:9
|
作者
Zhang, Chengkun [1 ]
Zheng, Huicheng [1 ,3 ]
Lai, Jianhuang [1 ,2 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Guangdong, Peoples R China
[2] Guangdong Key Lab Informat Secur Technol, Guangzhou 510006, Guangdong, Peoples R China
[3] Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou 510006, Guangdong, Peoples R China
来源
IEEE ACCESS | 2018年 / 6卷
基金
中国国家自然科学基金;
关键词
Cross-view; action recognition; hierarchical transfer learning; feature space transformation; dictionary learning;
D O I
10.1109/ACCESS.2018.2815611
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recognizing human actions across different views is challenging, since observations of the same action often vary greatly with viewpoints. To solve this problem, most existing methods explore the cross-view feature transfer relationship at video level only, ignoring the sequential composition of action segments therein. In this paper, we propose a novel hierarchical transfer framework, which is based on an action temporal-structure model that contains sequential relationship between action segments at multiple timescales. Thus, it can capture the view invariance of the sequential relationship of segment-level transfer. Additionally, we observe that the original feature distributions under different views differ greatly, leading to view-dependent representations irrelevant to the intrinsic structure of actions. Thus, at each level of the proposed framework, we transform the original feature spaces of different views to a view-shared low dimensional feature space, and jointly learn a dictionary in this space for these views. This view-shared dictionary captures the common structure of action data across the views and can represent the action segments in a way robust to view changes. Moreover, the proposed method can be kernelized easily, and operate in both unsupervised and supervised cross-view scenarios. Extensive experimental results on the IXMAS and WVU datasets demonstrate superiority of the proposed method over state-of-the-art methods.
引用
收藏
页码:16855 / 16868
页数:14
相关论文
共 50 条
  • [31] Learning a Non-linear Knowledge Transfer Model for Cross-View Action Recognition
    Rahmani, Hossein
    Mian, Ajmal
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 2458 - 2466
  • [32] Cross-view Action Recognition over Heterogeneous Feature Spaces
    Wu, Xinxiao
    Wang, Han
    Liu, Cuiwei
    Jia, Yunde
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 609 - 616
  • [33] Cross-view action recognition with small-scale datasets
    Goyal, Gaurvi
    Noceti, Nicoletta
    Odone, Francesca
    [J]. IMAGE AND VISION COMPUTING, 2022, 120
  • [34] Histogram of Oriented Principal Components for Cross-View Action Recognition
    Rahmani, Hossein
    Mahmood, Arif
    Du Huynh
    Mian, Ajmal
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (12) : 2430 - 2443
  • [35] Multi-layer representation for cross-view action recognition
    Liu, Zhigang
    Wu, Yin
    Yin, Ziyang
    [J]. INFORMATION SCIENCES, 2024, 659
  • [36] Cross-View Action Recognition Over Heterogeneous Feature Spaces
    Wu, Xinxiao
    Wang, Han
    Liu, Cuiwei
    Jia, Yunde
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (11) : 4096 - 4108
  • [37] Topic-Based Knowledge Transfer Algorithm for Cross-View Action Recognition
    Chen, Changhong
    Yang, Shunqing
    Gan, Zongliang
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (03) : 614 - 617
  • [38] Cross-View Action Recognition via a Continuous Virtual Path
    Zhang, Zhong
    Wang, Chunheng
    Xiao, Baihua
    Zhou, Wen
    Liu, Shuang
    Shi, Cunzhao
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 2690 - 2697
  • [39] Learning Representations From Skeletal Self-Similarities for Cross-View Action Recognition
    Shao, Zhanpeng
    Li, Youfu
    Zhang, Hong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (01) : 160 - 174
  • [40] Cross-View Projective Dictionary Learning for Person Re-identification
    Li, Sheng
    Shao, Ming
    Fu, Yun
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 2155 - 2161