Cross-View Action Recognition Based on Hierarchical View-Shared Dictionary Learning

被引:9
|
作者
Zhang, Chengkun [1 ]
Zheng, Huicheng [1 ,3 ]
Lai, Jianhuang [1 ,2 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Guangdong, Peoples R China
[2] Guangdong Key Lab Informat Secur Technol, Guangzhou 510006, Guangdong, Peoples R China
[3] Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou 510006, Guangdong, Peoples R China
来源
IEEE ACCESS | 2018年 / 6卷
基金
中国国家自然科学基金;
关键词
Cross-view; action recognition; hierarchical transfer learning; feature space transformation; dictionary learning;
D O I
10.1109/ACCESS.2018.2815611
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recognizing human actions across different views is challenging, since observations of the same action often vary greatly with viewpoints. To solve this problem, most existing methods explore the cross-view feature transfer relationship at video level only, ignoring the sequential composition of action segments therein. In this paper, we propose a novel hierarchical transfer framework, which is based on an action temporal-structure model that contains sequential relationship between action segments at multiple timescales. Thus, it can capture the view invariance of the sequential relationship of segment-level transfer. Additionally, we observe that the original feature distributions under different views differ greatly, leading to view-dependent representations irrelevant to the intrinsic structure of actions. Thus, at each level of the proposed framework, we transform the original feature spaces of different views to a view-shared low dimensional feature space, and jointly learn a dictionary in this space for these views. This view-shared dictionary captures the common structure of action data across the views and can represent the action segments in a way robust to view changes. Moreover, the proposed method can be kernelized easily, and operate in both unsupervised and supervised cross-view scenarios. Extensive experimental results on the IXMAS and WVU datasets demonstrate superiority of the proposed method over state-of-the-art methods.
引用
收藏
页码:16855 / 16868
页数:14
相关论文
共 50 条
  • [41] ASYMMETRIC CROSS-VIEW DICTIONARY LEARNING FOR PERSON RE-IDENTIFICATION
    Jiang, Minyue
    Yuan, Yuan
    Wang, Qi
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1228 - 1232
  • [42] Cross-View Gait Recognition Based on Feature Fusion
    Hong, Qi
    Wang, Zhongyuan
    Chen, Jianyu
    Huang, Baojin
    [J]. 2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 640 - 646
  • [43] Novel Cross-View Human Action Model Recognition Based on the Powerful View-Invariant Features Technique
    Mambou, Sebastien
    Krejcar, Ondrej
    Kuca, Kamil
    Selamat, Ali
    [J]. FUTURE INTERNET, 2018, 10 (09)
  • [44] Global-Local Cross-View Fisher Discrimination for View-invariant Action Recognition
    Gao, Lingling
    Ji, Yanli
    Yang, Yang
    Shen, Heng Tao
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5255 - 5264
  • [45] View Synthesis with Scene Recognition for Cross-View Image Localization
    Lee, Uddom
    Jiang, Peng
    Wu, Hongyi
    Xin, Chunsheng
    [J]. FUTURE INTERNET, 2023, 15 (04):
  • [46] CROSS-VIEW ACTION RECOGNITION VIA LOW-RANK BASED DOMAIN ADAPTATION
    Tseng, Wen-Sheng
    Hsu, Lun-Kai
    Kang, Li-Wei
    Wang, Yu-Chiang Frank
    [J]. 2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 3244 - 3248
  • [47] Cross-View Action Recognition from Temporal Self-similarities
    Junejo, Imran N.
    Dexter, Emilie
    Laptev, Ivan
    Perez, Patrick
    [J]. COMPUTER VISION - ECCV 2008, PT II, PROCEEDINGS, 2008, 5303 : 293 - 306
  • [48] Learning Discriminative Transferable Sparse Coding for Cross-View Action Recognition in Wireless Sensor Networks
    Zhang, Zhong
    Liu, Shuang
    [J]. INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2015,
  • [49] Cross-View Action Recognition Using Contextual Maximum Margin Clustering
    Zhang, Zhong
    Wang, Chunheng
    Xiao, Baihua
    Zhou, Wen
    Liu, Shuang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2014, 24 (10) : 1663 - 1668
  • [50] Cross-view gait recognition based on a restrictive triplet network
    Tong, Sui-bing
    Fu, Yu-zhuo
    Ling, He-fei
    [J]. PATTERN RECOGNITION LETTERS, 2019, 125 : 212 - 219