Learning universal multiview dictionary for human action recognition

被引:36
|
作者
Yao, Tingting [1 ,2 ]
Wang, Zhiyong [1 ]
Xie, Zhao [2 ]
Gao, Jun [2 ]
Feng, David Dagan [1 ]
机构
[1] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia
[2] Hefei Univ Technol, Sch Comp & Informat, Hefei, Anhui, Peoples R China
基金
中国国家自然科学基金; 澳大利亚研究理事会;
关键词
Dictionary learning; Sparse coding; Multiview learning; Action recognition; MOTION; PARTS;
D O I
10.1016/j.patcog.2016.11.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, many sparse coding based approaches have been proposed for human action recognition. However, most of them focus on learning a discriminative dictionary without explicitly taking into account the common patterns shared among different action classes. In this paper, we propose a novel discriminative dictionary learning framework by formulating a universal dictionary which consists of a shared sub-dictionary and a set of class-specific sub-dictionaries. As a result, inter-class differences can be better characterized with sparse codes obtained from the class-specific dictionaries. In addition, group sparsity and locality constraints are utilized to preserve therelationship and structure among features. In order to leverage the benefits of multiple descriptors, a dictionary is learned for each view, and the corresponding sparse representations of those descriptors are fused in a low dimensional feature space together with temporal information. The experimental results on three challenging datasets demonstrate that our method is able to achieve better performance than a number of stateof-the-art ones.
引用
收藏
页码:236 / 244
页数:9
相关论文
共 50 条
  • [1] Adaptive Fusion and Category-Level Dictionary Learning Model for Multiview Human Action Recognition
    Gao, Zan
    Xuan, Hai-Zhen
    Zhang, Hua
    Wan, Shaohua
    Choo, Kim-Kwang Raymond
    IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (06) : 9280 - 9293
  • [2] Learning pose dictionary for human action recognition
    Cai, Jia-xin
    Tang, Xin
    Feng, Guo-can
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 381 - 386
  • [3] Learning Zeroth Class Dictionary for Human Action Recognition
    Cai, Jiaxin
    Tang, Xin
    Zhang, Lifang
    Feng, Guocan
    COMPUTER VISION, PT III, 2017, 773 : 651 - 666
  • [4] Multiview Supervised Dictionary Learning in Speech Emotion Recognition
    Gangeh, Mehrdad J.
    Fewzee, Pouria
    Ghodsi, Ali
    Kamel, Mohamed S.
    Karray, Fakhri
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (06) : 1056 - 1068
  • [5] Cascade Dictionary Learning for Action Recognition
    Dong, Jian
    Sun, Changyin
    Mu, Chaoxu
    2014 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE FOR MULTIMEDIA, SIGNAL AND VISION PROCESSING (CIMSIVP), 2014, : 7 - 11
  • [6] Learning Cross-domain Dictionary Pairs for Human Action Recognition
    Zhang, Bingbing
    Shi, Dongcheng
    Ni, Kang
    Liang, Chao
    PROCEEDINGS OF THE 2015 2ND INTERNATIONAL WORKSHOP ON MATERIALS ENGINEERING AND COMPUTER SCIENCES (IWMECS 2015), 2015, 33 : 423 - 428
  • [7] Multiview Representation Learning for Human Activity Recognition
    Hamidi, Massinissa
    Osmani, Aomar
    Rasaq, Lukmon
    Dogan, Gulustan
    Alotaibi, Nouran
    2022 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND VIRTUAL ENVIRONMENTS FOR MEASUREMENT SYSTEMS AND APPLICATIONS (IEEE CIVEMSA 2022), 2022,
  • [8] Discriminative Dictionary Learning for Skeletal Action Recognition
    Xiang, Yang
    Xu, Jinhua
    NEURAL INFORMATION PROCESSING, PT I, 2015, 9489 : 531 - 539
  • [9] Learning a Mid-Level Representation for Multiview Action Recognition
    Liu, Cuiwei
    Li, Zhaokui
    Shi, Xiangbin
    Du, Chong
    ADVANCES IN MULTIMEDIA, 2018, 2018
  • [10] Human action recognition by leaning pose dictionary
    Cai, Jiaxin
    Feng, Guocan
    Tang, Xin
    Luo, Zhihong
    Cai, Jiaxin, 1600, Chinese Optical Society (34):