Action recognition from mutually incoherent pose bases in static image

被引:2
|
作者
Qian, Yinzhong [1 ,2 ,3 ]
Chen, Wenbin [4 ]
Shen, I-fan [1 ,2 ]
机构
[1] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Shanghai, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
[3] Changzhou Coll Informat Technol, Changzhou, Peoples R China
[4] Fudan Univ, Sch Math Sci, Shanghai, Peoples R China
关键词
image representation; matrix algebra; support vector machines; neural nets; pose estimation; action recognition; static image; mutually incoherent pose bases; implicit poselet co-occurrences; dictionary training; sparse linear pose bases combination; sparse representation; SVM; overcomplete matrix; cumulative coherence; objective function; pose representation; local pose feature; deep convolutional neural network features; SPARSE; REPRESENTATION; DICTIONARIES;
D O I
10.1049/iet-cvi.2017.0233
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Action recognition in static image is challenging. The authors propose mutually incoherent pose bases which are implicit poselet co-occurrences and are learned by dictionary training to describe body pose. Poselets in a pose basis are not constrained in space and quantity, thus pose basis can describe body pose more flexibly than k-poselet. In their method, body pose in an image is represented by a sparse linear combination of pose bases because pose in an action varies while each image only captures a snapshot from a single viewpoint. In dictionary training, the challenge is how to stabilise the sparse representation which is the input of Support Vector Machine (SVM) for action recognition, because the original pose signal is ambiguous while dictionary is an over complete matrix. Their solution is to add cumulative coherence as penalty in objective function and induce pose bases become mutually incoherent. They evaluate the method on two popular datasets and experiment results show the pose representation has encouraging performance in action recognition. Furthermore, they empirically exploit the complementary role of the local pose feature with deep convolutional neural network features from holistic image. Experiment results demonstrate aggressive performance improvement by concatenating the two features.
引用
收藏
页码:233 / 240
页数:8
相关论文
共 50 条
  • [1] Mutually Incoherent Pose Bases for Action Recognition
    Qian, Yinzhong
    Chen, Wenbin
    Shen, I-fan
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 823 - 828
  • [2] Action Recognition from Pose Signature in Static Image
    Qian, Yinzhong
    Chen, Wenbin
    Shen, I-Fan
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2016, 30 (03)
  • [3] Mutually reinforcing motion-pose framework for pose invariant action recognition
    Ramanathan, Manoj
    Yau, Wei-Yun
    Thalmann, Nadia Magnenat
    Teoh, Eam Khwang
    [J]. INTERNATIONAL JOURNAL OF BIOMETRICS, 2019, 11 (02) : 113 - 147
  • [4] OFPI: Optical Flow Pose Image for Action Recognition
    Chen, Dong
    Zhang, Tao
    Zhou, Peng
    Yan, Chenyang
    Li, Chuanqi
    [J]. MATHEMATICS, 2023, 11 (06)
  • [5] JOINT POSE ESTIMATION AND ACTION RECOGNITION IN IMAGE GRAPHS
    Raja, Kumar
    Laptev, Ivan
    Perez, Patrick
    Oisel, Lionel
    [J]. 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 25 - 28
  • [6] Action Recognition from a Single Web Image Based on an Ensemble of Pose Experts
    Zhang, Peihao
    Tan, Xiaoyang
    Jin, Xin
    [J]. COMPUTER VISION - ACCV 2014, PT I, 2015, 9003 : 477 - 493
  • [7] Human Body Pose Distance Image Analysis for Action Recognition
    Verma, Amit
    Meenpal, Toshanlal
    Acharya, Bibhudendra
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (07)
  • [8] Image-based Pose Representation for Action Recognition and Hand Gesture Recognition
    Lin, Zeyi
    Zhang, Wei
    Deng, Xiaoming
    Ma, Cuixia
    Wang, Hongan
    [J]. 2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 532 - 539
  • [9] Exploiting Motion Information from Unlabeled Videos for Static Image Action Recognition
    Zhang, Yiyi
    Li Niu
    Pan, Ziqi
    Luo, Meichao
    Zhang, Jianfu
    Cheng, Dawei
    Zhang, Liqing
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12918 - 12925
  • [10] Joint Dynamic Pose Image and Space Time Reversal for Human Action Recognition from Videos
    Liu, Mengyuan
    Meng, Fanyang
    Chen, Chen
    Wu, Songtao
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8762 - 8769