OFPI: Optical Flow Pose Image for Action Recognition

被引:2
|
作者
Chen, Dong [1 ,2 ]
Zhang, Tao [2 ]
Zhou, Peng [1 ]
Yan, Chenyang [3 ]
Li, Chuanqi [1 ,2 ]
机构
[1] Guangxi Normal Univ, Coll Comp Sci & Engn, Guilin 541004, Peoples R China
[2] Nanning Normal Univ, Coll Phys & Elect Engn, Nanning 530001, Peoples R China
[3] Kanazawa Univ, Div Elect Engn & Comp Sci, Kanazawa 9201192, Japan
关键词
action recognition; optical flow pose image; skeletal data; transformer;
D O I
10.3390/math11061451
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Most approaches to action recognition based on pseudo-images involve encoding skeletal data into RGB-like image representations. This approach cannot fully exploit the kinematic features and structural information of human poses, and convolutional neural network (CNN) models that process pseudo-images lack a global field of view and cannot completely extract action features from pseudo-images. In this paper, we propose a novel pose-based action representation method called Optical Flow Pose Image (OFPI) in order to fully capitalize on the spatial and temporal information of skeletal data. Specifically, in the proposed method, an advanced pose estimator collects skeletal data before locating the target person and then extracts skeletal data utilizing a human tracking algorithm. The OFPI representation is obtained by aggregating these skeletal data over time. To test the superiority of OFPI and investigate the significance of the model having a global field of view, we trained a simple CNN model and a transformer-based model, respectively. Both models achieved superior outcomes. Because of the global field of view, especially in the transformer-based model, the OFPI-based representation achieved 98.3% and 94.2% accuracy on the KTH and JHMDB datasets, respectively. Compared with other advanced pose representation methods and multi-stream methods, OFPI achieved state-of-the-art performance on the JHMDB dataset, indicating the utility and potential of this algorithm for skeleton-based action recognition research.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] JOINT POSE ESTIMATION AND ACTION RECOGNITION IN IMAGE GRAPHS
    Raja, Kumar
    Laptev, Ivan
    Perez, Patrick
    Oisel, Lionel
    [J]. 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 25 - 28
  • [2] Action Recognition from Pose Signature in Static Image
    Qian, Yinzhong
    Chen, Wenbin
    Shen, I-Fan
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2016, 30 (03)
  • [3] Human Body Pose Distance Image Analysis for Action Recognition
    Verma, Amit
    Meenpal, Toshanlal
    Acharya, Bibhudendra
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (07)
  • [4] Image-based Pose Representation for Action Recognition and Hand Gesture Recognition
    Lin, Zeyi
    Zhang, Wei
    Deng, Xiaoming
    Ma, Cuixia
    Wang, Hongan
    [J]. 2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 532 - 539
  • [5] Temporal Hockey Action Recognition via Pose and Optical Flows
    Cai, Zixi
    Neher, Helmut
    Vats, Kanav
    Clausi, David A.
    Zelek, John
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2543 - 2552
  • [6] Optical flow-motion history image (OF-MHI) for action recognition
    Du-Ming Tsai
    Wei-Yao Chiu
    Men-Han Lee
    [J]. Signal, Image and Video Processing, 2015, 9 : 1897 - 1906
  • [7] Optical flow-motion history image (OF-MHI) for action recognition
    Tsai, Du-Ming
    Chiu, Wei-Yao
    Lee, Men-Han
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2015, 9 (08) : 1897 - 1906
  • [8] Action recognition from mutually incoherent pose bases in static image
    Qian, Yinzhong
    Chen, Wenbin
    Shen, I-fan
    [J]. IET COMPUTER VISION, 2018, 12 (03) : 233 - 240
  • [9] Double-Stream Convolutional Networks with Sequential Optical Flow Image for Action Recognition
    Li Qinghui
    Li Aihua
    Wang Tao
    Cui Zhigao
    [J]. ACTA OPTICA SINICA, 2018, 38 (06)
  • [10] On the Combination of IMU and Optical Flow for Action Recognition
    Alhersh, Taha
    Stuckenschmidt, Heiner
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2019, : 17 - 21