FEXNet: Foreground Extraction Network for Human Action Recognition

被引:29
|
作者
Shen, Zhongwei [1 ]
Wu, Xiao-Jun [1 ]
Xu, Tianyang [1 ,2 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
基金
中国国家自然科学基金;
关键词
Convolutional neural networks; Spatiotemporal phenomena; Feature extraction; Three-dimensional displays; Solid modeling; Iron; Image recognition; Foreground-related features; spatiotemporal modeling; action recognition;
D O I
10.1109/TCSVT.2021.3103677
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As most human actions in video sequences embody the continuous interactions between foregrounds rather than the background scene, it is significant to disentangle these foregrounds from the background for advanced action recognition systems. In this paper, therefore, we propose a Foreground EXtraction (FEX) block to explicitly model the foreground clues to achieve effective management of action subjects. In particular, the designed FEX block contains two components. The first part is a Foreground Enhancement (FE) module, which highlights the potential feature channels related to the action attributes, providing channel-level refinement for the following spatiotemporal modeling. The second phase is a Scene Segregation (SS) module, which splits feature maps into foreground and background. Specifically, a temporal model with dynamic enhancement is constructed for the foreground part, reflecting the essential nature of the action category. While the background is modeled using simple spatial convolutions, mapping the inputs to the consistent feature space. The FEX blocks can be inserted into existing 2D CNNs (denoted as FEXNet) for spatiotemporal modeling, concentrating on the foreground clues for effective action inference. Our experiments performed on Something-Something V1, V2 and Kinetics400 verify the effectiveness of the proposed method.
引用
收藏
页码:3141 / 3151
页数:11
相关论文
共 50 条
  • [21] Motion Based Foreground Detection and Poselet Motion Features for Action Recognition
    Kraft, Erwin
    Brox, Thomas
    COMPUTER VISION - ACCV 2014, PT V, 2015, 9007 : 350 - 365
  • [22] Segmentation and selective feature extraction for human detection to the direction of action recognition
    Konwar L.
    Talukdar A.K.
    Sarma K.K.
    Saikia N.
    Rajbangshi S.C.
    International Journal of Circuits, Systems and Signal Processing, 2021, 15 : 1371 - 1386
  • [23] End-to-end temporal attention extraction and human action recognition
    Zhang, Hong
    Xin, Miao
    Wang, Shuhang
    Yang, Yifan
    Zhang, Lei
    Wang, Helong
    MACHINE VISION AND APPLICATIONS, 2018, 29 (07) : 1127 - 1142
  • [24] Recognition of human pointing action based on color extraction and stereo tracking
    Mori, T
    Yokokawa, T
    Sato, T
    INTELLIGENT AUTONOMOUS SYSTEMS: IAS-5, 1998, : 93 - 100
  • [25] Human Action Recognition Using a Semantic-Probabilistic Network
    Kovalenko, Mykyta
    Antoshchuk, Svetlana
    Sieck, Juergen
    2015 INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN NETWORKS AND COMPUTER COMMUNICATIONS (ETNCC), 2015, : 67 - 72
  • [26] Human Tumble Action Recognition Using Spiking Neuron Network
    Li, Yu
    Wang, Ke
    Huang, MinFeng
    Li, RuiFeng
    Gao, TianZe
    Wu, Jun
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 5309 - 5313
  • [27] Applied Human Action Recognition Network Based on SNSP Features
    M Shujah Islam
    Khush Bakhat
    Rashid Khan
    Nuzhat Naqvi
    M Mattah Islam
    Zhongfu Ye
    Neural Processing Letters, 2022, 54 : 1481 - 1494
  • [28] Applied Human Action Recognition Network Based on SNSP Features
    Islam, M. Shujah
    Bakhat, Khush
    Khan, Rashid
    Naqvi, Nuzhat
    Islam, M. Mattah
    Ye, Zhongfu
    NEURAL PROCESSING LETTERS, 2022, 54 (03) : 1481 - 1494
  • [29] Secure human action recognition by encrypted neural network inference
    Kim, Miran
    Jiang, Xiaoqian
    Lauter, Kristin
    Ismayilzada, Elkhan
    Shams, Shayan
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [30] Human action recognition using a modified convolutional neural network
    Kim, Ho-Joon
    Lee, Joseph S.
    Yang, Hyun-Seung
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 2, PROCEEDINGS, 2007, 4492 : 715 - +