Scene adaptive mechanism for action recognition

被引:2
|
作者
Wu, Cong [1 ]
Wu, Xiao-Jun [1 ]
Xu, Tianyang [1 ]
Kittler, Josef [2 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, England
基金
中国国家自然科学基金; 英国工程与自然科学研究理事会;
关键词
Scene adaptive mechanism; Action recognition;
D O I
10.1016/j.cviu.2023.103854
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene knowledge plays an important role in visual analysis. For the task of action recognition, human activities often occur in specific scenes. However, it should be emphasised that the association between actions and scenes is very complex. Simplistic attempts to improve the effectiveness of action recognition by intensifying or suppressing the scene knowledge are unwise. In this article, we tackle this problem by proposing a new action recognition framework based on the Scene Adaptive Mechanism. Specifically, with the Scene Knowledge Modulation module, we can control the feature extractors to either suppress or intensify scene knowledge. And then, through an Adaptive Fusion Layer, the role of scene information in different visual feature sequences can thus be dynamically regulated and fused. The resulting model is abbreviated as SAM-Net. Our method serves as a pluggable module, capable of integration into other backbones to further enhance their performance. We perform extensive experiments on three large datasets: Something-Something V1&V2 and Kinetics-400. The quantitative and qualitative experimental results demonstrate the effectiveness of SAM-Net, with a great improvement in performance compared to the baseline methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Recognition and action for scene understanding
    Marfil, Rebeca
    Dias, Jorge
    Escolano, Francisco
    [J]. NEUROCOMPUTING, 2015, 161 : 1 - 2
  • [2] Adaptive pattern recognition system for scene segmentation
    Kubota, T
    Huntsberger, T
    [J]. OPTICAL ENGINEERING, 1998, 37 (03) : 829 - 835
  • [3] Will Scene Information Help Realistic Action Recognition?
    Chen, Xian-gan
    Liu, Juan
    Liu, Haihua
    [J]. PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 4538 - 4541
  • [4] Compact and adaptive spatial pyramids for scene recognition
    Elfiky, Noha M.
    Gonzalez, Jordi
    Xavier Roca, F.
    [J]. IMAGE AND VISION COMPUTING, 2012, 30 (08) : 492 - 500
  • [5] Adaptive local recalibration network for scene recognition
    Wang, Jiale
    Zou, Lian
    Fan, Cien
    Jiang, Hao
    Chen, Liqiong
    Cheng, Mofan
    Yu, Hu
    Liu, Yifeng
    [J]. APPLIED INTELLIGENCE, 2023, 53 (23) : 27935 - 27950
  • [6] Adaptive Adversarial Attack on Scene Text Recognition
    Yuan, Xiaoyong
    He, Pan
    Li, Xiaolin
    Wu, Dapeng
    [J]. IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2020, : 358 - 363
  • [7] Adaptive local recalibration network for scene recognition
    Jiale Wang
    Lian Zou
    Cien Fan
    Hao Jiang
    Liqiong Chen
    Mofan Cheng
    Hu Yu
    Yifeng Liu
    [J]. Applied Intelligence, 2023, 53 : 27935 - 27950
  • [8] Human action recognition based on scene semantics
    Tao Hu
    Xinyan Zhu
    Wei Guo
    Shaohua Wang
    Jianfeng Zhu
    [J]. Multimedia Tools and Applications, 2019, 78 : 28515 - 28536
  • [9] Human action recognition based on scene semantics
    Hu, Tao
    Zhu, Xinyan
    Guo, Wei
    Wang, Shaohua
    Zhu, Jianfeng
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (20) : 28515 - 28536
  • [10] Action Recognition with Adaptive RBFNN
    Aphaipanan, Srisuda
    Kidjaidure, Yuttana
    [J]. 2014 FOURTH JOINT INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONIC AND ELECTRICAL ENGINEERING (JICTEE 2014), 2014,