Scene adaptive mechanism for action recognition

Cited by: 2
Authors
Wu, Cong [1]
Wu, Xiao-Jun [1]
Xu, Tianyang [1]
Kittler, Josef [2]
Affiliations
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, England
Funding
National Natural Science Foundation of China; Engineering and Physical Sciences Research Council (UK)
Keywords
Scene adaptive mechanism; Action recognition
DOI
10.1016/j.cviu.2023.103854
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Scene knowledge plays an important role in visual analysis, and in action recognition human activities often occur in specific scenes. However, the association between actions and scenes is complex, so simply intensifying or suppressing scene knowledge across the board is an unreliable way to improve recognition. In this article, we tackle this problem by proposing a new action recognition framework based on the Scene Adaptive Mechanism. Specifically, the Scene Knowledge Modulation module controls the feature extractors so that they either suppress or intensify scene knowledge. An Adaptive Fusion Layer then dynamically regulates the role of scene information in the different visual feature sequences and fuses them. The resulting model is abbreviated as SAM-Net. Our method serves as a pluggable module that can be integrated into other backbones to further enhance their performance. We perform extensive experiments on three large datasets: Something-Something V1, Something-Something V2, and Kinetics-400. The quantitative and qualitative results demonstrate the effectiveness of SAM-Net, which improves markedly on the baseline methods.
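The abstract does not give the internals of the Adaptive Fusion Layer, so the following is only an illustrative sketch of the general idea of adaptively weighting a scene-suppressed and a scene-intensified feature vector. The function names `softmax` and `adaptive_fuse`, the softmax gating, and the externally supplied `gate_logits` are all assumptions made for illustration and are not taken from the paper; in a trained model the gate values would presumably be predicted from the features themselves.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def adaptive_fuse(scene_suppressed, scene_intensified, gate_logits):
    """Blend two feature vectors with softmax-normalised gate weights.

    Hypothetical stand-in for an adaptive fusion step: gate_logits holds
    two scores, one per branch, and is passed in by the caller here.
    """
    if len(scene_suppressed) != len(scene_intensified):
        raise ValueError("feature vectors must have the same length")
    if len(gate_logits) != 2:
        raise ValueError("expected one gate logit per branch")
    w_sup, w_int = softmax(gate_logits)
    return [
        w_sup * a + w_int * b
        for a, b in zip(scene_suppressed, scene_intensified)
    ]


if __name__ == "__main__":
    # Equal logits weight both branches equally.
    print(adaptive_fuse([1.0, 2.0], [3.0, 4.0], [0.0, 0.0]))
```

With equal gate logits the two branches are averaged; a strongly positive first logit makes the output follow the scene-suppressed branch almost entirely, which is the sense in which the contribution of scene information can vary per input.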
Pages: 10