Loss Guided Activation for Action Recognition in Still Images

被引:16
|
作者
Liu, Lu [1 ]
Tan, Robby T. [1 ,2 ]
You, Shaodi [3 ,4 ]
机构
[1] Natl Univ Singapore, ECE Dept, Singapore, Singapore
[2] Yale NUS Coll, Singapore, Singapore
[3] CSIRO, DATA61, Canberra, ACT, Australia
[4] Australian Natl Univ, Canberra, ACT, Australia
来源
基金
新加坡国家研究基金会;
关键词
mage action recognition; Loss guided activation; Human-mask loss;
D O I
10.1007/978-3-030-20873-8_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One significant problem of deep-learning based human action recognition is that it can be easily misled by the presence of irrelevant objects or backgrounds. Existing methods commonly address this problem by employing bounding boxes on the target humans as part of the input, in both training and testing stages. This requirement of bounding boxes as part of the input is needed to enable the methods to ignore irrelevant contexts and extract only human features. However, we consider this solution is inefficient, since the bounding boxes might not be available. Hence, instead of using a person bounding box as an input, we introduce a human-mask loss to automatically guide the activations of the feature maps to the target human who is performing the action, and hence suppress the activations of misleading contexts. We propose a multi-task deep learning method that jointly predicts the human action class and human location heatmap. Extensive experiments demonstrate our approach is more robust compared to the baseline methods under the presence of irrelevant misleading contexts. Our method achieves 94.06% and 40.65% (in terms of mAP) on Stanford40 and MPII dataset respectively, which are 3.14% and 12.6% relative improvements over the best results reported in the literature, and thus set new state-of-the-art results. Additionally, unlike some existing methods, we eliminate the requirement of using a person bounding box as an input during testing.
引用
收藏
页码:152 / 167
页数:16
相关论文
共 50 条
  • [1] Coloring Action Recognition in Still Images
    Khan, Fahad Shahbaz
    Anwer, Rao Muhammad
    van de Weijer, Joost
    Bagdanov, Andrew D.
    Lopez, Antonio M.
    Felsberg, Michael
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 105 (03) : 205 - 221
  • [2] Coloring Action Recognition in Still Images
    Fahad Shahbaz Khan
    Rao Muhammad Anwer
    Joost van de Weijer
    Andrew D. Bagdanov
    Antonio M. Lopez
    Michael Felsberg
    [J]. International Journal of Computer Vision, 2013, 105 : 205 - 221
  • [3] Understanding action recognition in still images
    Girish, Deeptha
    Singh, Vineeta
    Ralescu, Anca
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 1523 - 1529
  • [4] Temporal Hallucinating for Action Recognition with Few Still Images
    Wang, Yali
    Zhou, Lei
    Qiao, Yu
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5314 - 5322
  • [5] Context Enhancement Methodology for Action Recognition in Still Images
    He, Jiarong
    Wu, Wei
    Li, Yuxing
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT I, 2023, 14254 : 112 - 122
  • [6] Action Recognition in Still Images With Minimum Annotation Efforts
    Zhang, Yu
    Cheng, Li
    Wu, Jianxin
    Cai, Jianfei
    Do, Minh N.
    Lu, Jiangbo
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (11) : 5479 - 5490
  • [7] Multibranch Attention Networks for Action Recognition in Still Images
    Yan, Shiyang
    Smith, Jeremy S.
    Lu, Wenjin
    Zhang, Bailing
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2018, 10 (04) : 1116 - 1125
  • [8] Knowledge memorization and generation for action recognition in still images *
    Dong, Jian
    Yang, Wankou
    Yao, Yazhou
    Porikli, Fatih
    [J]. PATTERN RECOGNITION, 2021, 120
  • [9] Reassessing Hierarchical Representation for Action Recognition in Still Images
    Li, Rui
    Liu, Zhenyu
    Tan, Jianrong
    [J]. IEEE ACCESS, 2018, 6 : 61386 - 61400
  • [10] Learning Hierarchical Context for Action Recognition in Still Images
    Zhu, Haisheng
    Hu, Jian-Fang
    Zheng, Wei-Shi
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 67 - 77