Class Semantics-based Attention for Action Detection

被引:30
|
作者
Sridhar, Deepak [1 ]
Quader, Niamul [1 ]
Muralidharan, Srikanth [1 ]
Li, Yaoxin [1 ,2 ]
Dai, Peng [1 ]
Lu, Juwei [1 ]
机构
[1] Huawei Noahs Ark Lab, Vancouver, BC, Canada
[2] Univ Waterloo, Waterloo, ON, Canada
关键词
D O I
10.1109/ICCV48922.2021.01348
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Action localization networks are often structured as a feature encoder sub-network and a localization sub-network, where the feature encoder learns to transform an input video to features that are useful for the localization sub-network to generate reliable action proposals. While some of the encoded features may be more useful for generating action proposals, prior action localization approaches do not include any attention mechanism that enables the localization sub-network to attend more to the more important features. In this paper, we propose a novel attention mechanism, the Class Semantics-based Attention (CSA), that learns from the temporal distribution of semantics of action classes present in an input video to find the importance scores of the encoded features, which are used to provide attention to the more useful encoded features. We demonstrate on two popular action detection datasets that incorporating our novel attention mechanism provides considerable performance gains on competitive action detection models (e.g., around 6.2% improvement over BMN action detection baseline to obtain 47.5% mAP on the THUMOS-14 dataset), and a new state-of-the-art of 36.25% mAP on the ActivityNet v1.3 dataset. Further, the CSA localization model family which includes BMN-CSA, was part of the second-placed submission at the 2021 ActivityNet action localization challenge. Our attention mechanism outperforms prior self-attention modules such as the squeeze-and-excitation in action detection task. We also observe that our attention mechanism is complementary to such self-attention modules in that performance improvements are seen when both are used together.
引用
收藏
页码:13719 / 13728
页数:10
相关论文
共 50 条
  • [1] Semantics-based composition of class hierarchies
    Snelting, G
    Tip, F
    [J]. ECOOP 2002 - OBJECT-ORIENTED PROGRAMMING, 2002, 2374 : 562 - 584
  • [2] A semantics-based approach to malware detection
    Preda, Mila Dalla
    Christodorescu, Mihai
    Jha, Somesh
    Debray, Saumya
    [J]. ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2008, 30 (05):
  • [3] A semantics-based approach to Malware detection
    Preda, Mila Dalla
    Christodorescu, Mihai
    Jha, Somesh
    Debray, Saumya
    [J]. ACM SIGPLAN NOTICES, 2007, 42 (01) : 377 - 388
  • [4] A Semantics-Based Approach to Malware Detection
    Preda, Mila Dalla
    Christodorescu, Mihai
    Jha, Somesh
    Debray, Saumya
    [J]. CONFERENCE RECORD OF POPL 2007: THE 34TH ACM SIGPLAN SIGACT SYMPOSIUM ON PRINCIPLES OF PROGAMMING LANGUAGES, 2007, : 377 - 388
  • [5] A New Semantics-Based Android Malware Detection
    Zhang, Xiaohan
    Jin, Zhengping
    [J]. 2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1412 - 1416
  • [6] SPEED: A Semantics-Based Pipeline for Economic Event Detection
    Hogenboom, Frederik
    Hogenboom, Alexander
    Frasincar, Flavius
    Kaymak, Uzay
    van der Meer, Otto
    Schouten, Kim
    Vandic, Damir
    [J]. CONCEPTUAL MODELING - ER 2010, 2010, 6412 : 452 - 457
  • [7] Semantics-based Anomaly Detection of Processes in Linux Containers
    Liang, Hongliang
    Hao, Qichen
    Li, Mingyu
    Zhang, Yini
    [J]. 2016 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS (IIKI), 2016, : 60 - 63
  • [8] Semantics-based Memory Leak Detection for C Programs
    Liu, Zhiqiang
    Xu, Bo
    Liang, Dong
    Liu, Chang
    Jiang, Zejun
    Du, Chenglie
    [J]. 2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 2283 - 2287
  • [9] A Semantics-Based Approach on Binary Function Similarity Detection
    Zhang, Yuntao
    Fang, Binxing
    Xiong, Zehui
    Wang, Yanhao
    Liu, Yuwei
    Zheng, Chao
    Zhang, Qinnan
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (15): : 25910 - 25924
  • [10] Hybrid semantics-based vulnerability detection incorporating a Temporal Convolutional Network and Self-attention Mechanism
    Chen, Jinfu
    Wang, Weijia
    Liu, Bo
    Cai, Saihua
    Towey, Dave
    Wang, Shengran
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2024, 171