HUMAN-AWARE COARSE-TO-FINE ONLINE ACTION DETECTION

被引:1
|
作者
Yang, Zichen [1 ]
Huang, Di [1 ]
Qin, Jie [2 ]
Wang, Yunhong [1 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, IRIP Lab, Beijing, Peoples R China
[2] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
基金
中国国家自然科学基金;
关键词
action detection; temporal action localization; online learning;
D O I
10.1109/ICASSP39728.2021.9413368
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this work, we propose a two-stage framework to efficiently and effectively detect actions on-the-fly. An action location network (ALN) is developed in the first stage to judge whether the current frame is action-related, while the second stage involves an action classification network (ACN) to further identify the action category. In this way, irrelevant negative frames are quickly discarded and actions are detected as early as they occur. Moreover, we highlight human areas at both the stages by respectively incorporating a human detector and a human mask layer. As a result, more accurate spatial-temporal windows of actions are detected, based on which more robust features are extracted for classification. Experimental results on two popular benchmarks demonstrate the superior performance of the proposed approach.
引用
收藏
页码:2455 / 2459
页数:5
相关论文
共 50 条
  • [31] Context-aware coarse-to-fine network for single image desnowing
    Cheng, Yunrui
    Ren, Hao
    Zhang, Rui
    Lu, Hong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 55903 - 55920
  • [32] A Coarse-to-Fine Method for Infrared Small Target Detection
    Yao, Shoukui
    Chang, Yi
    Qin, Xiaojuan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (02) : 256 - 260
  • [33] Recursive Coarse-to-Fine Localization for Fast Object Detection
    Pedersoli, Marco
    Gonzalez, Jordi
    Bagdanov, Andrew D.
    Villanueva, Juan J.
    COMPUTER VISION - ECCV 2010, PT VI, 2010, 6316 : 280 - +
  • [34] Salient object detection using coarse-to-fine processing
    Zhou, Qiangqiang
    Zhang, Lin
    Zhao, Weidong
    Liu, Xianhui
    Chen, Yufei
    Wang, Zhicheng
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2017, 34 (03) : 370 - 383
  • [35] COARSE-TO-FINE AGGREGATION FOR CROSS-GRANULARITY ACTION RECOGNITION
    Mazari, Ahmed
    Sahbi, Hichem
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1541 - 1545
  • [36] Inferring Context and Goals for Online Human-Aware Planning
    Kockemann, Uwe
    Pecora, Federico
    Karlsson, Lars
    2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, : 550 - 557
  • [37] Coarse-to-Fine Satellite Images Change Detection Framework via Boundary-Aware Attentive Network
    Zhang, Yi
    Zhang, Shizhou
    Li, Ying
    Zhang, Yanning
    SENSORS, 2020, 20 (23) : 1 - 21
  • [38] Distraction-Aware Feature Learning for Human Attribute Recognition via Coarse-to-Fine Attention Mechanism
    Wu, Mingda
    Huang, Di
    Guo, Yuanfang
    Wang, Yunhong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12394 - 12401
  • [39] Coarse-to-fine manifold learning
    Castro, R
    Willett, R
    Nowak, R
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 992 - 995
  • [40] 'Coarse-to-fine' cyclopean processing
    Popple, AV
    Findlay, JM
    PERCEPTION, 1999, 28 (02) : 155 - 165