SMC: Single-Stage Multi-location Convolutional Network for Temporal Action Detection

Cited by: 0
Authors
Liu, Zhikang [1, 2]
Wang, Zilei [1]
Zhao, Yan [1]
Tian, Ye [1]
Affiliations
[1] Univ Sci & Technol China, Dept Automat, Hefei, Anhui, Peoples R China
[2] Megvii Inc Face, Beijing, Peoples R China
Source
COMPUTER VISION - ACCV 2018, PT II | 2019 / Vol. 11362
Funding
National Natural Science Foundation of China;
Keywords
Temporal action detection; End-to-end; Multi-scale; SMC;
DOI
10.1007/978-3-030-20890-5_12
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Temporal action detection in untrimmed videos is an important and challenging visual task. State-of-the-art works typically adopt a multi-stage pipeline, i.e., class-agnostic segment proposal followed by multi-label action classification. This pipeline is computationally slow and hard to optimize, as each stage needs to be trained separately. Moreover, a desirable method should go beyond segment-level localization and make dense predictions with precise boundaries. In this paper we introduce a novel detection model, the Single-stage Multi-location Convolutional Network (SMC), which completely eliminates proposal generation and spatio-temporal feature resampling, and predicts frame-level action locations with class probabilities in a unified end-to-end network. Specifically, we associate a set of multi-scale default locations with each feature map cell in multiple layers, then predict the location offsets relative to the default locations, as well as the action categories. In practice, SMC is faster than existing methods (753 FPS on a Titan X Maxwell GPU) and achieves state-of-the-art performance on THUMOS'14 and MEXaction2.
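The default-location idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the layer sizes, the scale set, the (center, width) parameterization, and the SSD-style offset decoding are all assumptions made for illustration, since the abstract does not specify them.

```python
import math

def default_locations(num_cells, scales):
    """Build (center, width) default segments for one temporal feature map.

    Coordinates are normalized to [0, 1]. Each cell gets one default
    segment per scale (hypothetical layout, not taken from the paper).
    """
    locations = []
    for i in range(num_cells):
        center = (i + 0.5) / num_cells
        for s in scales:
            locations.append((center, s / num_cells))
    return locations

def decode(default, offset):
    """Apply predicted (d_center, d_width) offsets to a default location.

    Assumed SSD-style encoding: the center shifts in units of the default
    width, and the width is rescaled exponentially. Returns (start, end)
    clipped to [0, 1].
    """
    c, w = default
    dc, dw = offset
    center = c + dc * w
    width = w * math.exp(dw)
    start = max(0.0, center - width / 2)
    end = min(1.0, center + width / 2)
    return start, end

# Hypothetical feature maps of decreasing temporal resolution: coarser
# layers yield longer default segments, which gives multi-scale coverage.
layers = {"layer_a": 16, "layer_b": 8, "layer_c": 4}
scales = (0.5, 1.0, 2.0)
all_defaults = [
    loc for n in layers.values() for loc in default_locations(n, scales)
]
```

In a network of this kind, every default location would additionally receive a vector of class scores, so localization and classification come out of a single forward pass with no separate proposal stage.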
Pages: 179 - 195
Number of pages: 17