SMC: Single-Stage Multi-location Convolutional Network for Temporal Action Detection

被引:0
|
作者
Liu, Zhikang [1 ,2 ]
Wang, Zilei [1 ]
Zhao, Yan [1 ]
Tian, Ye [1 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei, Anhui, Peoples R China
[2] Megvii Inc Face, Beijing, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Temporal action detection; End-to-end; Multi-scale; SMC;
D O I
10.1007/978-3-030-20890-5_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Temporal action detection in untrimmed videos is an important and challenging visual task. State-of-the-art works always adopt a multi-stage pipeline, i.e., a class-agnostic segment proposal followed by a multi-label action classification. This pipeline is computationally slow and hard to optimize as each stage need be trained separately. Moreover, a desirable method should go beyond segment-level localization and make dense predictions with precise boundaries. We introduce a novel detection model in this paper, Single-stage Multi-location Convolutional Network (SMC), which completely eliminates the proposal generation and spatio-temporal feature resampling, and predicts frame-level action locations with class probabilities in a unified end-to-end network. Specifically, we associate a set of multi-scale default locations with each feature map cell in multiple layers, then predict the location offsets to the default locations, as well as action categories. SMC in practice is faster than the existing methods (753 FPS on a Titan X Maxwell GPU) and achieves state-of-the-art performance on THUMOS'14 and MEXaction2.
引用
收藏
页码:179 / 195
页数:17
相关论文
共 50 条
  • [1] An improved single-stage convolutional neural network for rail transit obstacle detection
    Qin, Yuliang
    He, Deqiang
    Sun, Haimeng
    Liu, Qi
    Li, Xianwang
    Ren, Chonghui
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (12)
  • [2] MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation
    Abu Farha, Yazan
    Gall, Juergen
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3570 - 3579
  • [3] Boundary graph convolutional network for temporal action detection
    Chen, Yaosen
    Guo, Bing
    Shen, Yan
    Wang, Wei
    Lu, Weichen
    Suo, Xinhua
    IMAGE AND VISION COMPUTING, 2021, 109
  • [4] Temporal graph convolutional network for multi-agent reinforcement learning of action detection
    Wang, Liangliang
    Liu, Jiayao
    Wang, Ke
    Ge, Lianzheng
    Liang, Peidong
    APPLIED SOFT COMPUTING, 2024, 163
  • [5] SINGLE-STAGE LICENSING IN ACTION
    BRADBURY, RB
    WALKER, LP
    POWER ENGINEERING, 1984, 88 (09) : 53 - 54
  • [6] VHub: Single-stage virtual network mapping through hub location
    Shanbhag, Shashank
    Kandoor, Arun Reddy
    Wang, Cong
    Mettu, Ramgopal
    Wolf, Tilman
    COMPUTER NETWORKS, 2015, 77 : 169 - 180
  • [7] MS-TCN plus plus : Multi-Stage Temporal Convolutional Network for Action Segmentation
    Li, Shijie
    Abu Farha, Yazan
    Liu, Yun
    Cheng, Ming-Ming
    Gall, Juergen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 6647 - 6658
  • [8] Extending single-location CAD/CAM for multi-location collaboration
    Kao, YC
    Lin, GCI
    ADVANCES IN CONCURRENT ENGINEERING: CE96: COLLABORATIVE WORK ORGANIZATION AND MANAGEMENT PRODUCT AND PROCESS INTEGRATION PLANNING AND SCHEDULING INFORMATION AND PROCESS MODELING DATA EXCHANGE PRACTICAL APPLICATIONS, 1996, 96 : 40 - 47
  • [9] Spacecraft Homography Pose Estimation with Single-Stage Deep Convolutional Neural Network
    Chen, Shengpeng
    Yang, Wenyi
    Wang, Wei
    Mai, Jianting
    Liang, Jian
    Zhang, Xiaohu
    SENSORS, 2024, 24 (06)
  • [10] Pose Anchor: A Single-Stage Hand Keypoint Detection Network
    Li, Yuan
    Wang, Xinggang
    Liu, Wenyu
    Feng, Bin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) : 2104 - 2113