Weakly Supervised Temporal Action Detection with Shot-Based Temporal Pooling Network

被引:5
|
作者
Su, Haisheng [1 ]
Zhao, Xu [1 ]
Lin, Tianwei [1 ]
Fei, Haiping [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai, Peoples R China
[2] Ind Internet Innovat Ctr Shanghai Co Ltd, Shanghai, Peoples R China
关键词
Temporal action detection; Weak supervision; Shot-based sampling; Temporal pooling network; Class-specific;
D O I
10.1007/978-3-030-04212-7_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised temporal action detection in untrimmed videos is an important yet challenging task, where only video-level class labels are available for temporally locating actions in the videos during training. In this paper, we propose a novel architecture for this task. Specifically, we put forward an effective shot-based sampling method aiming at generating a more simplified but representative feature sequence for action detection, instead of using uniform sampling which causes extremely irrelevant frames retained. Furthermore, in order to distinguish action instances existing in the videos, we design a multi-stage Temporal Pooling Network (TPN) for the purposes of predicting video categories and localizing class-specific action instances respectively. Experiments conducted on THUMOS14 dataset confirm that our method outperforms other state-of-the-art weakly supervised approaches.
引用
收藏
页码:426 / 436
页数:11
相关论文
共 50 条
  • [21] Transferable Knowledge-Based Multi-Granularity Fusion Network for Weakly Supervised Temporal Action Detection
    Su, Haisheng
    Zhao, Xu
    Lin, Tianwei
    Liu, Shuming
    Hu, Zhilan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1503 - 1515
  • [22] Atomic-action-based Contrastive Network for Weakly Supervised Temporal Language Grounding
    Wu, Hongzhou
    Lyu, Yifan
    Shen, Xingyu
    Zhao, Xuechen
    Wang, Mengzhu
    Zhang, Xiang
    Luo, Zhigang
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1523 - 1528
  • [23] Temporal Feature Enhancement Dilated Convolution Network for Weakly-supervised Temporal Action Localization
    Zhou, Jianxiong
    Wu, Ying
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 6017 - 6026
  • [24] Weakly supervised spatial–temporal attention network driven by tracking and consistency loss for action detection
    Jinlei Zhu
    Houjin Chen
    Pan Pan
    Jia Sun
    EURASIP Journal on Image and Video Processing, 2022
  • [25] Multi-Scale Structure-Aware Network for Weakly Supervised Temporal Action Detection
    Yang, Wenfei
    Zhang, Tianzhu
    Mao, Zhendong
    Zhang, Yongdong
    Tian, Qi
    Wu, Feng
    IEEE Transactions on Image Processing, 2021, 30 : 5848 - 5861
  • [26] Background Suppression Network for Weakly-Supervised Temporal Action Localization
    Lee, Pilhyeon
    Uh, Youngjung
    Byun, Hyeran
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11320 - 11327
  • [27] Multi-Scale Structure-Aware Network for Weakly Supervised Temporal Action Detection
    Yang, Wenfei
    Zhang, Tianzhu
    Mao, Zhendong
    Zhang, Yongdong
    Tian, Qi
    Wu, Feng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5848 - 5861
  • [28] Deep snippet selective network for weakly supervised temporal action localization
    Ge, Yongxin
    Qin, Xiaolei
    Yang, Dan
    Jagersand, Martin
    PATTERN RECOGNITION, 2021, 110
  • [29] Feature Matching Network for Weakly-Supervised Temporal Action Localization
    Dou, Peng
    Zhou, Wei
    Liao, Zhongke
    Hu, Haifeng
    PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 459 - 471
  • [30] Cascaded Pyramid Mining Network for Weakly Supervised Temporal Action Localization
    Su, Haisheng
    Zhao, Xu
    Lin, Tianwei
    COMPUTER VISION - ACCV 2018, PT II, 2019, 11362 : 558 - 574