Weakly Supervised Temporal Action Detection with Shot-Based Temporal Pooling Network

被引:5
|
作者
Su, Haisheng [1 ]
Zhao, Xu [1 ]
Lin, Tianwei [1 ]
Fei, Haiping [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai, Peoples R China
[2] Ind Internet Innovat Ctr Shanghai Co Ltd, Shanghai, Peoples R China
关键词
Temporal action detection; Weak supervision; Shot-based sampling; Temporal pooling network; Class-specific;
D O I
10.1007/978-3-030-04212-7_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised temporal action detection in untrimmed videos is an important yet challenging task, where only video-level class labels are available for temporally locating actions in the videos during training. In this paper, we propose a novel architecture for this task. Specifically, we put forward an effective shot-based sampling method aiming at generating a more simplified but representative feature sequence for action detection, instead of using uniform sampling which causes extremely irrelevant frames retained. Furthermore, in order to distinguish action instances existing in the videos, we design a multi-stage Temporal Pooling Network (TPN) for the purposes of predicting video categories and localizing class-specific action instances respectively. Experiments conducted on THUMOS14 dataset confirm that our method outperforms other state-of-the-art weakly supervised approaches.
引用
收藏
页码:426 / 436
页数:11
相关论文
共 50 条
  • [1] Weakly Supervised Action Localization by Sparse Temporal Pooling Network
    Phuc Nguyen
    Liu, Ting
    Prasad, Gautam
    Han, Bohyung
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6752 - 6761
  • [2] Multiple Temporal Pooling Mechanisms for Weakly Supervised Temporal Action Localization
    Dou, Peng
    Zeng, Ying
    Wang, Zhuoqun
    Hu, Haifeng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (03)
  • [3] Superframe-Based Temporal Proposals for Weakly Supervised Temporal Action Detection
    Li, Bairong
    Guo, Biao
    Zhu, Yuesheng
    Yin, Jianfeng
    Ji, Xiangli
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3628 - 3641
  • [4] Weakly Supervised Temporal Action Detection With Temporal Dependency Learning
    Li, Bairong
    Liu, Ruixin
    Chen, Tianquan
    Zhu, Yuesheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4473 - 4485
  • [5] ACTION COHERENCE NETWORK FOR WEAKLY SUPERVISED TEMPORAL ACTION LOCALIZATION
    Zhai, Yuanhao
    Wang, Le
    Liu, Ziyi
    Zhang, Qilin
    Hua, Gang
    Zheng, Nanning
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3696 - 3700
  • [6] Temporal Structure Mining for Weakly Supervised Action Detection
    Yu, Tan
    Ren, Zhou
    Li, Yuncheng
    Yan, Enxu
    Xu, Ning
    Yuan, Junsong
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5521 - 5530
  • [7] Action Coherence Network for Weakly-Supervised Temporal Action Localization
    Zhai, Yuanhao
    Wang, Le
    Tang, Wei
    Zhang, Qilin
    Zheng, Nanning
    Hua, Gang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1857 - 1870
  • [8] Action Unit Memory Network for Weakly Supervised Temporal Action Localization
    Luo, Wang
    Zhang, Tianzhu
    Yang, Wenfei
    Liu, Jingen
    Mei, Tao
    Wu, Feng
    Zhang, Yongdong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9964 - 9974
  • [9] Complementary Attention Network for Weakly Supervised Temporal Action Localization
    Dou, Peng
    Hu, Haifeng
    NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6713 - 6732
  • [10] Ensemble Prototype Network For Weakly Supervised Temporal Action Localization
    Wu, Kewei
    Luo, Wenjie
    Xie, Zhao
    Guo, Dan
    Zhang, Zhao
    Hong, Richang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15