Temporal Feature Enhancement Dilated Convolution Network for Weakly-supervised Temporal Action Localization

Cited by: 8
Authors
Zhou, Jianxiong [1 ]
Wu, Ying [1 ]
Affiliation
[1] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA
DOI
10.1109/WACV56688.2023.00597
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Weakly-supervised Temporal Action Localization (WTAL) aims to classify and localize action instances in untrimmed videos using only video-level labels. Existing methods typically use snippet-level RGB and optical flow features taken directly from pre-trained extractors. This approach has two limitations: each snippet spans only a short temporal window, and the initial features are not tailored to the WTAL task. As a result, these methods make poor use of temporal information and their performance is limited. In this paper, we propose the Temporal Feature Enhancement Dilated Convolution Network (TFE-DCN) to address both limitations. TFE-DCN has an enlarged receptive field that covers a long temporal span and observes the full dynamics of action instances, which enables it to capture temporal dependencies between snippets. Furthermore, we propose the Modality Enhancement Module, which enhances RGB features with the help of enhanced optical flow features, making the overall features appropriate for the WTAL task. Experiments on the THUMOS'14 and ActivityNet v1.3 datasets show that the proposed approach far outperforms state-of-the-art WTAL methods.
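The abstract's claim about an enlarged receptive field can be illustrated with a generic sketch of stacked dilated 1D convolutions over a snippet sequence. This is not the TFE-DCN architecture: the kernel size of 3, the dilation rates (1, 2, 4), the sequence length, and the function names `receptive_field` and `dilated_conv1d` are all assumptions chosen for illustration, since the abstract does not state these details.

```python
from typing import List, Sequence


def receptive_field(kernel_size: int, dilations: Sequence[int]) -> int:
    """Receptive field, in snippets, of stacked stride-1 dilated 1D convolutions."""
    rf = 1
    for d in dilations:
        # Each layer widens the field by (kernel_size - 1) * dilation snippets.
        rf += (kernel_size - 1) * d
    return rf


def dilated_conv1d(x: List[float], weights: Sequence[float], dilation: int) -> List[float]:
    """Single-channel dilated 1D convolution with zero padding (output length == input length)."""
    centre = len(weights) // 2
    out = []
    for t in range(len(x)):
        acc = 0.0
        for j, w in enumerate(weights):
            idx = t + (j - centre) * dilation  # taps are spaced `dilation` snippets apart
            if 0 <= idx < len(x):
                acc += w * x[idx]
        out.append(acc)
    return out


# Assumed (hypothetical) settings: kernel size 3, dilation rates 1, 2, 4.
DILATIONS = (1, 2, 4)

# Push a unit impulse through the stack; the number of non-zero outputs
# shows how many snippets a single position can influence.
signal = [0.0] * 15
signal[7] = 1.0
for d in DILATIONS:
    signal = dilated_conv1d(signal, [1.0, 1.0, 1.0], d)
support = sum(1 for v in signal if v != 0.0)

print(receptive_field(3, DILATIONS))  # dilated stack
print(receptive_field(3, (1, 1, 1)))  # same depth without dilation
print(support)
```

Under these assumed settings, three dilated layers cover 15 snippets, while three ordinary layers of the same kernel size cover 7. This is the general mechanism by which dilation lengthens the temporal span without adding layers or parameters.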
Pages: 6017-6026
Page count: 10