Temporal Feature Enhancement Dilated Convolution Network for Weakly-supervised Temporal Action Localization

被引：8

作者：

Zhou, Jianxiong ^{[1
]}

Wu, Ying ^{[1
]}

机构：

[1] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA

来源：

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2023年

关键词：

D O I：

10.1109/WACV56688.2023.00597

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Weakly-supervised Temporal Action Localization (WTAL) aims to classify and localize action instances in untrimmed videos with only video-level labels. Existing methods typically use snippet-level RGB and optical flow features extracted from pre-trained extractors directly. Because of two limitations: the short temporal span of snippets and the inappropriate initial features, these WTAL methods suffer from the lack of effective use of temporal information and have limited performance. In this paper, we propose the Temporal Feature Enhancement Dilated Convolution Network (TFE-DCN) to address these two limitations. The proposed TFE-DCN has an enlarged receptive field that covers a long temporal span to observe the full dynamics of action instances, which makes it powerful to capture temporal dependencies between snippets. Furthermore, we propose the Modality Enhancement Module that can enhance RGB features with the help of enhanced optical flow features, making the overall features appropriate for the WTAL task. Experiments conducted on THUMOS'14 and ActivityNet v1.3 datasets show that our proposed approach far outperforms state-of-the-art WTAL methods.

引用

页码：6017 / 6026

页数：10

共 50 条

[21] Deep Motion Prior for Weakly-Supervised Temporal Action Localization
Cao, Meng
Zhang, Can
Chen, Long
Shou, Mike Zheng
Zou, Yuexian
IEEE Transactions on Image Processing, 2022, 31 : 5203 - 5213
[22] Dynamic Graph Modeling for Weakly-Supervised Temporal Action Localization
Shi, Haichao
Zhang, Xiao-Yu
Li, Changsheng
Gong, Lixing
Li, Yong
Bao, Yongjun
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3820 - 3828
[23] Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization
Ju, Chen
Zhao, Peisen
Chen, Siheng
Zhang, Ya
Zhang, Xiaoyun
Wang, Yanfeng
Tian, Qi
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 6688 - 6701
[24] Vectorized Evidential Learning for Weakly-Supervised Temporal Action Localization
Gao, Junyu
Chen, Mengyuan
Xu, Changsheng
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 15949 - 15963
[25] Dynamic Graph Modeling for Weakly-Supervised Temporal Action Localization
Shi, Haichao
Zhang, Xiao-Yu
Li, Changsheng
Gong, Lixing
Li, Yong
Bao, Yongjun
MM 2022 - Proceedings of the 30th ACM International Conference on Multimedia, 2022, : 3820 - 3828
[26] Boosting Weakly-Supervised Temporal Action Localization with Text Information
Li, Guozhang
Cheng, De
Ding, Xinpeng
Wang, Nannan
Wang, Xiaoyu
Gao, Xinbo
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10648 - 10657
[27] Weakly-Supervised Temporal Action Localization by Progressive Complementary Learning
Du, Jia-Run
Feng, Jia-Chang
Lin, Kun-Yu
Hong, Fa-Ting
Qi, Zhongang
Shan, Ying
Hu, Jian-Fang
Zheng, Wei-Shi
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (01) : 938 - 952
[28] Complementary adversarial mechanisms for weakly-supervised temporal action localization
Wang, Chuanxu
Wang, Jing
Liu, Peng
PATTERN RECOGNITION, 2023, 139
[29] A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization
Islam, Ashraful
Long, Chengjiang
Radke, Richard
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1637 - 1645
[30] Deep Motion Prior for Weakly-Supervised Temporal Action Localization
Cao, Meng
Zhang, Can
Chen, Long
Shou, Mike Zheng
Zou, Yuexian
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5203 - 5213

← 1 2 3 4 5 →