Multi-scale Dynamic Network for Temporal Action Detection

被引:2
|
作者
Ren, Yifan [1 ,2 ]
Xu, Xing [1 ,2 ]
Shen, Fumin [1 ,2 ]
Wang, Zheng [1 ,2 ]
Yang, Yang [1 ,2 ]
Shen, Heng Tao [1 ,2 ]
机构
[1] Univ Elect Sci & Technol China, Ctr Future Media, Chengdu, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Temporal Action Detection; Dynamic Filters; Multi-scale Features;
D O I
10.1145/3460426.3463613
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, as the fundamental task in video understanding, Temporal Action Detection is attracting extensive attention. Most existing approaches use the same model parameters to process all input videos, which are not adaptive to the input video during the inference stage. In this paper, we propose a novel model termed Multi-scale Dynamic Network (MDN) to tackle this problem. The proposed MDN model incorporates multiple Multi-scale Dynamic Modules (MDMs). Each MDM can generate video-specific and segment-specific convolution kernels based on video content from different scales and adaptively capture rich semantic information for the prediction. Besides, we also design a new Edge Suppression Loss (ESL) function for MDN to pay more attention to hard examples. Extensive experiments conducted on two popular benchmarks ActivityNet-1.3 and THUMOS-14 show that the proposed MDN model achieves the state-of-the-art performance.
引用
收藏
页码:267 / 275
页数:9
相关论文
共 50 条
  • [41] Physical Knowledge Driven Multi-scale Temporal Receptive Field Network for Compressed Video Action Recognition
    He, Lijun
    Zhang, Miao
    Zhang, Sijin
    Li, Fan
    UBICOMP/ISWC '21 ADJUNCT: PROCEEDINGS OF THE 2021 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2021 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2021, : 625 - 630
  • [42] Multi-scale spatial-temporal convolutional neural network for skeleton-based action recognition
    Cheng, Qin
    Cheng, Jun
    Ren, Ziliang
    Zhang, Qieshi
    Liu, Jianming
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (03) : 1303 - 1315
  • [43] Multi-scale spatialtemporal information deep fusion network with temporal pyramid mechanism for video action recognition
    Ou, Hongshi
    Sun, Jifeng
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (03) : 4533 - 4545
  • [44] A multi-scale gated network for retinal hemorrhage detection
    Xia, Haiying
    Rao, Zengyan
    Zhou, Zuoshan
    APPLIED INTELLIGENCE, 2023, 53 (05) : 5259 - 5273
  • [45] Lightweight multi-scale network for small object detection
    Li, Li
    Li, Bingxue
    Zhou, Hongjuan
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [46] An efficient network for multi-scale and overlapped wildlife detection
    Lu, Xin
    Lu, Xiaobo
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (02) : 343 - 351
  • [47] Multi-scale Context Enhancement Network for Object Detection
    Wang, Yanan
    Ma, Yingdong
    2022 2ND IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE (SEAI 2022), 2022, : 6 - 11
  • [48] Multi-scale semantic enhancement network for object detection
    Guo, Dongen
    Wu, Zechen
    Feng, Jiangfan
    Zou, Tao
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [49] StairsNet: Mixed Multi-scale Network for Object Detection
    Gao, Weiyi
    Cao, Wenlong
    Zhai, Jian
    Rui, Jianwu
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 303 - 314
  • [50] MULTI-SCALE ENHANCED DEEP NETWORK FOR ROAD DETECTION
    Lu, Xiaoyan
    Zhong, Yanfei
    Zhao, Ji
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 3947 - 3950