Dilated Multi-Temporal Modeling for Action Recognition

被引:0
|
作者
Zhang, Tao [1 ]
Wu, Yifan [1 ]
Li, Xiaoqiang [1 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 12期
关键词
computer vision; action recognition; multiple temporal modeling; dilated convolution;
D O I
10.3390/app13126934
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Action recognition involves capturing temporal information from video clips where the duration varies with videos for the same action. Due to the diverse scale of temporal context, uniform size kernels utilized in convolutional neural networks (CNNs) limit the capability of multiple-scale temporal modeling. In this paper, we propose a novel dilated multi-temporal (DMT) module that provides a solution for modeling multi-temporal information in action recognition. By using dilated convolutions with different dilation rates in different feature map channels, the DMT module captures information at multiple scales without the need for costly multi-branch networks, input-level frame pyramids, or feature map stacking that previous works have usually incurred. Therefore, this approach enables the integration of temporal information from multiple scales. In addition, the DMT module can be integrated into existing 2D CNNs, making it a straightforward and intuitive solution for addressing the challenge of multi-temporal modeling. Our proposed method has demonstrated promising results in performance and has achieved about 2% and 1% accuracy improvement on FineGym99 and SthV1. We conducted an empirical analysis that demonstrates how DMT improves the classification accuracy for action classes with varying durations.
引用
收藏
页数:15
相关论文
共 50 条
  • [11] Multi-Temporal Scales Consensus for Weakly Supervised Temporal Action Localization
    Guo, Wenbin
    Yang, Xingming
    Jiang, Zheyuan
    Wu, Kewei
    Xie, Zhao
    Computer Engineering and Applications, 2023, 59 (10): : 151 - 161
  • [12] Multi-temporal scale aggregation refinement graph convolutional network for skeleton-based action recognition
    Li, Xuanfeng
    Lu, Jian
    Zhou, Jian
    Liu, Wei
    Zhang, Kaibing
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2024, 35 (01)
  • [13] Dryland Crop Recognition Based on Multi-temporal Polarization SAR Data
    Sun, Zheng
    Wang, Di
    Zhou, Qingbo
    2019 8TH INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS (AGRO-GEOINFORMATICS), 2019,
  • [14] Multi-temporal Image Change Recognition Policy by Fragmentation and Feature Mining
    David, D. Beulah
    Dorairangaswamy, M. A.
    2018 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC 2018), 2018, : 22 - 25
  • [15] Multi-temporal terrestrial laser scanning for modeling tree biomass change
    Srinivasan, Shruthi
    Popescu, Sorin C.
    Eriksson, Marian
    Sheridan, Ryan D.
    Ku, Nian-Wei
    FOREST ECOLOGY AND MANAGEMENT, 2014, 318 : 304 - 317
  • [16] Statistical Wavelet Subband Modeling for Multi-Temporal SAR Change Detection
    Cui, Shiyong
    Datcu, Mihai
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2012, 5 (04) : 1095 - 1109
  • [17] Multi-temporal FFT Regression
    Mamun, Md. Ai
    Mondal, Md. Nazrul Islam
    Ahmed, Boshir
    Zaman, Md. Shahid Uz
    Afroge, Shyla
    2015 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION ENGINEERING (ICCIE), 2015, : 134 - 137
  • [18] Cluster-guided temporal modeling for action recognition
    Kim, Jeong-Hun
    Hao, Fei
    Leung, Carson Kai-Sang
    Nasridinov, Aziz
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2023, 12 (02)
  • [19] Multimodal action recognition: a comprehensive survey on temporal modeling
    Shabaninia, Elham
    Nezamabadi-pour, Hossein
    Shafizadegan, Fatemeh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (20) : 59439 - 59489
  • [20] Cluster-guided temporal modeling for action recognition
    Jeong-Hun Kim
    Fei Hao
    Carson Kai-Sang Leung
    Aziz Nasridinov
    International Journal of Multimedia Information Retrieval, 2023, 12