Advancing Temporal Action Localization with a Boundary Awareness Network

被引:0
|
作者
Gu, Jialiang [1 ]
Yi, Yang [1 ]
Wang, Min [1 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
action boundary detection; Gaussian boundary module; video understanding; temporal action localization;
D O I
10.3390/electronics13061099
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Temporal action localization (TAL) is crucial in video analysis, yet presents notable challenges. This process focuses on the precise identification and categorization of action instances within lengthy, raw videos. A key difficulty in TAL lies in determining the exact start and end points of actions, owing to the often unclear boundaries of these actions in real-world footage. Existing methods tend to take insufficient account of changes in action boundary features. To tackle these issues, we propose a boundary awareness network (BAN) for TAL. Specifically, the BAN mainly consists of a feature encoding network, coarse pyramidal detection to obtain preliminary proposals and action categories, and fine-grained detection with a Gaussian boundary module (GBM) to get more valuable boundary information. The GBM contains a novel Gaussian boundary pooling, which serves to aggregate the relevant features of the action boundaries and to capture discriminative boundary and actionness features. Furthermore, we introduce a novel approach named Boundary Differentiated Learning (BDL) to ensure our model's capability in accurately identifying action boundaries across diverse proposals. Comprehensive experiments on both the THUMOS14 and ActivityNet v1.3 datasets, where our BAN model achieved an increase in mean Average Precision (mAP) by 1.6% and 0.2%, respectively, over existing state-of-the-art methods, illustrate that our approach not only improves upon the current state of the art but also achieves outstanding performance.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Gaussian Temporal Awareness Networks for Action Localization
    Long, Fuchen
    Yao, Ting
    Qiu, Zhaofan
    Tian, Xinmei
    Luo, Jiebo
    Mei, Tao
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 344 - 353
  • [2] Dual relation network for temporal action localization
    Xia, Kun
    Wang, Le
    Zhou, Sanping
    Hua, Gang
    Tang, Wei
    PATTERN RECOGNITION, 2022, 129
  • [3] TVNet: Temporal Voting Network for Action Localization
    Wang, Hanyuan
    Damen, Dima
    Mirmehdi, Majid
    Perrett, Toby
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, : 550 - 558
  • [4] STAN: Spatial-Temporal Awareness Network for Temporal Action Detection
    Liu, Minghao
    Liu, Haiyi
    Zhao, Sirui
    Ma, Fei
    Li, Minglei
    Dai, Zonghong
    Wang, Hao
    Xu, Tong
    Chen, Enhong
    PROCEEDINGS OF THE 6TH INTERNATIONAL WORKSHOP ON MULTIMEDIA CONTENT ANALYSIS IN SPORTS, MMSPORTS 2023, 2023, : 161 - 165
  • [5] MTSN: Multiscale Temporal Similarity Network for Temporal Action Localization
    Jin, Xiaodong
    Zhang, Taiping
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2573 - 2581
  • [6] A Malleable Boundary Network for temporal action detection?
    Wang, Tian
    Hou, Boyao
    Li, Zexian
    Li, Zhe
    Huang, Lei
    Zhang, Baochang
    Snoussi, Hichem
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 103
  • [7] ACTION COHERENCE NETWORK FOR WEAKLY SUPERVISED TEMPORAL ACTION LOCALIZATION
    Zhai, Yuanhao
    Wang, Le
    Liu, Ziyi
    Zhang, Qilin
    Hua, Gang
    Zheng, Nanning
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3696 - 3700
  • [8] BOUNDARY INFORMATION MATTERS MORE: ACCURATE TEMPORAL ACTION DETECTION WITH TEMPORAL BOUNDARY NETWORK
    Zhang, Tao
    Liu, Shan
    Li, Thomas
    Li, Ge
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1642 - 1646
  • [9] A Temporal-Aware Relation and Attention Network for Temporal Action Localization
    Zhao, Yibo
    Zhang, Hua
    Gao, Zan
    Guan, Weili
    Nie, Jie
    Liu, Anan
    Wang, Meng
    Chen, Shengyong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4746 - 4760
  • [10] Temporal Action Localization With Coarse-to-Fine Network
    Zhejiang Industry Polytechnic College, Department of Design and Art, Shaoxing
    312000, China
    不详
    310018, China
    IEEE Access, 2022, (96378-96387)