Advancing Temporal Action Localization with a Boundary Awareness Network

Authors
Gu, Jialiang [1]
Yi, Yang [1]
Wang, Min [1]
Affiliations
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
action boundary detection; Gaussian boundary module; video understanding; temporal action localization
DOI
10.3390/electronics13061099
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Temporal action localization (TAL) is crucial in video analysis, yet presents notable challenges. The task is to precisely identify and categorize action instances within lengthy, raw videos. A key difficulty lies in determining the exact start and end points of actions, because action boundaries in real-world footage are often unclear. Existing methods tend to take insufficient account of changes in action boundary features. To tackle these issues, we propose a boundary awareness network (BAN) for TAL. The BAN consists mainly of a feature encoding network, a coarse pyramidal detection stage that produces preliminary proposals and action categories, and a fine-grained detection stage with a Gaussian boundary module (GBM) that extracts more valuable boundary information. The GBM contains a novel Gaussian boundary pooling operation, which aggregates the relevant features of the action boundaries and captures discriminative boundary and actionness features. Furthermore, we introduce Boundary Differentiated Learning (BDL), a novel approach that helps the model identify action boundaries accurately across diverse proposals. In comprehensive experiments on the THUMOS14 and ActivityNet v1.3 datasets, the BAN model improves mean Average Precision (mAP) by 1.6% and 0.2%, respectively, over existing state-of-the-art methods.
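The abstract does not give the formula for Gaussian boundary pooling, so the following is only an illustrative sketch of one generic way to aggregate features around a boundary with Gaussian weights. It is not the paper's GBM. The function name `gaussian_boundary_pool`, the `sigma` parameter, and the feature shapes are all assumptions introduced here for illustration.

```python
import numpy as np


def gaussian_boundary_pool(features, center, sigma):
    """Pool a (T, C) feature sequence into one C-dim vector.

    Hypothetical helper, not taken from the paper: each time step is
    weighted by a Gaussian centred on a (possibly fractional) boundary
    position, so features near the boundary dominate the result.
    """
    t = np.arange(features.shape[0], dtype=np.float64)
    weights = np.exp(-0.5 * ((t - center) / sigma) ** 2)
    weights /= weights.sum()  # normalise so the weights sum to 1
    return weights @ features  # weighted average over time, shape (C,)


# Toy usage: random features for a 100-step clip with 8 channels
# (assumed shapes), pooled around an assumed start and end position.
features = np.random.default_rng(0).normal(size=(100, 8))
start_feat = gaussian_boundary_pool(features, center=20.5, sigma=2.0)
end_feat = gaussian_boundary_pool(features, center=63.0, sigma=2.0)
```

In this sketch a smaller `sigma` concentrates the pooled feature on the frames closest to the boundary, while a larger one blends in more surrounding context.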
Pages: 15