Boundary-Aware Temporal Sentence Grounding with Adaptive Proposal Refinement

被引:0
|
作者
Dong, Jianxiang [1 ]
Yin, Zhaozheng [1 ]
机构
[1] SUNY Stony Brook, Stony Brook, NY 11794 USA
来源
基金
美国国家科学基金会;
关键词
D O I
10.1007/978-3-031-26316-3_38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Temporal sentence grounding (TSG) in videos aims to localize the temporal interval from an untrimmed video that is relevant to a given query sentence. In this paper, we introduce an effective proposal-based approach to solve the TSG problem. A Boundary-aware Feature Enhancement (BAFE) module is proposed to enhance the proposal feature with its boundary information, by imposing a new temporal difference loss. Meanwhile, we introduce a Boundary-aware Feature Aggregation (BAFA) module to aggregate boundary features and propose a Proposal-level Contrastive Learning (PCL) method to learn query-related content features by maximizing the mutual information between the query and proposals. Furthermore, we introduce a Proposal Interaction (PI) module with Adaptive Proposal Selection (APS) strategies to effectively refine proposal representations and make the final localization. Extensive experiments on Charades-STA, ActivityNet-Captions and TACoS datasets show the effectiveness of our solution. Our code is available at https://github.com/DJX1995/BAN-APR.
引用
收藏
页码:641 / 657
页数:17
相关论文
共 50 条
  • [41] Boundary-aware small object detection with attention and interaction
    Feng, Qihan
    Shao, Zhiwen
    Wang, Zhixiao
    VISUAL COMPUTER, 2024, 40 (09): : 5921 - 5934
  • [42] Decision Boundary-Aware Data Augmentation for Adversarial Training
    Chen, Chen
    Zhang, Jingfeng
    Xu, Xilie
    Lyu, Lingjuan
    Chen, Chaochao
    Hu, Tianlei
    Chen, Gang
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (03) : 1882 - 1894
  • [43] Boundary-Aware Face Alignment with Enhanced HourglassNet and Transformer
    Li, Yingxin
    Niu, Dongmei
    Peng, Jingliang
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2023, 12 (01)
  • [44] Hierarchical Boundary-Aware Neural Encoder for Video Captioning
    Baraldi, Lorenzo
    Grana, Costantino
    Cucchiara, Rita
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3185 - 3194
  • [45] Unified Boundary-Aware Texturing for Interactive Volume Rendering
    Ropinski, Timo
    Diepenbrock, Stefan
    Bruckner, Stefan
    Hinrichs, Klaus
    Groeller, Eduard
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2012, 18 (11) : 1942 - 1955
  • [46] Boundary-aware texture region segmentation from manga
    Xueting Liu
    Chengze Li
    Tien-Tsin Wong
    Computational Visual Media, 2017, 3 (01) : 61 - 71
  • [47] Selectivity or Invariance: Boundary-aware Salient Object Detection
    Su, Jinming
    Li, Jia
    Zhang, Yu
    Xia, Changqun
    Tian, Yonghong
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3798 - 3807
  • [48] SBAT: Video Captioning with Sparse Boundary-Aware Transformer
    Jin, Tao
    Huang, Siyu
    Chen, Ming
    Li, Yingming
    Zhang, Zhongfei
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 630 - 636
  • [49] Boundary-Aware Bilateral Fusion Network for Cloud Detection
    Zhao, Chao
    Zhang, Xiang
    Kuang, Nailiang
    Luo, Hangzai
    Zhong, Sheng
    Fan, Jianping
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [50] Boundary-aware Image Inpainting with Multiple Auxiliary Cues
    Yamashita, Yohei
    Shimosato, Kodai
    Ukita, Norimichi
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 618 - 628