Boundary-Aware Temporal Sentence Grounding with Adaptive Proposal Refinement

Authors
Dong, Jianxiang [1]
Yin, Zhaozheng [1]
Affiliations
[1] SUNY Stony Brook, Stony Brook, NY 11794 USA
Funding
National Science Foundation (USA);
DOI
10.1007/978-3-031-26316-3_38
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Temporal sentence grounding (TSG) in videos aims to localize the temporal interval from an untrimmed video that is relevant to a given query sentence. In this paper, we introduce an effective proposal-based approach to solve the TSG problem. A Boundary-aware Feature Enhancement (BAFE) module is proposed to enhance the proposal feature with its boundary information, by imposing a new temporal difference loss. Meanwhile, we introduce a Boundary-aware Feature Aggregation (BAFA) module to aggregate boundary features and propose a Proposal-level Contrastive Learning (PCL) method to learn query-related content features by maximizing the mutual information between the query and proposals. Furthermore, we introduce a Proposal Interaction (PI) module with Adaptive Proposal Selection (APS) strategies to effectively refine proposal representations and make the final localization. Extensive experiments on Charades-STA, ActivityNet-Captions and TACoS datasets show the effectiveness of our solution. Our code is available at https://github.com/DJX1995/BAN-APR.
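The abstract describes Proposal-level Contrastive Learning only as maximizing the mutual information between the query and proposals. One common way to do that is the InfoNCE objective, which lower-bounds mutual information. The sketch below is a minimal illustration under that assumption and is not taken from the authors' code; the function name `info_nce`, the cosine similarity, and the `temperature` value are all hypothetical choices made for the example.

```python
import math


def info_nce(query, proposals, positive_idx, temperature=0.1):
    """InfoNCE loss for one query vector against a list of proposal vectors.

    Assumption: exactly one proposal (positive_idx) matches the query and
    the rest act as negatives. All vectors must be non-zero.
    """
    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        norm_u = math.sqrt(sum(a * a for a in u))
        norm_v = math.sqrt(sum(b * b for b in v))
        return dot / (norm_u * norm_v)

    # Scaled similarity between the query and every proposal.
    logits = [cosine(query, p) / temperature for p in proposals]

    # Numerically stable log-sum-exp over all proposals.
    top = max(logits)
    log_z = top + math.log(sum(math.exp(l - top) for l in logits))

    # Negative log-probability of picking the positive proposal.
    return log_z - logits[positive_idx]


if __name__ == "__main__":
    query = [1.0, 0.0]
    proposals = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
    # The loss is lower when the positive proposal is the one aligned with the query.
    print(info_nce(query, proposals, positive_idx=0))
    print(info_nce(query, proposals, positive_idx=2))
```

Minimizing this loss pulls the query toward its matching proposal and pushes it away from the others. How the paper actually selects positives and negatives is not stated in the abstract.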
Pages: 641-657
Number of pages: 17