Graph Attention Network for Context-Aware Visual Tracking

被引:0
|
作者
Shao, Yanyan [1 ]
Guo, Dongyan [1 ]
Cui, Ying [1 ]
Wang, Zhenhua [2 ]
Zhang, Liyan [3 ]
Zhang, Jianhua [4 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China
[2] Northwest A&F Univ, Coll Informat Engn, Xianyang 712199, Peoples R China
[3] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[4] Tianjin Univ Technol, Sch Comp Sci & Engn, Tianjin 300384, Peoples R China
基金
中国国家自然科学基金;
关键词
Target tracking; Visualization; Shape; Search problems; Object tracking; Feature extraction; Correlation; Context-aware tracking; graph attention mechanism; Siamese network; visual tracking; ONLINE OBJECT TRACKING;
D O I
10.1109/TNNLS.2024.3442290
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Siamese-network-based trackers convert the general object tracking as a similarity matching task between a template and a search region. Using convolutional feature cross correlation (Xcorr) for similarity matching, a large number of Siamese trackers are proposed and achieved great success. However, due to the predefined size of the target feature, these trackers suffer from either retaining much background information or losing important foreground information. Moreover, the global matching between the target and search region also largely neglects the part-level structural information and the contextual information of the target. To tackle the aforementioned obstacles, in this article, we propose a simple context-aware Siamese graph attention network, which establishes part-to-part correspondence between the Siamese branches with a complete bipartite graph. The object information from the template is propagated to the search region via a graph attention mechanism. With such a design, a target-aware template input is enabled to replace the prefixed template region, which can adaptively fit the size and aspect ratio variations in different objects. Based on it, we further construct a context-aware feature matching mechanism to embed both the target and the contextual information in the search region. Experiments on challenging benchmarks including GOT-10k, TrackingNet, LaSOT, VOT2020, and OTB-100 demonstrate that the proposed SiamGAT* outperforms many state-of-the-art trackers and achieves leading performance. Code is available at: https://git.io/SiamGAT.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Context-Aware Visual Tracking
    Yang, Ming
    Wu, Ying
    Hua, Gang
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (07) : 1195 - 1209
  • [2] CAAN: Context-Aware attention network for visual question answering
    Chen, Chongqing
    Han, Dezhi
    Chang, Chin-Chen
    [J]. Pattern Recognition, 2022, 132
  • [3] CAAN: Context-Aware attention network for visual question answering
    Chen, Chongqing
    Han, Dezhi
    Chang, Chin -Chen
    [J]. PATTERN RECOGNITION, 2022, 132
  • [4] Context-Aware weighted for visual tracking
    Ding, Xiaoxue
    Xu, Haixia
    Huang, Yunjia
    Liu, Yong
    [J]. 2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1598 - 1603
  • [5] Graph Neural Network for Context-Aware Recommendation
    Asma Sattar
    Davide Bacciu
    [J]. Neural Processing Letters, 2023, 55 : 5357 - 5376
  • [6] Graph Neural Network for Context-Aware Recommendation
    Sattar, Asma
    Bacciu, Davide
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (05) : 5357 - 5376
  • [7] Context-aware attention network for image recognition
    Jiaxu Leng
    Ying Liu
    Shang Chen
    [J]. Neural Computing and Applications, 2019, 31 : 9295 - 9305
  • [8] Context-aware attention network for image recognition
    Leng, Jiaxu
    Liu, Ying
    Chen, Shang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2019, 31 (12): : 9295 - 9305
  • [9] Context-aware Siamese network for object tracking
    Zhang, Jianwei
    Wang, Jingchao
    Zhang, Huanlong
    Miao, Mengen
    Wu, Di
    [J]. IET IMAGE PROCESSING, 2023, 17 (01) : 215 - 226
  • [10] Jointly learning invocations and descriptions for context-aware mashup tagging with graph attention network
    Xin Wang
    Xiao Liu
    Hao Wu
    Jin Liu
    Xiaomei Chen
    Zhou Xu
    [J]. World Wide Web, 2023, 26 : 1295 - 1322