Contextual Enhancement-Interaction and Multi-Scale Weighted Fusion Network for Aerial Tracking

被引:0
|
作者
Wang, Bo [1 ,2 ]
Wang, Xuan [1 ]
Ma, Linglong [1 ,2 ]
Zuo, Yujia [1 ]
Liu, Chenglong [1 ]
机构
[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys, Changchun 130033, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
aerial tracking; UAV visual object tracking; Siamese tracker; contextual enhancement-interaction; multi-scale fusion; PLUS PLUS;
D O I
10.3390/drones8080343
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Siamese-based trackers have been widely utilized in UAV visual tracking due to their outstanding performance. However, UAV visual tracking encounters numerous challenges, such as similar targets, scale variations, and background clutter. Existing Siamese trackers face two significant issues: firstly, they rely on single-branch features, limiting their ability to achieve long-term and accurate aerial tracking. Secondly, current tracking algorithms treat multi-level similarity responses equally, making it difficult to ensure tracking accuracy in complex airborne environments. To tackle these challenges, we propose a novel UAV tracking Siamese network named the contextual enhancement-interaction and multi-scale weighted fusion network, which is designed to improve aerial tracking performance. Firstly, we designed a contextual enhancement-interaction module to improve feature representation. This module effectively facilitates the interaction between the template and search branches and strengthens the features of each branch in parallel. Specifically, a cross-attention mechanism within the module integrates the branch information effectively. The parallel Transformer-based enhancement structure improves the feature saliency significantly. Additionally, we designed an efficient multi-scale weighted fusion module that adaptively weights the correlation response maps across different feature scales. This module fully utilizes the global similarity response between the template and the search area, enhancing feature distinctiveness and improving tracking results. We conducted experiments using several state-of-the-art trackers on aerial tracking benchmarks, including DTB70, UAV123, UAV20L, and UAV123@10fps, to validate the efficacy of the proposed network. The experimental results demonstrate that our tracker performs effectively in complex aerial tracking scenarios and competes well with state-of-the-art trackers.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] MSPENet: multi-scale adaptive fusion and position enhancement network for human pose estimation
    Xu, Jia
    Liu, Weibin
    Xing, Weiwei
    Wei, Xiang
    VISUAL COMPUTER, 2023, 39 (05): : 2005 - 2019
  • [22] MSPENet: multi-scale adaptive fusion and position enhancement network for human pose estimation
    Jia Xu
    Weibin Liu
    Weiwei Xing
    Xiang Wei
    The Visual Computer, 2023, 39 : 2005 - 2019
  • [23] EMFF-Net: Edge-Enhancement Multi-Scale Feature Fusion Network
    Guan, Xuhui
    Zhou, Jiwang
    Chen, Jian
    Xu, Xiaodan
    Jiang, Yizhang
    Xia, Kaijian
    IEEE Access, 2025, 13 : 25598 - 25611
  • [24] Counting with Self-Weighted Multi-Scale Fusion Networks
    Xiong, Xin
    Shen, Jie
    Li, Ying
    He, Wei
    Li, Peng
    Yan, Wenjie
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (10)
  • [25] Optimally-Weighted Multi-Scale Local Feature Fusion Network for Driver Distraction Recognition
    Fan, Li Shao
    Gao, Shangbing
    IEEE ACCESS, 2022, 10 : 128554 - 128561
  • [26] Optimally-Weighted Multi-Scale Local Feature Fusion Network for Driver Distraction Recognition
    Fan, Li Shao
    Shangbing, Gao
    IEEE Access, 2022, 10 : 128554 - 128561
  • [27] LEARNED IMAGE COMPRESSION WITH MULTI-SCALE SPATIAL AND CONTEXTUAL INFORMATION FUSION
    Liu, Ziyi
    Wang, Hanli
    Su, Taiyi
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 706 - 710
  • [28] Multi-Scale and Multi-Stream Fusion Network for Pansharpening
    Jian, Lihua
    Wu, Shaowu
    Chen, Lihui
    Vivone, Gemine
    Rayhana, Rakiba
    Zhang, Di
    REMOTE SENSING, 2023, 15 (06)
  • [29] Attentional single-shot network with multi-scale feature fusion for object detection in aerial images
    Wang, Yusheng
    Wang, Hongzhang
    Tang, Eryong
    Liu, Ye
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4754 - 4758
  • [30] A Multi-Scale Feature Fusion Based Lightweight Vehicle Target Detection Network on Aerial Optical Images
    Yu, Chengrui
    Jiang, Xiaonan
    Wu, Fanlu
    Fu, Yao
    Pei, Junyan
    Zhang, Yu
    Li, Xiangzhi
    Fu, Tianjiao
    Remote Sensing, 2024, 16 (19)