Cross-Parallel Attention and Efficient Match Transformer for Aerial Tracking

被引:1
|
作者
Deng, Anping [1 ,2 ]
Han, Guangliang [1 ]
Zhang, Zhongbo [3 ]
Chen, Dianbing [1 ]
Ma, Tianjiao [1 ]
Liu, Zhichao [1 ,2 ]
机构
[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys CIOMP, Changchun 130033, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 101408, Peoples R China
[3] Jilin Univ, Sch Math, Changchun 130012, Peoples R China
关键词
visual object tracking; UAV tracking; efficient match transformer; attention method;
D O I
10.3390/rs16060961
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Visual object tracking is a key technology that is used in unmanned aerial vehicles (UAVs) to achieve autonomous navigation. In recent years, with the rapid development of deep learning, tracking algorithms based on Siamese neural networks have received widespread attention. However, because of complex and diverse tracking scenarios, as well as limited computational resources, most existing tracking algorithms struggle to ensure real-time stable operation while improving tracking performance. Therefore, studying efficient and fast-tracking frameworks, and enhancing the ability of algorithms to respond to complex scenarios has become crucial. Therefore, this paper proposes a cross-parallel attention and efficient match transformer for aerial tracking (SiamEMT). Firstly, we carefully designed the cross-parallel attention mechanism to encode global feature information and to achieve cross-dimensional interaction and feature correlation aggregation via parallel branches, highlighting feature saliency and reducing global redundancy information, as well as improving the tracking algorithm's ability to distinguish between targets and backgrounds. Meanwhile, we implemented an efficient match transformer to achieve feature matching. This network utilizes parallel, lightweight, multi-head attention mechanisms to pass template information to the search region features, better matching the global similarity between the template and search regions, and improving the algorithm's ability to perceive target location and feature information. Experiments on multiple drone public benchmark tests verified the accuracy and robustness of the proposed tracker in drone tracking scenarios. In addition, on the embedded artificial intelligence (AI) platform AGX Xavier, our algorithm achieved real-time tracking speed, indicating that our algorithm can be effectively applied to UAV tracking scenarios.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Cross-Parallel Transformer: Parallel ViT for Medical Image Segmentation
    Wang, Dong
    Wang, Zixiang
    Chen, Ling
    Xiao, Hongfeng
    Yang, Bo
    Nanni, Loris
    SENSORS, 2023, 23 (23)
  • [2] CPNet: Cross-Parallel Network for Efficient Anomaly Detection
    Jin, Youngsaeng
    Hong, Jonghwan
    Han, David
    Ko, Hanseok
    2021 17TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2021), 2021,
  • [3] Cross-Parallel Transformer: Parallel ViT for Medical Image Segmentation (vol 23, 9488, 2023)
    Wang, Dong
    Wang, Zixiang
    Chen, Ling
    Xiao, Hongfeng
    Yang, Bo
    SENSORS, 2024, 24 (02)
  • [4] Efficient transformer tracking with adaptive attention
    Xiao, Dingkun
    Wei, Zhenzhong
    Zhang, Guangjun
    IET COMPUTER VISION, 2024,
  • [5] CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT
    Wang, Kai
    He, Bengbeng
    Zhu, Wei-Ping
    2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [6] An efficient object tracking based on multi-head cross-attention transformer
    Dai, Jiahai
    Li, Huimin
    Jiang, Shan
    Yang, Hongwei
    EXPERT SYSTEMS, 2025, 42 (02)
  • [7] ParaFormer: Parallel Attention Transformer for Efficient Feature Matching
    Lu, Xiaoyong
    Yan, Yaping
    Kang, Bin
    Du, Songlin
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1853 - 1860
  • [8] SCATT: Transformer tracking with symmetric cross-attention
    Zhang, Jianming
    Chen, Wentao
    Dai, Jiangxin
    Zhang, Jin
    APPLIED INTELLIGENCE, 2024, 54 (08) : 6069 - 6084
  • [9] Deblurring transformer tracking with conditional cross-attention
    Fuming Sun
    Tingting Zhao
    Bing Zhu
    Xu Jia
    Fasheng Wang
    Multimedia Systems, 2023, 29 : 1131 - 1144
  • [10] Deblurring transformer tracking with conditional cross-attention
    Sun, Fuming
    Zhao, Tingting
    Zhu, Bing
    Jia, Xu
    Wang, Fasheng
    MULTIMEDIA SYSTEMS, 2023, 29 (03) : 1131 - 1144