Cross-Parallel Attention and Efficient Match Transformer for Aerial Tracking

被引:1
|
作者
Deng, Anping [1 ,2 ]
Han, Guangliang [1 ]
Zhang, Zhongbo [3 ]
Chen, Dianbing [1 ]
Ma, Tianjiao [1 ]
Liu, Zhichao [1 ,2 ]
机构
[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys CIOMP, Changchun 130033, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 101408, Peoples R China
[3] Jilin Univ, Sch Math, Changchun 130012, Peoples R China
关键词
visual object tracking; UAV tracking; efficient match transformer; attention method;
D O I
10.3390/rs16060961
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Visual object tracking is a key technology that is used in unmanned aerial vehicles (UAVs) to achieve autonomous navigation. In recent years, with the rapid development of deep learning, tracking algorithms based on Siamese neural networks have received widespread attention. However, because of complex and diverse tracking scenarios, as well as limited computational resources, most existing tracking algorithms struggle to ensure real-time stable operation while improving tracking performance. Therefore, studying efficient and fast-tracking frameworks, and enhancing the ability of algorithms to respond to complex scenarios has become crucial. Therefore, this paper proposes a cross-parallel attention and efficient match transformer for aerial tracking (SiamEMT). Firstly, we carefully designed the cross-parallel attention mechanism to encode global feature information and to achieve cross-dimensional interaction and feature correlation aggregation via parallel branches, highlighting feature saliency and reducing global redundancy information, as well as improving the tracking algorithm's ability to distinguish between targets and backgrounds. Meanwhile, we implemented an efficient match transformer to achieve feature matching. This network utilizes parallel, lightweight, multi-head attention mechanisms to pass template information to the search region features, better matching the global similarity between the template and search regions, and improving the algorithm's ability to perceive target location and feature information. Experiments on multiple drone public benchmark tests verified the accuracy and robustness of the proposed tracker in drone tracking scenarios. In addition, on the embedded artificial intelligence (AI) platform AGX Xavier, our algorithm achieved real-time tracking speed, indicating that our algorithm can be effectively applied to UAV tracking scenarios.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] CAR-Transformer: Cross-Attention Reinforcement Transformer for Cross-Lingual Summarization
    Cai, Yuang
    Yuan, Yuyu
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17718 - 17726
  • [42] VTST: Efficient Visual Tracking With a Stereoscopic Transformer
    Gu, Fengwei
    Lu, Jun
    Cai, Chengtao
    Zhu, Qidan
    Ju, Zhaojie
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (03): : 2401 - 2416
  • [43] SiamHAS: Siamese Tracker with Hierarchical Attention Strategy for Aerial Tracking
    Liu, Faxue
    Liu, Jinghong
    Chen, Qiqi
    Wang, Xuan
    Liu, Chenglong
    MICROMACHINES, 2023, 14 (04)
  • [44] Siamese Transformer Network for Real-Time Aerial Object Tracking
    Wang, Haijun
    Zhang, Shengyan
    IEEE ACCESS, 2022, 10 : 105201 - 105213
  • [45] Siamese Adaptive Transformer Network for Real-Time Aerial Tracking
    Xing, Daitao
    Tsoukalas, Athanasios
    Evangeliou, Nikolaos
    Giakoumidis, Nikolaos
    Tzes, Anthony
    2022 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS), 2022, : 570 - 575
  • [46] Efficient Depth Fusion Transformer for Aerial Image Semantic Segmentation
    Yan, Li
    Huang, Jianming
    Xie, Hong
    Wei, Pengcheng
    Gao, Zhao
    REMOTE SENSING, 2022, 14 (05)
  • [47] EMTCAL: Efficient Multiscale Transformer and Cross-Level Attention Learning for Remote Sensing Scene Classification
    Tang, Xu
    Li, Mingteng
    Ma, Jingjing
    Zhang, Xiangrong
    Liu, Fang
    Jiao, Licheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [48] Consistent Weighted Correlation-Based Attention for Transformer Tracking
    Liu, Lei
    Fang, Genwen
    Wang, Jun
    Wang, Shuai
    Wang, Chun
    Shen, Longfeng
    Zhu, Kongfen
    Melo, Silas N.
    ELECTRONICS, 2023, 12 (22)
  • [49] Transformer tracking with multi-scale dual-attention
    Jun Wang
    Changwang Lai
    Wenshuang Zhang
    Yuanyun Wang
    Chenchen Meng
    Complex & Intelligent Systems, 2023, 9 : 5793 - 5806
  • [50] Transformer tracking with multi-scale dual-attention
    Wang, Jun
    Lai, Changwang
    Zhang, Wenshuang
    Wang, Yuanyun
    Meng, Chenchen
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (05) : 5793 - 5806