STFTrack: Spatio-Temporal-Focused Siamese Network for Infrared UAV Tracking

被引:2
|
作者
Xie, Xueli [1 ]
Xi, Jianxiang [1 ]
Yang, Xiaogang [1 ]
Lu, Ruitao [1 ]
Xia, Wenxin [1 ]
机构
[1] Rocket Force Univ Engn, Dept Automat, Xian 710025, Peoples R China
关键词
Anti-UAV; infrared UAV tracking; spatio-temporal focusing; adaptive search region;
D O I
10.3390/drones7050296
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
The rapid popularity of UAVs has encouraged the development of Anti-UAV technology. Infrared-detector-based visual tracking for UAVs provides an encouraging solution for Anti-UAVs. However, it still faces the problem of tracking instability caused by environmental thermal crossover and similar distractors. To address these issues, we propose a spatio-temporal-focused Siamese network for infrared UAV tracking, called STFTrack. This method employs a two-level target focusing strategy from global to local. First, a feature pyramid-based Siamese backbone is constructed to enhance the feature expression of infrared UAVs through cross-scale feature fusion. By combining template and motion features, we guide prior anchor boxes towards the suspicious region to enable adaptive search region selection, thus effectively suppressing background interference and generating high-quality candidates. Furthermore, we propose an instance-discriminative RCNN based on metric learning to focus on the target UAV among candidates. By measuring calculating the feature distance between the candidates and the template, it assists in discriminating the optimal target from the candidates, thus improving the discrimination of the proposed method to infrared UAV. Extensive experiments on the Anti-UAV dataset demonstrate that the proposed method achieves outstanding performance for infrared tracking, with 91.2% precision, 66.6% success rate, and 67.7% average overlap accuracy, and it exceeded the baseline algorithm by 2.3%, 2.7%, and 3.5%, respectively. The attribute-based evaluation demonstrates that the proposed method achieves robust tracking effects on challenging scenes such as fast motion, thermal crossover, and similar distractors. Evaluation on the LSOTB-TIR dataset shows that the proposed method reaches a precision of 77.2% and a success rate of 63.4%, outperforming other advanced trackers.
引用
下载
收藏
页数:22
相关论文
共 50 条
  • [31] Visual tracking for UAV using adaptive spatio-temporal regularized correlation filters
    Libin Xu
    Mingliang Gao
    Qilei Li
    Guofeng Zou
    Jinfeng Pan
    Jun Jiang
    Applied Intelligence, 2022, 52 : 7566 - 7581
  • [32] Adaptive Image Dehazing and Object Tracking in UAV Videos Based on the Template Updating Siamese Network
    Sun, Lifan
    Chang, Jiashun
    Zhang, Jinjin
    Fan, Bo
    He, Zishu
    IEEE SENSORS JOURNAL, 2023, 23 (11) : 12320 - 12333
  • [33] Siamese Transformer Network: Building an autonomous real-time target tracking system for UAV
    Sun, Xiaolou
    Wang, Qi
    Xie, Fei
    Quan, Zhibin
    Wang, Wei
    Wang, Hao
    Yao, Yuncong
    Yang, Wankou
    Suzuki, Satoshi
    JOURNAL OF SYSTEMS ARCHITECTURE, 2022, 130
  • [34] SiamHSFT: A Siamese Network-Based Tracker with Hierarchical Sparse Fusion and Transformer for UAV Tracking
    Hu, Xiuhua
    Zhao, Jing
    Hui, Yan
    Li, Shuang
    You, Shijie
    SENSORS, 2023, 23 (21)
  • [35] Multi-level prediction Siamese network for real-time UAV visual tracking
    Zhu, Mu
    Zhang, Hui
    Zhang, Jing
    Zhuo, Li
    IMAGE AND VISION COMPUTING, 2020, 103 (103)
  • [36] Siamese Dense Pixel-Level Fusion Network for Real-Time UAV Tracking
    Huang, Zhenyu
    Li, Gun
    Sun, Xudong
    Chen, Yong
    Sun, Jie
    Ni, Zhangsong
    Yang, Yang
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 76 (03): : 3219 - 3238
  • [37] SiamAPN plus plus : Siamese Attentional Aggregation Network for Real-Time UAV Tracking
    Cao, Ziang
    Fu, Changhong
    Ye, Junjie
    Li, Bowen
    Li, Yiming
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3086 - 3092
  • [38] SiamMAST: Siamese motion-aware spatio-temporal network for video action recognition
    Lu, Xuemin
    Quan, Wei
    Marek, Reformat
    Zhao, Haiquan
    Chen, Jim X. X.
    VISUAL COMPUTER, 2024, 40 (05): : 3163 - 3181
  • [39] SiamMAST: Siamese motion-aware spatio-temporal network for video action recognition
    Xuemin Lu
    Wei Quan
    Reformat Marek
    Haiquan Zhao
    Jim X. Chen
    The Visual Computer, 2024, 40 : 3163 - 3181
  • [40] UAV network intrusion detection method based on spatio-temporal graph convolutional network
    Chen Z.
    Lyu N.
    Chen K.
    Zhang Y.
    Gao W.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2021, 47 (05): : 1068 - 1076