Target-Aware Transformer for Satellite Video Object Tracking

被引:4
|
作者
Lai, Pujian [1 ]
Zhang, Meili [1 ]
Cheng, Gong [1 ]
Li, Shengyang [2 ,3 ]
Huang, Xiankai [4 ]
Han, Junwei [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] Chinese Acad Sci, Ctr Space Utilizat, Key Lab Space Utilizat Technol & Engn, Beijing 100094, Peoples R China
[3] Univ Chinese Acad Sci, Sch Aeronaut & Astronaut, Beijing 100049, Peoples R China
[4] Beijing Technol & Business Univ, Business Sch, Beijing 100048, Peoples R China
基金
中国国家自然科学基金;
关键词
Bi-direction propagation and fusion (Bi-PF); satellite video object tracking; target-aware enhancement (TAE); CORRELATION FILTER;
D O I
10.1109/TGRS.2023.3339658
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Recent years have witnessed the astonishing development of transformer-based paradigm in single object tracking (SOT) in generic videos. However, due to the fact that the targets of interest in satellite videos are small in size and weak in visual appearance, the advancements of transformer-based paradigm in satellite video object tracking are impeded. To alleviate this issue, a novel transformer-based recipe is proposed, which consists of a bi-direction propagation and fusion (Bi-PF) strategy and a target-aware enhancement (TAE) module. Concretely, we first adopt the Bi-PF strategy to make full use of multiscale information to generate discriminative representations of tracking targets. Then, the TAE module is employed to decouple an object query into content-aware embedding and spatial-aware embedding and produce a target prototype to help get high-quality content-aware embedding. It is worth mentioning that, different from the previous methods in satellite video tracking most of which evaluate their performance using only several videos, we conduct extensive experiments on the SatSOT dataset which consists of 105 videos. In particular, the proposed method achieves the success score of 45.6% and the precision score of 57.6%, surpassing the baseline method by 5.0% and 9.5%, respectively.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [31] Target-Aware Camera Placement for Large-Scale Video Surveillance
    Wu, Hongxin
    Zeng, Qinghou
    Guo, Chen
    Zhao, Tiesong
    Wen Chen, Chang
    IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (12) : 13338 - 13348
  • [32] SiamCA: Siamese visual tracking with customized anchor and target-aware interaction
    Pan, Shuqi
    Zhang, Canlong
    Li, Zhixin
    Hu, Liaojie
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [33] Target-Aware Feature Bottleneck for Real-Time Visual Tracking
    Choi, Janghoon
    APPLIED SCIENCES-BASEL, 2023, 13 (18):
  • [34] Lightweight Target-Aware Attention Learning Network-Based Target Tracking Method
    Zhao, Yanchun
    Zhang, Jiapeng
    Duan, Rui
    Li, Fusheng
    Zhang, Huanlong
    MATHEMATICS, 2022, 10 (13)
  • [35] High-Performance Discriminative Tracking with Target-Aware Feature Embeddings
    Yu, Bin
    Tang, Ming
    Zheng, Linyu
    Zhu, Guibo
    Wang, Jinqiao
    Lu, Hanqing
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 3 - 15
  • [36] Target-Aware Deep Feature Compression for Power Intelligent Inspection Tracking
    Jiang, Wei
    Guo, Zhimin
    Zhang, Huanlong
    Cheng, Liyun
    Tian, Yangyang
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2022, 2022
  • [37] TARGET-AWARE AUTO-AUGMENTATION FOR UNSUPERVISED DOMAIN ADAPTIVE OBJECT DETECTION
    Li, Zhaoyang
    Zhao, Long
    Chen, Weijie
    Yang, Shicai
    Xie, Di
    Pu, Shiliang
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3848 - 3852
  • [38] Learning Target-Aware Vision Transformers for Real-Time UAV Tracking
    Li, Shuiwang
    Yang, Xiangyang
    Wang, Xucheng
    Zeng, Dan
    Ye, Hengzhou
    Zhao, Qijun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [39] Target-Aware Molecular Graph Generation
    Tan, Cheng
    Gao, Zhangyang
    Li, Stan Z.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI, 2023, 14174 : 410 - 427
  • [40] ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking
    Kou, Yutong
    Gao, Jin
    Li, Bing
    Wang, Gang
    Hu, Weiming
    Wang, Yizheng
    Li, Liang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,