Coordinate-aware thermal infrared tracking via natural language modeling

被引:0
|
作者
Yan, Miao [1 ]
Zhang, Ping [1 ,2 ,3 ]
Zhang, Haofei [4 ]
Hao, Ruqian [1 ]
Liu, Juanxiu [1 ]
Wang, Xiaoyang [5 ]
Liu, Lin [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Optoelect Sci & Engn, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Shenzhen Inst Adv Study, Shenzhen 518000, Peoples R China
[3] Univ Elect Sci & Technol China, Yibin Inst, Yibin 644005, Peoples R China
[4] 208 Res Inst China Ordnance Ind, Beijing 102200, Peoples R China
[5] Univ Exeter, Dept Comp Sci, Streatham Campus, Exeter EX4 4RN, England
基金
中国国家自然科学基金;
关键词
Thermal infrared object tracking; Natural language model; Transformer tracking; OBJECT TRACKING;
D O I
10.1016/j.eswa.2024.126012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Thermal infrared (TIR) tracking is pivotal in computer vision tasks due to its all-weather imaging capability. Traditional tracking methods predominantly rely on hand-crafted features, and while deep learning has introduced correlation filtering techniques, these are often constrained by rudimentary correlation operations. Furthermore, transformer-based approaches tend to overlook temporal and coordinate information, which is critical for TIR tracking that lacks texture and color information. In this paper, to address these issues, we apply natural language modeling to TIR tracking and propose a coordinate-aware thermal infrared tracking model called NLMTrack, which enhances the utilization of coordinate and temporal information. NLMTrack applies an encoder that unifies feature extraction and feature fusion, which simplifies the TIR tracking pipeline. To address the challenge of low detail and low contrast in TIR images, on the one hand, we design a multi-level progressive fusion module that enhances the semantic representation and incorporates multi-scale features. On the other hand, the decoder combines the TIR features and the coordinate sequence features using a causal transformer to generate the target sequence step by step. Moreover, we explore an adaptive loss aimed at elevating tracking accuracy and a simple template update strategy to accommodate the target's appearance variations. Experiments show that NLMTrack achieves state-of-the-art performance on multiple benchmarks. The Code is publicly available at https://github.com/ELOESZHANG/NLMTrack.
引用
收藏
页数:12
相关论文
共 26 条
  • [1] Structural target-aware model for thermal infrared tracking
    Yuan, Di
    Shu, Xiu
    Liu, Qiao
    He, Zhenyu
    NEUROCOMPUTING, 2022, 491 : 44 - 56
  • [2] Global-Aware Siamese Network for Thermal Infrared Object Tracking
    Li Chang
    Yang Dedong
    Song Peng
    Guo Chang
    ACTA OPTICA SINICA, 2021, 41 (06)
  • [3] Scene-aware classifier and re-detector for thermal infrared tracking
    Ji, Qingbo
    Zhang, Pengfei
    Chen, Kuicheng
    Zhang, Lei
    Hou, Changbo
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 105
  • [4] Hierarchical spatial-aware Siamese network for thermal infrared object tracking
    Li, Xin
    Liu, Qiao
    Fan, Nana
    He, Zhenyu
    Wang, Hongzhi
    KNOWLEDGE-BASED SYSTEMS, 2019, 166 : 71 - 81
  • [5] Supporting Collaborative Modeling via Natural Language Processing
    Aydemir, Fatma Basak
    Dalpiaz, Fabiano
    CONCEPTUAL MODELING, ER 2020, 2020, 12400 : 223 - 238
  • [6] Thermal Aware Energy Efficient Gurumukhi Unicode Reader for Natural Language Processing
    Kaur, Amanpreet
    Singh, Gurpreet
    Pandey, Bishwajeet
    Fazili, Furqan
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 1524 - 1528
  • [7] Modeling the Thermal Infrared Emissivity of Snow and Ice Using Photon Tracking
    Xiong, Chuan
    Yuan, Liang
    Wang, Zhenzhan
    Shi, Jiancheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [8] Thermal infrared object tracking via unsupervised deep correlation filters
    Huang, Yueping
    He, Yujie
    Lu, Ruitao
    Li, Xiaofeng
    Yang, Xiaogang
    DIGITAL SIGNAL PROCESSING, 2022, 123
  • [9] Thermal Infrared Object Tracking via Siamese Convolutional Neural Networks
    Liu, Qiao
    Yuan, Di
    He, Zhenyu
    2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 1 - 6
  • [10] Reversible source-aware natural language watermarking via customized lexical substitution
    Jiang, Ziyu
    Wang, Hongxia
    Shi, Zhenhao
    Jiao, Run
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (02)