Enhancing Single Object Tracking With a Hybrid Approach: Temporal Convolutional Networks, Attention Mechanisms, and Spatial–Temporal Memory

被引:0
|
作者
Cheewaprakobkit, Pimpa [1 ,2 ]
Lin, Chih-Yang [3 ]
Shih, Timothy K. [1 ]
Enkhbat, Avirmed [1 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan 32001, Taiwan
[2] Asia Pacific Int Univ, Dept Informat Technol, Sara Buri 18180, Thailand
[3] Natl Cent Univ, Dept Mech Engn, Taoyuan 32001, Taiwan
关键词
Temporal convolutional networks; attention mechanism; spatial-temporal memory; single object tracking; TEMPLATE;
D O I
10.1109/ACCESS.2023.3330644
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural network-based tracking tasks have experienced significant advancements in recent years. However, these networks continue to face challenges in effectively adapting to appearance changes in both target and background, as well as linking objects after extended periods. The primary challenge in tracking lies in the frequent changes in a target's appearance throughout the tracking process, which can potentially reduce tracker robustness when faced with issues such as aspect ratio changes, occlusions, scale variations, and confusion from similar objects. To address this challenge, we propose a tracking architecture that combines a temporal convolutional network (TCN) and attention mechanism with spatial-temporal memory. The TCN component empowers the model to capture temporal dependencies, while the attention mechanism reduces computational complexity by focusing on crucial regions based on context. We leverage the target's historical information stored in the spatial-temporal memory network to guide the tracker in better adapting to target deformation. Our model attains a 67.5% average overlap (AO) on the GOT-10K dataset, a 72.1% success score (AUC) on OTB2015, a 65.8% success score (AUC) on UAV123, and achieves 59.0% accuracy on the VOT2018 dataset. These outcomes demonstrate the high effectiveness of our proposed tracker in tracking a single object.
引用
收藏
页码:139211 / 139222
页数:12
相关论文
共 50 条
  • [31] Spatial-temporal convolutional attention for discovering and characterizing functional brain networks in task fMRI
    Liu, Yiheng
    Ge, Enjie
    Kang, Zili
    Qiang, Ning
    Liu, Tianming
    Ge, Bao
    NEUROIMAGE, 2024, 287
  • [32] Information Propagation Prediction Based on Spatial-Temporal Attention and Heterogeneous Graph Convolutional Networks
    Liu, Xiaoyang
    Miao, Chenxiang
    Fiumara, Giacomo
    De Meo, Pasquale
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) : 945 - 958
  • [33] Enhancing the accuracy of a human emotion recognition method using spatial temporal graph convolutional networks
    Ming-Fong Tsai
    Chiung-Hung Chen
    Multimedia Tools and Applications, 2023, 82 : 11285 - 11303
  • [34] Enhancing the accuracy of a human emotion recognition method using spatial temporal graph convolutional networks
    Tsai, Ming-Fong
    Chen, Chiung-Hung
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (08) : 11285 - 11303
  • [35] A method of real-temporal object tracking combined the temporal information and spatial information
    Xiaoshuo Jia
    Zhihui Li
    Kangshun Li
    Shangyou Zeng
    Soft Computing, 2022, 26 : 8689 - 8698
  • [36] A method of real-temporal object tracking combined the temporal information and spatial information
    Jia, Xiaoshuo
    Li, Zhihui
    Li, Kangshun
    Zeng, Shangyou
    SOFT COMPUTING, 2022, 26 (17) : 8689 - 8698
  • [37] A Twofold Convolutional Regression Tracking Network With Temporal and Spatial Mechanism
    Li, Xiaojing
    Huang, Lei
    Wei, Zhiqiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 1537 - 1551
  • [38] Temporal Convolutional Attention Neural Networks for Time Series Forecasting
    Lin, Yang
    Koprinska, Irena
    Rana, Mashud
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [39] Flood Water Depth Prediction with Convolutional Temporal Attention Networks
    Chaudhary, Priyanka
    Leitao, Joao P.
    Schindler, Konrad
    Wegner, Jan Dirk
    WATER, 2024, 16 (09)
  • [40] Spatial-Semantic and Temporal Attention Mechanism-Based Online Multi-Object Tracking
    Meng, Fanjie
    Wang, Xinqing
    Wang, Dong
    Shao, Faming
    Fu, Lei
    SENSORS, 2020, 20 (06)