Fast and Accurate Visual Tracking with Group Convolution and Pixel-Level Correlation

被引:1
|
作者
Liu, Liduo [1 ,2 ]
Long, Yongji [1 ,2 ]
Li, Guoning [1 ]
Nie, Ting [1 ]
Zhang, Chengcheng [1 ,2 ]
He, Bin [1 ]
机构
[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys, Changchun 130033, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 17期
基金
美国国家科学基金会;
关键词
feature fusion; pixel-level correlation; Siamese network; attention mechanism; ROBUST;
D O I
10.3390/app13179746
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Visual object trackers based on Siamese networks perform well in visual object tracking (VOT); however, degradation of the tracking accuracy occurs when the target has fast motion, large-scale changes, and occlusion. In this study, in order to solve this problem and enhance the inference speed of the tracker, fast and accurate visual tracking with a group convolution and pixel-level correlation based on a Siamese network is proposed. The algorithm incorporates multi-layer feature information on the basis of Siamese networks. We designed a multi-scale feature aggregated channel attention block (MCA) and a global-to-local-information-fused spatial attention block (GSA), which enhance the feature extraction capability of the network. The use of a pixel-level mutual correlation operation in the network to match the search region with the template region refines the bounding box and reduces background interference. Comparing our work with the latest algorithms, the precision and success rates on the UAV123, OTB100, LaSOT, and GOT10K datasets were improved, and our tracker was able to run at 40FPS, with a better performance in complex scenes such as those with occlusion, illumination changes, and fast-motion situations.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Pixel-level robust digital image correlation
    Cofaru, Corneliu
    Philips, Wilfried
    Van Paepegem, Wim
    OPTICS EXPRESS, 2013, 21 (24): : 29979 - 29999
  • [2] PIXEL-LEVEL GUIDED FACE EDITING WITH FULLY CONVOLUTION NETWORKS
    Li, Zhenxi
    Zhang, Juyong
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 307 - 312
  • [3] Memory Network With Pixel-Level Spatio-Temporal Learning for Visual Object Tracking
    Zhou, Zechu
    Zhou, Xinyu
    Chen, Zhaoyu
    Guo, Pinxue
    Liu, Qian-Yu
    Zhang, Wenqiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 6897 - 6911
  • [4] Lane Detection and Pixel-Level Tracking for Autonomous Vehicles
    Jilin University, China
    SAE Techni. Paper., 1600, 2022
  • [5] Scale-pyramid dynamic atrous convolution for pixel-level labeling
    Li, Zhiqiang
    Jiang, Jie
    Chen, Xi
    Zhang, Min
    Wang, Yong
    Li, Qingli
    Qi, Honggang
    Liu, Min
    Laganiere, Robert
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 241
  • [6] Scale-pyramid dynamic atrous convolution for pixel-level labeling
    Li, Zhiqiang
    Jiang, Jie
    Chen, Xi
    Zhang, Min
    Wang, Yong
    Li, Qingli
    Qi, Honggang
    Liu, Min
    Laganière, Robert
    Expert Systems with Applications, 2024, 241
  • [7] Edge-aware object pixel-level representation tracking
    Jing, Peiguang
    Huang, Zijian
    Liu, Jing
    Wang, Yating
    Yu, Jiexiao
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 90
  • [8] SiamLight: lightweight networks for object tracking via attention mechanisms and pixel-level cross-correlation
    Lin, Yu-e
    Li, Mengfan
    Liang, Xingzhu
    Xia, Chenxing
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2023, 20 (02)
  • [9] SiamLight: lightweight networks for object tracking via attention mechanisms and pixel-level cross-correlation
    Yu-e Lin
    Mengfan Li
    Xingzhu Liang
    Chenxing Xia
    Journal of Real-Time Image Processing, 2023, 20
  • [10] A hybrid convolutional architecture for accurate image manipulation localization at the pixel-level
    Yixuan Zhang
    Jiguang Zhang
    Shibiao Xu
    Multimedia Tools and Applications, 2021, 80 : 23377 - 23392