Fast and Accurate Visual Tracking with Group Convolution and Pixel-Level Correlation

被引:1
|
作者
Liu, Liduo [1 ,2 ]
Long, Yongji [1 ,2 ]
Li, Guoning [1 ]
Nie, Ting [1 ]
Zhang, Chengcheng [1 ,2 ]
He, Bin [1 ]
机构
[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys, Changchun 130033, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 17期
基金
美国国家科学基金会;
关键词
feature fusion; pixel-level correlation; Siamese network; attention mechanism; ROBUST;
D O I
10.3390/app13179746
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Visual object trackers based on Siamese networks perform well in visual object tracking (VOT); however, degradation of the tracking accuracy occurs when the target has fast motion, large-scale changes, and occlusion. In this study, in order to solve this problem and enhance the inference speed of the tracker, fast and accurate visual tracking with a group convolution and pixel-level correlation based on a Siamese network is proposed. The algorithm incorporates multi-layer feature information on the basis of Siamese networks. We designed a multi-scale feature aggregated channel attention block (MCA) and a global-to-local-information-fused spatial attention block (GSA), which enhance the feature extraction capability of the network. The use of a pixel-level mutual correlation operation in the network to match the search region with the template region refines the bounding box and reduces background interference. Comparing our work with the latest algorithms, the precision and success rates on the UAV123, OTB100, LaSOT, and GOT10K datasets were improved, and our tracker was able to run at 40FPS, with a better performance in complex scenes such as those with occlusion, illumination changes, and fast-motion situations.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Beyond Correlation Filters: Learning Continuous Convolution Operators for Visual Tracking
    Danelljan, Martin
    Robinson, Andreas
    Khan, Fahad Shahbaz
    Felsberg, Michael
    COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 472 - 488
  • [42] Development of a novel pixel-level signal processing chain for fast readout 3D integrated CMOS pixel sensors
    Fu, Y.
    Torheim, O.
    Hu-Guo, C.
    Degerli, Y.
    Hu, Y.
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2013, 704 : 98 - 103
  • [43] Feature-level and pixel-level fusion routines when coupled to infrared night-vision tracking scheme
    Zhou, Yi
    Mayyas, Abedalroof
    Qattawi, Ala
    Omar, Mohammed
    INFRARED PHYSICS & TECHNOLOGY, 2010, 53 (01) : 43 - 49
  • [44] Fast Visual Tracking With Robustifying Kernelized Correlation Filters
    Liu, Qianbo
    Hu, Guoqing
    Islam, Md Mojahidul
    IEEE ACCESS, 2018, 6 : 43302 - 43314
  • [45] Accurate Scale Estimation for Correlation Filter based Visual Tracking
    Zhai You
    Han Dong
    Xu Baohua
    Guo Xiwei
    ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
  • [46] RealPixVSR: Pixel-Level Visual Representation Informed Super-Resolution of Real-World Videos
    Park, Tony Nokap
    Jeon, Yunho
    Na, Taeyoung
    2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 412 - 421
  • [47] Pixel-level bridge crack detection using a deep fusion about recurrent residual convolution and context encoder network
    Li, Gang
    Li, Xiyuan
    Zhou, Jian
    Liu, Dezhi
    Ren, Wei
    MEASUREMENT, 2021, 176
  • [48] Multiscale Pixel-Level and Superpixel-Level Method for Hyperspectral Image Classification: Adaptive Attention and Parallel Multi-Hop Graph Convolution
    Yin, Junru
    Liu, Xuan
    Hou, Ruixia
    Chen, Qiqiang
    Huang, Wei
    Li, Aiguang
    Wang, Peng
    REMOTE SENSING, 2023, 15 (17)
  • [49] A hybrid convolutional architecture for accurate image manipulation localization at the pixel-level (vol 80, pg 23377, 2021)
    Zhang, Yixuan
    Zhang, Jiguang
    Xu, Shibiao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 34163 - 34163
  • [50] Field-road classification for GNSS recordings of agricultural machinery using pixel-level visual features
    Chen, Ying
    Quan, Lei
    Zhang, Xiaoqiang
    Zhou, Kun
    Wu, Caicong
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 210