Fast and Accurate Visual Tracking with Group Convolution and Pixel-Level Correlation

被引:1
|
作者
Liu, Liduo [1 ,2 ]
Long, Yongji [1 ,2 ]
Li, Guoning [1 ]
Nie, Ting [1 ]
Zhang, Chengcheng [1 ,2 ]
He, Bin [1 ]
机构
[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys, Changchun 130033, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 17期
基金
美国国家科学基金会;
关键词
feature fusion; pixel-level correlation; Siamese network; attention mechanism; ROBUST;
D O I
10.3390/app13179746
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Visual object trackers based on Siamese networks perform well in visual object tracking (VOT); however, degradation of the tracking accuracy occurs when the target has fast motion, large-scale changes, and occlusion. In this study, in order to solve this problem and enhance the inference speed of the tracker, fast and accurate visual tracking with a group convolution and pixel-level correlation based on a Siamese network is proposed. The algorithm incorporates multi-layer feature information on the basis of Siamese networks. We designed a multi-scale feature aggregated channel attention block (MCA) and a global-to-local-information-fused spatial attention block (GSA), which enhance the feature extraction capability of the network. The use of a pixel-level mutual correlation operation in the network to match the search region with the template region refines the bounding box and reduces background interference. Comparing our work with the latest algorithms, the precision and success rates on the UAV123, OTB100, LaSOT, and GOT10K datasets were improved, and our tracker was able to run at 40FPS, with a better performance in complex scenes such as those with occlusion, illumination changes, and fast-motion situations.
引用
收藏
页数:16
相关论文
共 50 条
  • [11] A hybrid convolutional architecture for accurate image manipulation localization at the pixel-level
    Zhang, Yixuan
    Zhang, Jiguang
    Xu, Shibiao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (15) : 23377 - 23392
  • [12] Adaptive Video Text Tracking Based on Pixel-level Feature Extraction
    School of Computer Science, Hubei University of Technology, Hubei, Wuhan
    430000, China
    不详
    430223, China
    J. Eng. Sci. Technol. Rev., 2024, 5 (55-61):
  • [13] Enhancing pixel-level crack segmentation with visual mamba and convolutional networks
    Han, Chengjia
    Yang, Handuo
    Yang, Yaowen
    AUTOMATION IN CONSTRUCTION, 2024, 168
  • [14] Pixel-Level Hardware Strategy for Large-Scale Convolution Calculation in Neuromorphic Devices
    Zhang, Xianghong
    Liu, Di
    Wu, Jianxin
    Cheng, Enping
    Qin, Congyao
    Gao, Changsong
    Shan, Liuting
    Zou, Yi
    Hu, Yuanyuan
    Guo, Tailiang
    Chen, Huipeng
    ADVANCED FUNCTIONAL MATERIALS, 2024,
  • [15] An Analog Sub-Miliwatt CMOS Image Sensor With Pixel-Level Convolution Processing
    Jendernalik, W.
    Blakiewicz, G.
    Jakusz, J.
    Szczepanski, S.
    Piotrowski, R.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2013, 60 (02) : 279 - 289
  • [16] Correction to: A hybrid convolutional architecture for accurate image manipulation localization at the pixel-level
    Yixuan Zhang
    Jiguang Zhang
    Shibiao Xu
    Multimedia Tools and Applications, 2022, 81 : 34163 - 34163
  • [17] Pixel-Level Segmentation for Multiobject Tracking Using Mask RCNN-FPN
    Swadi, Shivani
    Nissimagoudar, Prabha C.
    Iyer, Nalini C.
    SOFT COMPUTING AND ITS ENGINEERING APPLICATIONS, PT 1, ICSOFTCOMP 2023, 2024, 2030 : 16 - 29
  • [18] Equivalence of Correlation Filter and Convolution Filter in Visual Tracking
    Li, Shuiwang
    Zhao, Qijun
    Feng, Ziliang
    Lu, Li
    IMAGE AND GRAPHICS (ICIG 2021), PT III, 2021, 12890 : 623 - 634
  • [19] Fast and Massive Pixel-Level Morphology Detection by Imaging Processing for Inkjet Printing
    Zhang, Haoyang
    Xu, Da
    Ke, Shanrong
    Huang, Meicong
    Chai, Yaling
    Lin, Yi
    Guo, Ziquan
    Chen, Zhong
    MICROMACHINES, 2024, 15 (05)
  • [20] Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning
    Xie, Zhenda
    Lin, Yutong
    Zhang, Zheng
    Cao, Yue
    Lin, Stephen
    Hu, Han
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16679 - 16688