Convolutional Neural Network with Structural Input for Visual Object Tracking

被引:6
|
作者
Fiaz, Mustansar [1 ]
Mahmood, Arif [2 ]
Jung, Soon Ki [1 ]
机构
[1] Kyungpook Natl Univ, Sch Comp Sci & Engn, Daegu, South Korea
[2] Informat Technol Univ, Dept Comp Sci, Lahore, Pakistan
关键词
Deep learning; convolutional neural network; visual tracking; machine learning;
D O I
10.1145/3297280.3297416
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Numerous deep learning approaches have been applied to visual object tracking owing to their capabilities to leverage huge training data for performance improvement. Most of these approaches have limitations with regard to learning target specific information rich features and therefore observe reduced accuracy in the presence of different challenges such as occlusion, scale variations, rotation and clutter. We proposed a deep neural network that takes input in the form of two stacked patches and regresses both the similarity and the dis-similarity scores in single evaluation. Image patches are concatenated depth-wise and fed to a six channel input of the network. The proposed network is generic and exploits the structural differences between the two input patches to obtain more accurate similarity and dissimilarity scores. Online learning is enforced via short-term and long-term updates to improve the tracking performance. Extensive experimental evaluations have been performed on OTB2015 and TempleColor128 benchmark datasets. Comparisons with state-of-the-art methods indicate that the proposed framework has achieved better tracking performance. The proposed tracking framework has obtained improved accuracy in different challenges including occlusion, background clutter, in-plane rotation and scale variations.
引用
收藏
页码:1345 / 1352
页数:8
相关论文
共 50 条
  • [41] Online object tracking via motion-guided convolutional neural network (MGNet)
    Gan, Weihao
    Lee, Ming-Sui
    Wu, Chi-hao
    Kuo, C. -C.
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 53 : 180 - 191
  • [42] Occlusion-related graph convolutional neural network for multi-object tracking
    Zhang, Yubo
    Zheng, Liying
    Huang, Qingming
    [J]. Image and Vision Computing, 2024, 152
  • [43] VISUAL OBJECT TRACKING VIA GRAPH CONVOLUTIONAL REPRESENTATION
    Tu, Zhengzheng
    Zhou, Ajian
    Jiang, Bo
    Luo, Bin
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 234 - 239
  • [44] RETRACTED: Visual Object Tracking Based on Deep Neural Network (Retracted Article)
    Diao, Zhifeng
    Sun, Fanglei
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [45] Object detection of transmission line visual images based on deep convolutional neural network
    Zhou Zhu-bo
    Gao Jiao
    Zhang Wei
    Wang Xiao-jing
    Zhang Jiang
    [J]. CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2018, 33 (04) : 317 - 325
  • [46] Enhanced Online Convolutional Neural Networks for Object Tracking
    Zhang, Dengzhuo
    Gao, Yun
    Zhou, Hao
    Li, Tianwen
    [J]. TENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2017), 2018, 10696
  • [47] Object Tracking and Detection Using Convolutional Neural Networks
    Sujatha, C. N.
    Sahithi, P.
    Hamsini, R.
    Haripriya, M.
    [J]. PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND COMMUNICATION SYSTEMS, ICACECS 2021, 2022, : 97 - 107
  • [48] Recurrent Convolutional Neural Network for Object Recognition
    Liang, Ming
    Hu, Xiaolin
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3367 - 3375
  • [49] Branch-Activated Multi-Domain Convolutional Neural Network for Visual Tracking
    陈一民
    陆蓉蓉
    邹一波
    张燕辉
    [J]. Journal of Shanghai Jiaotong University(Science), 2018, 23 (03) : 360 - 367
  • [50] Branch-Activated Multi-Domain Convolutional Neural Network for Visual Tracking
    Chen Y.
    Lu R.
    Zou Y.
    Zhang Y.
    [J]. Journal of Shanghai Jiaotong University (Science), 2018, 23 (3) : 360 - 367