Online object tracking via motion-guided convolutional neural network (MGNet)

被引:11
|
作者
Gan, Weihao [1 ]
Lee, Ming-Sui [2 ]
Wu, Chi-hao [1 ]
Kuo, C. -C. [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] Natl Taiwan Univ, Taipei, Taiwan
关键词
Object tracking; Online tracking; Convolutional neural network; Optical flow; Multi-domain learning; ROBUST VISUAL TRACKING;
D O I
10.1016/j.jvcir.2018.03.016
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Tracking-by-detection (TBD) is widely used in visual object tracking. However, many TBD-based methods ignore the strong motion correlation between current and previous frames. In this work, a motion-guided convolutional neural network (MGNet) solution to online object tracking is proposed. The MGNet tracker is built upon the multi-domain convolutional neural network with two innovations: (1) a motion-guided candidate selection (MCS) scheme based on a dynamic prediction model is proposed to accurately and efficiently generate the candidate regions and (2) the spatial RGB and temporal optical flow are combined as inputs and processed in an unified end-to-end trained network, rather than a two-branch processing network. We compare the performance of the MGNet, the MDNet and several state-of-the-art online object trackers on the OTB and the VOT benchmark datasets, and demonstrate that the temporal correlation between any two consecutive frames in videos can be more effectively captured by the MGNet via extensive performance evaluation.
引用
收藏
页码:180 / 191
页数:12
相关论文
共 50 条
  • [1] CNNTracker: Online discriminative object tracking via deep convolutional neural network
    Chen, Yan
    Yang, Xiangnan
    Zhong, Bineng
    Pan, Shengnan
    Chen, Duansheng
    Zhang, Huizhen
    APPLIED SOFT COMPUTING, 2016, 38 : 1088 - 1098
  • [2] Motion-Guided Cascaded Refinement Network for Video Object Segmentation
    Hu, Ping
    Wang, Gang
    Kong, Xiangfei
    Kuen, Jason
    Tan, Yap-Peng
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1400 - 1409
  • [3] Motion-Guided Graph Convolutional Network for Human Action Recognition
    Li, Jingjing
    Huang, Zhangjin
    Zou, Lu
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2024, 36 (07): : 1077 - 1086
  • [4] Motion-Guided Cascaded Refinement Network for Video Object Segmentation
    Hu, Ping
    Wang, Gang
    Kong, Xiangfei
    Kuen, Jason
    Tan, Yap-Peng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (08) : 1957 - 1967
  • [5] Object Discovery from Motion-Guided Tokens
    Bao, Zhipeng
    Tokmakov, Pavel
    Wang, Yu-Xiong
    Gaidon, Adrien
    Hebert, Martial
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22972 - 22981
  • [6] MgNet: A unified framework of multigrid and convolutional neural network
    Juncai He
    Jinchao Xu
    Science China Mathematics, 2019, 62 (07) : 1331 - 1354
  • [7] MgNet: A unified framework of multigrid and convolutional neural network
    Juncai He
    Jinchao Xu
    Science China Mathematics, 2019, 62 : 1331 - 1354
  • [8] Enhanced Online Convolutional Neural Networks for Object Tracking
    Zhang, Dengzhuo
    Gao, Yun
    Zhou, Hao
    Li, Tianwen
    TENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2017), 2018, 10696
  • [9] MgNet: A unified framework of multigrid and convolutional neural network
    He, Juncai
    Xu, Jinchao
    SCIENCE CHINA-MATHEMATICS, 2019, 62 (07) : 1331 - 1354
  • [10] Motion-guided and occlusion-aware multi-object tracking with hierarchical matching
    Zheng, Yujin
    Qi, Hang
    Li, Lei
    Li, Shan
    Huang, Yan
    He, Chu
    Wang, Dingwen
    PATTERN RECOGNITION, 2024, 151