Exploiting weak mask representation with convolutional neural networks for accurate object tracking

被引:0
|
作者
Jianglei Huang
Wengang Zhou
Qi Tian
Houqiang Li
机构
[1] University of Science and Technology of China,CAS Key Laboratory of Technology in GIPAS, EEIS Department
[2] University of Texas at San Antonio,undefined
来源
关键词
Object tracking; Deep learning; Mask representation; Data augmentation; Bounding box approximation;
D O I
暂无
中图分类号
学科分类号
摘要
Recent years have witnessed the popularity of Convolutional Neural Networks (CNN) in a variety of computer vision tasks, including video object tracking. Existing object tracking methods with CNN employ either a scalar score or a confidence map as CNN’s output, which suffer the infeasibility of estimating the object’s accurate scale and rotation angle. Specifically, as with other traditional methods, they assume the targets’ scale aspect ratio and rotation angle are fixed. To address the limitation, we propose to take a binary mask as the output of CNN for tracking. To this end, we adapt a semantic segmentation model by online fine-tuning with augmented samples in the initial frame to uncover the target in the following frames. During the generation of training samples, we employ a Crop and Paste method to better utilize context information, add a random value to lightness component to mimic the illumination change, and take a Gaussian filtering approach to mimic the blur. During the tracking, due to the limitation of CNN’s receptive field size and spatial resolution, the network may fail to identify the target if the estimated bounding box is considerably incorrect. Therefore we propose a bounding box approximation method by considering temporal consistency. Excluding the initial training cost, our tracker runs at 41 FPS on a single GeForce 1080Ti GPU. Evaluated on benchmarks including OTB-2015, VOT-2016 and TempleColor, it achieves comparable results with non real-time top trackers and state-of-the-art performance among those real-time ones.
引用
收藏
页码:20961 / 20985
页数:24
相关论文
共 50 条
  • [31] Face Mask Detector Using Convolutional Neural Networks
    Mukherjee, Rajendrani
    Panday, Akash Narain
    Nandy, Sanjukta
    Ghosh, Sushmit
    Bhattacharya, Supratim
    Dey, Apurba
    Choudhury, Saumadip Dey
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473 : 377 - 384
  • [32] GATE CONNECTED CONVOLUTIONAL NEURAL NETWORK FOR OBJECT TRACKING
    Kokul, T.
    Fookes, C.
    Sridharan, S.
    Ramanan, A.
    Pinidiyaarachchi, U. A. J.
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 2602 - 2606
  • [33] Parallel Convolutional Neural Networks for Object Detection
    Olugboja, Adedeji
    Wang, Zenghui
    Sun, Yanxia
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2021, 12 (04) : 279 - 286
  • [34] Object Detection Using Convolutional Neural Networks
    Galvez, Reagan L.
    Bandala, Argel A.
    Dadios, Elmer P.
    Vicerra, Ryan Rhay P.
    Maningo, Jose Martin Z.
    PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018, : 2023 - 2027
  • [35] Detecting Object Affordances with Convolutional Neural Networks
    Anh Nguyen
    Kanoulas, Dimitrios
    Caldwell, Darwin G.
    Tsagarakis, Nikos G.
    2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 2765 - 2770
  • [36] Cascaded Convolutional Neural Networks for Object Detection
    Guo, Yajing
    Guo, Xiaoqiang
    Jiang, Zhuqing
    Zhou, Yun
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [37] SELF-REPRESENTATION CONVOLUTIONAL NEURAL NETWORKS
    Gao, Hongchao
    Wang, Xi
    Li, Yujia
    Han, Jizhong
    Hu, Songlin
    Li, Ruixuan
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1672 - 1677
  • [38] Problems of representation of electrocardiograms in convolutional neural networks
    Sereda, Iana
    Alekseev, Sergey
    Koneva, Aleksandra
    Khorkin, Alexey
    Osipov, Grigory
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [39] Representation Visualization of Convolutional Neural Networks: A Survey
    Si N.-W.
    Zhang W.-L.
    Qu D.
    Luo X.-Y.
    Chang H.-Y.
    Niu T.
    Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (08): : 1890 - 1892
  • [40] Lateral Representation Learning in Convolutional Neural Networks
    Ballester, Pedro
    Correa, Ulisses Brisolara
    Araujo, Ricardo Matsumura
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,