Reducing the Annotation Effort for Video Object Segmentation Datasets

被引:1
|
作者
Voigtlaender, Paul [1 ]
Luo, Lishu [2 ]
Yuan, Chun [2 ]
Jiang, Yong [2 ]
Leibe, Bastian [1 ]
机构
[1] Rhein Westfal TH Aachen, Aachen, Germany
[2] Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
关键词
D O I
10.1109/WACV48630.2021.00310
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For further progress in video object segmentation (VOS), larger, more diverse, and more challenging datasets will be necessary. However, densely labeling every frame with pixel masks does not scale to large datasets. We use a deep convolutional network to automatically create pseudolabels on a pixel level from much cheaper bounding box annotations and investigate how far such pseudo-labels can carry us for training state-of-the-art VOS approaches. A very encouraging result of our study is that adding a manually annotated mask in only a single video frame for each object is sufficient to generate pseudo-labels which can be used to train a VOS method to reach almost the same performance level as when training with fully segmented videos. We use this workflow to create pixel pseudolabels for the training set of the challenging tracking dataset TAO, and we manually annotate a subset of the validation set. Together, we obtain the new TAO-VOS benchmark, which we make publicly available at www.vision. rwth-aachen.de/page/taovos. While the performance of state-of-the-art methods on existing datasets starts to saturate, TAO-VOS remains very challenging for current algorithms and reveals their shortcomings.
引用
收藏
页码:3059 / 3068
页数:10
相关论文
共 50 条
  • [31] Application Of Segmentation Of Object Video In Robot
    Gong Heng
    PROCEEDINGS OF THE 2015 INTERNATIONAL INDUSTRIAL INFORMATICS AND COMPUTER ENGINEERING CONFERENCE, 2015, : 1411 - 1414
  • [32] Video object segmentation using SVMs
    Zhao, Y
    Li, HL
    Ahalt, SC
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING, 2003, : 333 - 337
  • [33] Compressed Domain Video Object Segmentation
    Porikli, Fatih
    Bashir, Faisal
    Sun, Huifang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2010, 20 (01) : 2 - 14
  • [34] Methods for Referring Video Object Segmentation
    Wei, Caiying
    Jia, Lei
    Computer Engineering and Applications, 61 (02): : 73 - 83
  • [35] Weakly Supervised Video Object Segmentation
    Wang, Yufei
    Hu, Yongjiang
    Liew, Alan Wee-Chung
    Wang, Junhu
    PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018, : 0315 - 0320
  • [36] Video object segmentation with a Potts model
    Zhao, Jieyu
    Wang, Xiaoquan
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2007, : 742 - +
  • [37] Research on Video Object Segmentation Algorithm
    Bo, Guan
    PROCEEDINGS OF THE 2015 4TH NATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING ( NCEECE 2015), 2016, 47 : 1399 - 1402
  • [38] Video Object Segmentation Based on Disparity
    Xingming, Ouyang
    Wei, Wei
    ADVANCES IN WEB AND NETWORK TECHNOLOGIES, AND INFORMATION MANAGEMENT, 2009, 5731 : 36 - 44
  • [39] Video Object Segmentation with Referring Expressions
    Khoreva, Anna
    Rohrbach, Anna
    Schiele, Bernt
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 7 - 12
  • [40] Guided Video Object Segmentation by Tracking
    Pelhan, Jer
    Kristan, Matej
    Lukezic, Alan
    Matas, Jiri
    Zajc, Luka Cehovin
    ELEKTROTEHNISKI VESTNIK, 2023, 90 (04): : 147 - 158