Reducing human efforts in video segmentation annotation with reinforcement learning

被引:8
|
作者
Varga, Viktor [1 ]
Lorincz, Andras [1 ]
机构
[1] Eotvos Lorand Univ, Fac Informat, Pazmany Peter Setany 1-A, H-1117 Budapest, Hungary
关键词
ENERGY MINIMIZATION;
D O I
10.1016/j.neucom.2020.02.127
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Manual annotation of video segmentation datasets requires an immense amount of human effort, thus, reduction of human annotation costs is an active topic of research. While many papers deal with the propagation of masks through frames of a video, only a few results attempt to optimize annotation task selection. In this paper we present a deep learning based solution to the latter problem and train it using Reinforcement Learning. Our approach utilizes a modified version of the Dueling Deep Q-Network sharing weight parameters across the temporal axis of the video. This technique enables the trained agent to select annotation tasks from the whole video. We evaluate our annotation task selection method by means of a hierarchical supervoxel segmentation based mask propagation algorithm. © 2020 Elsevier B.V.
引用
下载
收藏
页码:247 / 258
页数:12
相关论文
共 50 条
  • [1] Reducing the Annotation Effort for Video Object Segmentation Datasets
    Voigtlaender, Paul
    Luo, Lishu
    Yuan, Chun
    Jiang, Yong
    Leibe, Bastian
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3059 - 3068
  • [2] Reducing Human Annotation Effort Using Self-supervised Learning for Image Segmentation
    Siriborvornratanakul, Thitirat
    ARTIFICIAL INTELLIGENCE IN HCI, PT I, AI-HCI 2024, 2024, 14734 : 436 - 445
  • [3] Segmentation of intentional human gestures for sports video annotation
    Chambers, GS
    Venkatesh, S
    West, GAW
    Bui, HH
    10TH INTERNATIONAL MULTIMEDIA MODELLING CONFERENCE, PROCEEDINGS, 2004, : 124 - 129
  • [4] Dynamic Face Video Segmentation via Reinforcement Learning
    Wang, Yujiang
    Dong, Mingzhi
    Shen, Jie
    Wu, Yang
    Cheng, Shiyang
    Pantic, Maja
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6957 - 6967
  • [5] Unsupervised Video Object Segmentation for Deep Reinforcement Learning
    Goel, Vik
    Weng, Jameson
    Poupart, Pascal
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [6] Deep Reinforcement Learning for Object Segmentation in Video Sequences
    Sahba, Farhang
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 857 - 860
  • [7] Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation
    Xie, Guangqi
    Li, Xin
    Lin, Shiqi
    Chen, Zhibo
    Zhang, Li
    Zhang, Kai
    Li, Yue
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [8] Reinforcement Cutting-Agent Learning for Video Object Segmentation
    Han, Junwei
    Yang, Le
    Zhang, Dingwen
    Chang, Xiaojun
    Liang, Xiaodan
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 9080 - 9089
  • [9] Human-in-the-Loop Video Semantic Segmentation Auto-Annotation
    Qiao, Nan
    Sun, Yuyin
    Liu, Chong
    Xia, Lu
    Luo, Jiajia
    Zhang, Ke
    Kuo, Cheng-Hao
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5870 - 5880
  • [10] Human segmentation in surveillance video with deep learning
    Monica Gruosso
    Nicola Capece
    Ugo Erra
    Multimedia Tools and Applications, 2021, 80 : 1175 - 1199