Embodied Visual Active Learning for Semantic Segmentation

被引:0
|
作者
Nilsson, David [1 ,2 ,3 ]
Pirinen, Aleksis [1 ]
Gartner, Erik [1 ,2 ,3 ]
Sminchisescu, Cristian [1 ,2 ]
机构
[1] Lund Univ, Fac Engn, Dept Math, Lund, Sweden
[2] Google Res, Mountain View, CA 94043 USA
[3] Google, Mountain View, CA 94043 USA
基金
欧盟地平线“2020”; 欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the task of embodied visual active learning, where an agent is set to explore a 3d environment with the goal to acquire visual scene understanding by actively selecting views for which to request annotation. While accurate on some benchmarks, today's deep visual recognition pipelines tend to not generalize well in certain real-world scenarios, or for unusual viewpoints. Robotic perception, in turn, requires the capability to refine the recognition capabilities for the conditions where the mobile system operates, including cluttered indoor environments or poor illumination. This motivates the proposed task, where an agent is placed in a novel environment with the objective of improving its visual recognition capability. To study embodied visual active learning, we develop a battery of agents - both learnt and pre-specified - and with different levels of knowledge of the environment. The agents are equipped with a semantic segmentation network and seek to acquire informative views, move and explore in order to propagate annotations in the neighbourhood of those views, then refine the underlying segmentation network by online retraining. The trainable method uses deep reinforcement learning with a reward function that balances two competing objectives: task performance, represented as visual recognition accuracy, which requires exploring the environment, and the necessary amount of annotated data requested during active exploration. We extensively evaluate the proposed models using the photorealistic Matterport3D simulator and show that a fully learnt method outperforms comparable pre-specified counterparts, even when requesting fewer annotations.
引用
收藏
页码:2373 / 2383
页数:11
相关论文
共 50 条
  • [1] Self-Supervised Embodied Learning for Semantic Segmentation
    Wang, Juan
    Liu, Xinzhu
    Zhao, Dawei
    Dai, Bin
    Liu, Huaping
    2023 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, ICDL, 2023, : 383 - 390
  • [2] Best Practices in Active Learning for Semantic Segmentation
    Mittal, Sudhanshu
    Niemeijer, Joshua
    Schaefer, Joerg P.
    Brox, Thomas
    PATTERN RECOGNITION, DAGM GCPR 2023, 2024, 14264 : 427 - 442
  • [3] Discwise Active Learning for LiDAR Semantic Segmentation
    Unal O.
    Dai D.
    Unal A.T.
    Van Gool L.
    IEEE Robotics and Automation Letters, 2023, 8 (11) : 7671 - 7678
  • [4] Active Learning for Semantic Segmentation with Expected Change
    Vezhnevets, Alexander
    Buhmann, Joachim M.
    Ferrari, Vittorio
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 3162 - 3169
  • [5] Adaptive Superpixel for Active Learning in Semantic Segmentation
    Kim, Hoyoung
    Oh, Minhyeon
    Hwang, Sehyun
    Kwak, Suha
    Ok, Jungseul
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 943 - 953
  • [6] Embodied Active Domain Adaptation for Semantic Segmentation via Informative Path Planning
    Zurbruegg, Rene
    Blum, Hermann
    Cadena, Cesar
    Siegwart, Roland
    Schmid, Lukas
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 8691 - 8698
  • [7] Semantic Segmentation with Active Semi-Supervised Learning
    Rangnekar, Aneesh
    Kanan, Christopher
    Hoffman, Matthew
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5955 - 5966
  • [8] Active Reinforcement Learning for the Semantic Segmentation of Urban Images
    Rad, Mahya Jodeiri
    Armenakis, Costas
    CANADIAN JOURNAL OF REMOTE SENSING, 2024, 50 (01)
  • [9] Distributed Active Learning for Semantic Segmentation on Walking Robots
    Puck, L.
    Spielbauer, N.
    Schaeble, D.
    Schnell, T.
    Buettner, T.
    Roennau, A.
    Dillmann, R.
    2021 20TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2021, : 461 - 467
  • [10] Learning Visual Words for Weakly-Supervised Semantic Segmentation
    Ru, Lixiang
    Du, Bo
    Wu, Chen
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 982 - 988