A Generic Visual Perception Domain Randomisation Framework for Gazebo

被引:0
|
作者
Borrego, Joao [1 ]
Figueiredo, Rui [1 ]
Dehban, Atabak [1 ]
Moreno, Plinio [1 ]
Bernardino, Alexandre [1 ]
Santos-Victor, Jose [1 ]
机构
[1] Inst Super Tecn, Inst Syst & Robot, Lisbon, Portugal
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The impressive results of applying deep neural networks in tasks such as object recognition, language translation, and solving digital games are largely attributed to the availability of massive amounts of high quality labelled data. However, despite numerous promising steps in incorporating these techniques in robotic tasks, the cost of gathering data with a real robot has halted the proliferation of deep learning in robotics. In this work, a plugin for the Gazebo simulator is presented, which allows rapid generation of synthetic data. By introducing variations in simulator-specific or irrelevant aspects of a task, one can train a model which exhibits some degree of robustness against those aspects, and ultimately narrow the reality gap between simulated and real-world data. To show a use-case of the developed software, we build a new dataset for detection and localisation of three object classes: box, cylinder and sphere. Our results in the object detection and localisation task demonstrate that with small datasets generated only in simulation, one can achieve comparable performance to that achieved when training on real-world images.
引用
收藏
页码:237 / 242
页数:6
相关论文
共 50 条
  • [31] A framework for the fusion of visual and tactile modalities for improving robot perception
    Wenchang ZHANG
    Fuchun SUN
    Hang WU
    Haolin YANG
    Science China(Information Sciences), 2017, 60 (01) : 145 - 156
  • [32] Components of visual perception in marketing contexts: a conceptual framework and review
    Kevin L. Sample
    Henrik Hagtvedt
    S. Adam Brasel
    Journal of the Academy of Marketing Science, 2020, 48 : 405 - 421
  • [33] PERCEPTION AND GENERIC MODELING
    WITKIN, A
    PENTLAND, A
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1984, 1 (12): : 1216 - 1216
  • [34] VO-Sim: A Generic Framework for Tuning and Evaluating Visual Odometry Systems
    Alaa, Islam
    Wassal, Amr
    2018 IEEE/ION POSITION, LOCATION AND NAVIGATION SYMPOSIUM (PLANS), 2018, : 432 - 439
  • [35] A generic framework for semantic video indexing based on visual concepts/contexts detection
    Elleuch, Nizar
    Ben Ammar, Anis
    Alimi, Adel M.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (04) : 1397 - 1421
  • [36] A generic and visual interfacing framework for bridging the interface between application systems and recognizers
    Peng, Shih-Jung
    Ruzicka, Jan Karel
    Chen, Deng-Jyi
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2006, 22 (05) : 1077 - 1091
  • [37] A generic framework for semantic video indexing based on visual concepts/contexts detection
    Nizar Elleuch
    Anis Ben Ammar
    Adel M. Alimi
    Multimedia Tools and Applications, 2015, 74 : 1397 - 1421
  • [38] Towards a Structural Framework for Explicit Domain Knowledge in Visual Analytics
    Rind, Alexander
    Wagner, Markus
    Aigner, Wolfgang
    2019 IEEE WORKSHOP ON VISUAL ANALYTICS IN HEALTHCARE (VAHC), 2019, : 33 - 40
  • [39] A Generative Adversarial Distribution Matching Framework for Visual Domain Adaptation
    Zhang, Kai
    Kang, Qi
    Pan, Le
    Wang, XueSong
    Cui, Can
    PROCEEDINGS OF THE 2019 IEEE 16TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC 2019), 2019, : 252 - 256
  • [40] A visual cortex domain model for surface perception of illusory contour figures
    Jeong, EH
    Hong, KH
    Lee, WB
    Kim, WH
    INTERNATIONAL CONFERENCE ON IMAGING SCIENCE, SYSTEMS, AND TECHNOLOGY, PROCEEDINGS, 1999, : 482 - 488