Sim-to-Real Visual Grasping via State Representation Learning Based on Combining Pixel-Level and Feature-Level Domain Adaptation

被引:4
|
作者
Park, Youngbin [1 ]
Lee, Sang Hyoung [2 ]
Suh, Il Hong [3 ]
机构
[1] Hanyang Univ, Dept Elect & Comp Engn, Seoul, South Korea
[2] Korea Inst Ind Technol, Innovat Smart Mfg R&D Dept, Seoul, South Korea
[3] CogAplex Co Ltd, Seoul, South Korea
关键词
D O I
10.1109/ICRA48506.2021.9561302
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this study, we present a method to grasp diverse unseen real-world objects using an off-policy actor-critic deep reinforcement learning (RL) with the help of a simulation and the use of as little real-world data as possible. Actor-critic deep RL is unstable and difficult to tune when a raw image is given as an input. Therefore, we use state representation learning (SRL) to make actor-critic RL feasible for visual grasping tasks. Meanwhile, to reduce visual reality gap between simulation and reality, we also employ a typical pixel-level domain adaptation that can map simulated images to realistic ones. In our method, as the SRL model is a common preprocessing module for simulated and real-world data, we perform SRL using real and adapted images. This pixel-level domain adaptation enables the robot to learn grasping skills in a real environment using small amounts of real-world data. However, the controller trained in the simulation should adapt to the real world efficiently. Hence, we propose a method combining a typical pixel-level domain adaptation and the proposed SRL model, where we perform SRL based on a feature-level domain adaptation. In evaluations of vision-based robotics grasping tasks, we show that the proposed method achieves a substantial improvement over a method that only employs a pixel-level or domain adaptation.
引用
收藏
页码:6300 / 6307
页数:8
相关论文
共 31 条
  • [1] Pixel-Level and Feature-Level Domain Adaptation for Heterogeneous SAR Target Recognition
    Chen, Zhuo
    Zhao, Lingjun
    He, Qishan
    Kuang, Gangyao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [2] Multimodel fusion method via sparse representation at pixel-level and feature-level
    Zhang, Shuai
    Liu, Bingqi
    Huang, Fuyu
    OPTICAL ENGINEERING, 2019, 58 (06)
  • [3] Pixel-Level Domain Adaptation for Real-to-Sim Object Pose Estimation
    Qian, Kun
    Duan, Yanhui
    Luo, Chaomin
    Zhao, Yongqiang
    Jing, Xingshuo
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (03) : 1618 - 1627
  • [4] Joint Feature-level and Pixel-level Domain Adaption for Object Detection in the Wild
    Luo, Qianhui
    Wang, Yue
    Li, Weijie
    Xiong, Rong
    2019 9TH IEEE ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER 2019), 2019, : 559 - 565
  • [5] Brain tumor segmentation in multimodal MRI via pixel-level and feature-level image fusion
    Liu, Yu
    Mu, Fuhao
    Shi, Yu
    Cheng, Juan
    Li, Chang
    Chen, Xun
    FRONTIERS IN NEUROSCIENCE, 2022, 16
  • [6] Domain adaptation based on feature-level and class-level alignment
    Zhao X.-Q.
    Jiang H.-M.
    Kongzhi yu Juece/Control and Decision, 2022, 37 (05): : 1203 - 1210
  • [7] MetaMVUC: Active Learning for Sample-Efficient Sim-to-Real Domain Adaptation in Robotic Grasping
    Gilles, Maximilian
    Furmans, Kai
    Rayyes, Rania
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (04): : 3644 - 3651
  • [8] Wildfire Smoke Classification Based on Synthetic Images and Pixel- and Feature-Level Domain Adaptation
    Mao, Jun
    Zheng, Change
    Yin, Jiyan
    Tian, Ye
    Cui, Wenbin
    SENSORS, 2021, 21 (23)
  • [9] Learning discriminative feature representation with pixel-level supervision for forest smoke recognition
    Tao, Huanjie
    Duan, Qianyue
    Lu, Minghao
    Hu, Zhenwu
    PATTERN RECOGNITION, 2023, 143
  • [10] Face recognition based on pixel-level and feature-level fusion of the top-level's wavelet sub-bands
    Huang, Zheng-Hai
    Li, Wen-Juan
    Wang, Jun
    Zhang, Ting
    INFORMATION FUSION, 2015, 22 : 95 - 104