Consistent Depth Prediction for Transparent Object Reconstruction from RGB-D Camera

被引:0
|
作者
Cai, Yuxiang [1 ]
Zhu, Yifan [1 ]
Zhang, Haiwei [1 ]
Ren, Bo [1 ]
机构
[1] Nankai Univ, Tianjin, Peoples R China
关键词
SLAM;
D O I
10.1109/ICCV51070.2023.00320
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transparent objects are commonly seen in indoor scenes but are hard to estimate. Currently, commercial depth cameras face difficulties in estimating the depth of transparent objects due to the light reflection and refraction on their surface. As a result, they tend to make a noisy and incorrect depth value for transparent objects. These incorrect depth data make the traditional RGB-D SLAM method fails in reconstructing the scenes that contain transparent objects. An exact depth value of the transparent object is required to restore in advance and it is essential that the depth value of the transparent object must keep consistent in different views, or the reconstruction result will be distorted. Previous depth prediction methods of transparent objects can restore these missing depth values but none of them can provide a good result in reconstruction due to the inconsistency prediction. In this work, we propose a real-time reconstruction method using a novel stereo-based depth prediction network to keep the consistency of depth prediction in a sequence of images. Because there is no video dataset about transparent objects currently to train our model, we construct a synthetic RGB-D video dataset with different transparent objects. Moreover, to test generalization capability, we capture video from real scenes using the RealSense D435i RGB-D camera. We compare the metrics on our dataset and SLAM reconstruction results in both synthetic scenes and real scenes with the previous methods. Experiments show our significant improvement in accuracy on depth prediction and scene reconstruction.
引用
收藏
页码:3436 / 3445
页数:10
相关论文
共 50 条
  • [21] Automatic Object Searching by a Mobile Robot with Single RGB-D Camera
    Shim, Vui Ann
    Yuan, Miaolong
    Tan, Boon Hwa
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 56 - 62
  • [22] Performance of RGB-D camera for different object types in greenhouse conditions
    Ringdahl, Ola
    Kurtser, Polina
    Edan, Yael
    2019 EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR), 2019,
  • [23] OBJECT CLASSIFICATION FROM RGB-D IMAGES USING DEPTH CONTEXT KERNEL DESCRIPTORS
    Pan, Hong
    Olsen, Soren Ingvor
    Zhu, Yaping
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 512 - 516
  • [24] Robust Upper Limb Kinematic Reconstruction Using a RGB-D Camera
    Gioi, Salvatore Maria Li
    Loianno, Giuseppe
    Cordella, Francesca
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (04) : 3831 - 3837
  • [25] Delving into Calibrated Depth for Accurate RGB-D Salient Object Detection
    Li, Jingjing
    Ji, Wei
    Zhang, Miao
    Piao, Yongri
    Lu, Huchuan
    Cheng, Li
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (04) : 855 - 876
  • [26] Delving into Calibrated Depth for Accurate RGB-D Salient Object Detection
    Jingjing Li
    Wei Ji
    Miao Zhang
    Yongri Piao
    Huchuan Lu
    Li Cheng
    International Journal of Computer Vision, 2023, 131 : 855 - 876
  • [27] CDNet: Complementary Depth Network for RGB-D Salient Object Detection
    Jin, Wen-Da
    Xu, Jun
    Han, Qi
    Zhang, Yi
    Cheng, Ming-Ming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3376 - 3390
  • [28] Non-rigid Reconstruction with a Single Moving RGB-D Camera
    Elanattil, Shafeeq
    Moghadam, Peyman
    Sridharan, Sridha
    Fookes, Clinton
    Cox, Mark
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1049 - 1055
  • [29] Neural Surface Reconstruction of Dynamic Scenes with Monocular RGB-D Camera
    Cai, Hongrui
    Feng, Wanquan
    Feng, Xuetao
    Wang, Yan
    Zhang, Juyong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [30] Achieving Flexible 3D Reconstruction Volumes for RGB-D and RGB Camera Based Approaches
    Mock, Sebastian
    Lensing, Philipp
    Broll, Wolfgang
    COMPUTER VISION AND GRAPHICS, ICCVG 2016, 2016, 9972 : 221 - 232