3D Neighborhood Convolution: Learning Depth-Aware Features for RGB-D and RGB Semantic Segmentation

被引:11
|
作者
Chen, Yunlu [1 ]
Mensink, Thomas [2 ]
Gavves, Efstratios [1 ]
机构
[1] Univ Amsterdam, Amsterdam, Netherlands
[2] Google Res, Amsterdam, Netherlands
关键词
D O I
10.1109/3DV.2019.00028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A key challenge for RGB-D segmentation is how to effectively incorporate 3D geometric information from the depth channel into 2D appearance features. We propose to model the effective receptive field of 2D convolution based on the scale and locality from the 3D neighborhood. Standard convolutions are local in the image space (u, v), often with a fixed receptive field of 3x3 pixels. We propose to define convolutions local with respect to the corresponding point in the 3D real world space (x, y, z), where the depth channel is used to adapt the receptive field of the convolution, which yields the resulting filters invariant to scale and focusing on the certain range of depth. We introduce 3D Neighborhood Convolution (3DN-Conv), a convolutional operator around 3D neighborhoods. Further, we can use estimated depth to use our RGB-D based semantic segmentation model from RGB input. Experimental results validate that our proposed 3DN-Conv operator improves semantic segmentation, using either ground-truth depth (RGB-D) or estimated depth (RGB).
引用
收藏
页码:173 / 182
页数:10
相关论文
共 50 条
  • [21] A brief survey on RGB-D semantic segmentation using deep learning*
    Wang, Changshuo
    Wang, Chen
    Li, Weijun
    Wang, Haining
    [J]. DISPLAYS, 2021, 70
  • [22] 3D Instance Segmentation Using Deep Learning on RGB-D Indoor Data
    Yasir, Siddiqui Muhammad
    Sadiq, Amin Muhammad
    Ahn, Hyunsik
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (03): : 5777 - 5791
  • [23] RGB-D Semantic Segmentation and Label-Oriented Voxelgrid Fusion for Accurate 3D Semantic Mapping
    Shi, Wenjun
    Xu, Jingwei
    Zhu, Dongchen
    Zhang, Guanghui
    Wang, Xianshun
    Li, Jiamao
    Zhang, Xiaolin
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) : 183 - 197
  • [24] An RGB-D Fusion Based Semantic Segmentation Algorithm Based on Neighborhood Metric Relations
    Zhang, Jian
    Chen, Yeheng
    Zhu, Shiqiang
    Li, Yuehua
    [J]. Jiqiren/Robot, 2023, 45 (02): : 156 - 165
  • [25] Semantic Segmentation of RGB-D Images using 3Dand Local Neighbouring Features
    Fooladgar, Fahimeh
    Kasaei, Shohreh
    [J]. 2015 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2015, : 685 - 691
  • [26] 3D Semantic Scene Segmentation with Multi-View RGB-D Images in Indoor Environments
    Bae, Hye-Lim
    Kim, Incheol
    [J]. Journal of Institute of Control, Robotics and Systems, 2023, 29 (03): : 235 - 244
  • [27] ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation
    Cao, Jinming
    Leng, Hanchao
    Lischinski, Dani
    Cohen-Or, Danny
    Tu, Changhe
    Li, Yangyan
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7068 - 7077
  • [28] RGB-D Semantic Segmentation for Indoor Modeling Using Deep Learning: A Review
    Rached, Ishraq
    Hajji, Rafika
    Landes, Tania
    [J]. RECENT ADVANCES IN 3D GEOINFORMATION SCIENCE, 3D GEOINFO 2023, 2024, : 587 - 604
  • [29] Regularized Fully Convolutional Networks for RGB-D Semantic Segmentation
    Su, Wen
    Wang, Zengfu
    [J]. 2016 30TH ANNIVERSARY OF VISUAL COMMUNICATION AND IMAGE PROCESSING (VCIP), 2016,
  • [30] A 3D Point Correspondences Uncertainty Aware RGB-D SLAM System
    Pei, Fujun
    Zhou, Zhongxiang
    Zhu, Mingjun
    Zhao, Ning
    [J]. PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 1623 - 1627