3D Neighborhood Convolution: Learning Depth-Aware Features for RGB-D and RGB Semantic Segmentation

被引：11

作者：

Chen, Yunlu ^{[1
]}

Mensink, Thomas ^{[2
]}

Gavves, Efstratios ^{[1
]}

机构：

[1] Univ Amsterdam, Amsterdam, Netherlands

[2] Google Res, Amsterdam, Netherlands

来源：

2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019) | 2019年

关键词：

D O I：

10.1109/3DV.2019.00028

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A key challenge for RGB-D segmentation is how to effectively incorporate 3D geometric information from the depth channel into 2D appearance features. We propose to model the effective receptive field of 2D convolution based on the scale and locality from the 3D neighborhood. Standard convolutions are local in the image space (u, v), often with a fixed receptive field of 3x3 pixels. We propose to define convolutions local with respect to the corresponding point in the 3D real world space (x, y, z), where the depth channel is used to adapt the receptive field of the convolution, which yields the resulting filters invariant to scale and focusing on the certain range of depth. We introduce 3D Neighborhood Convolution (3DN-Conv), a convolutional operator around 3D neighborhoods. Further, we can use estimated depth to use our RGB-D based semantic segmentation model from RGB input. Experimental results validate that our proposed 3DN-Conv operator improves semantic segmentation, using either ground-truth depth (RGB-D) or estimated depth (RGB).

引用

页码：173 / 182

页数：10

共 50 条

[1] Depth-Aware CNN for RGB-D Segmentation
Wang, Weiyue
Neumann, Ulrich
[J]. COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 : 144 - 161
[2] RGB×D: Learning depth-weighted RGB patches for RGB-D indoor semantic segmentation
Cao, Jinming
Leng, Hanchao
Cohen-Or, Daniel
Lischinski, Dani
Chen, Ying
Tu, Changhe
Li, Yangyan
[J]. Neurocomputing, 2021, 462 : 568 - 580
[3] Joining geometric and RGB features for RGB-D semantic segmentation
Zhang, Shaopeng
Zhong, Min
Zeng, Gang
Gan, Rui
[J]. 2019 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2019, 11321
[4] 2.5D CONVOLUTION FOR RGB-D SEMANTIC SEGMENTATION
Xing, Yajie
Wang, Jingbo
Chen, Xiaokang
Zeng, Gang
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1410 - 1414
[5] Depth-aware Convolutional Neural Networks for accurate 3D Pose Estimation in RGB-D Images
Porzi, Lorenzo
Penate-Sanchez, Adrian
Ricci, Elisa
Moreno-Noguer, Francesc
[J]. 2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 5777 - 5783
[6] DEPTH REMOVAL DISTILLATION FOR RGB-D SEMANTIC SEGMENTATION
Fang, Tiyu
Liang, Zhen
Shao, Xiuli
Dong, Zihao
Li, Jinping
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2405 - 2409
[7] RGBxD: Learning depth-weighted RGB patches for RGB-D indoor semantic segmentation
Cao, Jinming
Leng, Hanchao
Cohen-Or, Daniel
Lischinski, Dani
Chen, Ying
Tu, Changhe
Li, Yangyan
[J]. NEUROCOMPUTING, 2021, 462 : 568 - 580
[8] Edge-Aware Convolution for RGB-D Image Segmentation
Chen, Rongsen
Zhang, Fang-Lue
Rhee, Taehyun
[J]. 2020 35TH INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2020,
[9] Depth-aware lightweight network for RGB-D salient object detection
Ling, Liuyi
Wang, Yiwen
Wang, Chengjun
Xu, Shanyong
Huang, Yourui
[J]. IET IMAGE PROCESSING, 2023, 17 (08) : 2350 - 2361
[10] RGB-D SEMANTIC SEGMENTATION: A REVIEW
Hu, Yaosi
Chen, Zhenzhong
Lin, Weiyao
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,

← 1 2 3 4 5 →