Deep 3D semantic scene extrapolation

被引:0
|
作者
Ali Abbasi
Sinan Kalkan
Yusuf Sahillioğlu
机构
[1] Middle East Technical University,
来源
The Visual Computer | 2019年 / 35卷
关键词
3D scenes; Extrapolation; Convolutional neural networks;
D O I
暂无
中图分类号
学科分类号
摘要
Scene extrapolation is a challenging variant of the scene completion problem, which pertains to predicting the missing part(s) of a scene. While the 3D scene completion algorithms in the literature try to fill the occluded part of a scene such as a chair behind a table, we focus on extrapolating the available half-scene information to a full one, a problem that, to our knowledge, has not been studied yet. Our approaches are based on convolutional neural networks (CNN). As input, we take the half of 3D voxelized scenes, then our models complete the other half of scenes as output. Our baseline CNN model consisting of convolutional and ReLU layers with multiple residual connections and Softmax classifier with voxel-wise cross-entropy loss function at the end. We train and evaluate our models on the synthetic 3D SUNCG dataset. We show that our trained networks can predict the other half of the scenes and complete the objects correctly with suitable lengths. With a discussion on the challenges, we propose scene extrapolation as a challenging test bed for future research in deep learning. We made our models available on https://github.com/aliabbasi/d3dsse.
引用
收藏
页码:271 / 279
页数:8
相关论文
共 50 条
  • [41] Exploiting 3D Semantic Scene Priors for Online Traffic Light Interpretation
    Barnes, Dan
    Maddern, Will
    Posner, Ingmar
    2015 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2015, : 573 - 578
  • [42] Extraction of 3D Scene Structure for Semantic Annotation and Retrieval of Unedited Video
    Feldmann, Ingo
    Waizenegger, Wolfgang
    Schreer, Oliver
    2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 82 - 87
  • [43] 3D Semantic Scene Perception Using Distributed Smart Edge Sensors
    Bultmann, Simon
    Behnke, Sven
    INTELLIGENT AUTONOMOUS SYSTEMS 17, IAS-17, 2023, 577 : 313 - 329
  • [44] Real-time 3D semantic map building in indoor scene
    Shan J.
    Li X.
    Zhang X.
    Jia S.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2019, 40 (05): : 240 - 248
  • [45] Visual-Inertial-Semantic Scene Representation for 3D Object Detection
    Dong, Jingming
    Fei, Xiaohan
    Soatto, Stefano
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3567 - 3577
  • [46] Language-Assisted 3D Feature Learning for Semantic Scene Understanding
    Zhang, Junbo
    Fan, Guofan
    Wang, Guanghan
    Su, Zhengyuan
    Ma, Kaisheng
    Yi, Li
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3445 - 3453
  • [47] Incremental 3D Semantic Scene Graph Prediction from RGB Sequences
    Wu, Shun-Cheng
    Tateno, Keisuke
    Navab, Nassir
    Tombari, Federico
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5064 - 5074
  • [48] BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation
    Wu, Zhennan
    Li, Yang
    Yan, Han
    Shang, Taizhang
    Sun, Weixuan
    Wang, Senbo
    Cui, Ruikai
    Liu, Weizhe
    Sato, Hiroyuki
    Li, Hongdong
    Ji, Pan
    ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (04):
  • [49] 3D Semantic Deep Learning Networks for Leukemia Detection
    Amin, Javaria
    Sharif, Muhammad
    Anjum, Muhammad Almas
    Siddiqa, Ayesha
    Kadry, Seifedine
    Nam, Yunyoung
    Raza, Mudassar
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 69 (01): : 785 - 799
  • [50] Deep Learning on 3D Semantic Segmentation: A Detailed Review
    Betsas, Thodoris
    Georgopoulos, Andreas
    Doulamis, Anastasios
    Grussenmeyer, Pierre
    REMOTE SENSING, 2025, 17 (02)