Distortion-Aware Room Layout Estimation from A Single Fisheye Image

被引:4
|
作者
Meng, Ming [1 ]
Xiao, Likai [1 ]
Zhou, Yi [2 ]
Li, Zhaoxin [3 ]
Zhou, Zhong [1 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing, Peoples R China
[2] Beijing BigView Technol Co Ltd, Beijing, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Layout estimation; Deformable convolution; Fisheye image dataset; Orthographic projection;
D O I
10.1109/ISMAR52148.2021.00061
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Omnidirectional images of 180 degrees or 360 degrees field of view provide the entire visual content around the capture cameras, giving rise to more sophisticated scene understanding and reasoning and bringing broad application prospects for VR/AR/MR. As a result, researches on omni-directional image layout estimation have sprung up in recent years. However, existing layout estimation methods designed for panorama images cannot perform well on fisheye images, mainly due to lack of public fisheye dataset as well as the significantly differences in the positions and degree of distortions caused by different projection models. To fill theses gaps, in this work we first reuse the released large-scale panorama datasets and reproduce them to fisheye images via projection conversion, thereby circumventing the challenge of obtaining high-quality fisheye datasets with ground truth layout annotations. Then, we propose a distortion-aware module according to the distortion of the orthographic projection (i.e., OrthConv) to perform effective features extraction from fisheye images. Additionally, we exploit bidirectional LSTM with two-dimensional step mode for horizontal and vertical prediction to capture the long-range geometric pattern of the object for the global coherent predictions even with occlusion and cluttered scenes. We extensively evaluate our deformable convolution for room layout estimation task. In comparison with state-of-the-art approaches, our approach produces considerable performance gains in real-world dataset as well as in synthetic dataset. This technology provides high-efficiency and low-cost technical implementations for VR house viewing and MR video surveillance. We present an MR-based building video surveillance scene equipped with nine fisheye lens can achieve an immersive hybrid display experience, which can be used for intelligent building management in the future.
引用
收藏
页码:441 / 449
页数:9
相关论文
共 50 条
  • [41] Disentangling Orthogonal Planes for Indoor Panoramic Room Layout Estimation with Cross-Scale Distortion Awareness
    Shen, Zhijie
    Zheng, Zishuo
    Lin, Chunyu
    Nie, Lang
    Liao, Kang
    Zheng, Shuai
    Zhao, Yao
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17337 - 17345
  • [42] Detecting Fingerprint Distortion from a Single Image
    Si, Xuanbin
    Feng, Jianjiang
    Zhou, Jie
    2012 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS), 2012, : 1 - 6
  • [43] Robust Radial Distortion from a Single Image
    Bukhari, Faisal
    Dailey, Matthew N.
    ADVANCES IN VISUAL COMPUTING, PT II, 2010, 6454 : 11 - 20
  • [44] Image Synthesis from Layout with Locality-Aware Mask Adaption
    Li, Zejian
    Wu, Jingyu
    Koh, Immanuel
    Tang, Yongchuan
    Sun, Lingyun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13799 - 13808
  • [45] Contextualized CNN for Scene-Aware Depth Estimation From Single RGB Image
    Song, Wenfeng
    Li, Shuai
    Liu, Ji
    Hao, Aimin
    Zhao, Qinping
    Qin, Hong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (05) : 1220 - 1233
  • [46] Self-training Room Layout Estimation via Geometry-Aware Ray-Casting
    Solarte, Bolivar
    Wu, Chin-Hsuan
    Jhang, Jin-Cheng
    Lee, Jonathan
    Tsai, Yi-Hsuan
    Sun, Min
    COMPUTER VISION - ECCV 2024, PT XLV, 2025, 15103 : 253 - 269
  • [47] LGT-Net: Indoor Panoramic Room Layout Estimation with Geometry-Aware Transformer Network
    Jiang, Zhigang
    Xiang, Zhongzheng
    Xu, Jinhua
    Zhao, Ming
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1644 - 1653
  • [48] Manhattan Room Layout Reconstruction from a Single 360° Image: A Comparative Study of State-of-the-Art Methods
    Zou, Chuhang
    Su, Jheng-Wei
    Peng, Chi-Han
    Colburn, Alex
    Shan, Qi
    Wonka, Peter
    Chu, Hung-Kuo
    Hoiem, Derek
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (05) : 1410 - 1431
  • [49] Learning to Reconstruct 3D Non-Cuboid Room Layout from a Single RGB Image
    Yang, Cheng
    Zheng, Jia
    Dai, Xili
    Tang, Rui
    Ma, Yi
    Yuan, Xiaojun
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 235 - 244
  • [50] Learning to reconstruct 3D non-cuboid room layout from a single RGB image
    Yang, Cheng
    Zheng, Jia
    Dai, Xili
    Tang, Rui
    Ma, Yi
    Yuan, Xiaojun
    arXiv, 2021,