Distortion-Aware Room Layout Estimation from A Single Fisheye Image

被引:4
|
作者
Meng, Ming [1 ]
Xiao, Likai [1 ]
Zhou, Yi [2 ]
Li, Zhaoxin [3 ]
Zhou, Zhong [1 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing, Peoples R China
[2] Beijing BigView Technol Co Ltd, Beijing, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Layout estimation; Deformable convolution; Fisheye image dataset; Orthographic projection;
D O I
10.1109/ISMAR52148.2021.00061
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Omnidirectional images of 180 degrees or 360 degrees field of view provide the entire visual content around the capture cameras, giving rise to more sophisticated scene understanding and reasoning and bringing broad application prospects for VR/AR/MR. As a result, researches on omni-directional image layout estimation have sprung up in recent years. However, existing layout estimation methods designed for panorama images cannot perform well on fisheye images, mainly due to lack of public fisheye dataset as well as the significantly differences in the positions and degree of distortions caused by different projection models. To fill theses gaps, in this work we first reuse the released large-scale panorama datasets and reproduce them to fisheye images via projection conversion, thereby circumventing the challenge of obtaining high-quality fisheye datasets with ground truth layout annotations. Then, we propose a distortion-aware module according to the distortion of the orthographic projection (i.e., OrthConv) to perform effective features extraction from fisheye images. Additionally, we exploit bidirectional LSTM with two-dimensional step mode for horizontal and vertical prediction to capture the long-range geometric pattern of the object for the global coherent predictions even with occlusion and cluttered scenes. We extensively evaluate our deformable convolution for room layout estimation task. In comparison with state-of-the-art approaches, our approach produces considerable performance gains in real-world dataset as well as in synthetic dataset. This technology provides high-efficiency and low-cost technical implementations for VR house viewing and MR video surveillance. We present an MR-based building video surveillance scene equipped with nine fisheye lens can achieve an immersive hybrid display experience, which can be used for intelligent building management in the future.
引用
收藏
页码:441 / 449
页数:9
相关论文
共 50 条
  • [1] DaFIR: Distortion-Aware Representation Learning for Fisheye Image Rectification
    Liao, Zhaokang
    Zhou, Wengang
    Li, Houqiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3606 - 3618
  • [2] DAN: Distortion-aware Network for fisheye image rectification using graph reasoning
    Yan, Yongjia
    Liu, Hongzhe
    Zhang, Cheng
    Xu, Cheng
    Xu, Bingxin
    Pan, Weiguo
    Dai, Songyin
    Song, Yiqing
    IMAGE AND VISION COMPUTING, 2025, 156
  • [3] Structure recovery from single omnidirectional image with distortion-aware learning
    Meng, Ming
    Zhou, Yi
    Zuo, Dongshi
    Li, Zhaoxin
    Zhou, Zhong
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (07)
  • [4] Distortion-Aware Monocular Depth Estimation for Omnidirectional Images
    Chen, Hong-Xiang
    Li, Kunhong
    Fu, Zhiheng
    Li, Mengyi
    Chen, Zonghao
    Guo, Yulan
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 (28) : 334 - 338
  • [5] ROBUST ROOM LAYOUT ESTIMATION FROM A SINGLE IMAGE WITH GEOMETRIC HINTS
    Deng, Ruifeng
    Chen, Xuejin
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 3673 - 3677
  • [6] Distortion-aware Depth Estimation with Gradient Priors from Panoramas of Indoor Scenes
    Yin, Ruihong
    Karaoglu, Sezer
    Gevers, Theo
    2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022, : 134 - 143
  • [7] Image Quality Assessment Based on Distortion-Aware Decision Fusion
    Peng, Peng
    Li, Zenian
    INTELLIGENT SCIENCE AND INTELLIGENT DATA ENGINEERING, ISCIDE 2011, 2012, 7202 : 644 - 651
  • [8] Distortion-aware Panoramic Image Resizing Using Seam Carving
    Choi, Bohyung
    Lee, Minyoung
    Jung, Seung-Won
    Lu, Yucheng
    2021 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2021,
  • [9] 360° SINGLE IMAGE SUPER RESOLUTION VIA DISTORTION-AWARE NETWORK AND DISTORTED PERSPECTIVE IMAGES
    Nishiyama, Akito
    Ikehata, Satoshi
    Aizawa, Kiyoharu
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1829 - 1833
  • [10] 3D Room Layout Estimation From a Single RGB Image
    Yan, Chenggang
    Shao, Biyao
    Zhao, Hao
    Ning, Ruixin
    Zhang, Yongdong
    Xu, Feng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (11) : 3014 - 3024