Distortion-Aware Room Layout Estimation from A Single Fisheye Image

Cited by: 4
Authors
Meng, Ming [1]
Xiao, Likai [1]
Zhou, Yi [2]
Li, Zhaoxin [3]
Zhou, Zhong [1]
Affiliations
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing, Peoples R China
[2] Beijing BigView Technol Co Ltd, Beijing, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Layout estimation; Deformable convolution; Fisheye image dataset; Orthographic projection;
DOI
10.1109/ISMAR52148.2021.00061
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Omnidirectional images with a 180-degree or 360-degree field of view capture the entire visual content around the camera, enabling more sophisticated scene understanding and reasoning and opening broad application prospects for VR/AR/MR. As a result, research on omnidirectional image layout estimation has grown rapidly in recent years. However, existing layout estimation methods designed for panorama images do not perform well on fisheye images, mainly due to the lack of public fisheye datasets and the significant differences in the positions and degrees of distortion caused by different projection models. To fill these gaps, in this work we first reuse the released large-scale panorama datasets and convert them to fisheye images via projection conversion, thereby circumventing the challenge of obtaining high-quality fisheye datasets with ground-truth layout annotations. Then, we propose a distortion-aware module designed around the distortion of the orthographic projection (i.e., OrthConv) to perform effective feature extraction from fisheye images. Additionally, we exploit a bidirectional LSTM with a two-dimensional step mode for horizontal and vertical prediction to capture the long-range geometric pattern of the object, yielding globally coherent predictions even in occluded and cluttered scenes. We extensively evaluate our deformable convolution on the room layout estimation task. In comparison with state-of-the-art approaches, our approach produces considerable performance gains on real-world as well as synthetic datasets. This technology provides high-efficiency and low-cost technical implementations for VR house viewing and MR video surveillance. We present an MR-based building video surveillance scene equipped with nine fisheye lenses that achieves an immersive hybrid display experience and could be used for intelligent building management in the future.
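The panorama-to-fisheye projection conversion mentioned in the abstract can be illustrated with a minimal sketch. It assumes an equirectangular input panorama, a 180-degree orthographic fisheye model (r = sin θ, with the optical axis pointing at the panorama center), and nearest-neighbor sampling. The function name `pano_to_orthographic_fisheye` and these conventions are illustrative assumptions, not the authors' released pipeline.

```python
import numpy as np


def pano_to_orthographic_fisheye(pano, size):
    """Resample an equirectangular panorama into a size x size fisheye image.

    Assumed conventions (hypothetical, for illustration only):
    - the fisheye optical axis points at the panorama center (lon = 0, lat = 0);
    - orthographic model r = sin(theta), so the image circle spans 180 degrees.
    Returns the fisheye image and a boolean mask of pixels inside the circle.
    """
    h, w = pano.shape[:2]

    # Normalized fisheye coordinates in [-1, 1], sampled at pixel centers.
    ticks = (np.arange(size) + 0.5) / size * 2.0 - 1.0
    u, v = np.meshgrid(ticks, -ticks)  # u points right, v points up

    r2 = u ** 2 + v ** 2
    valid = r2 <= 1.0  # pixels outside the unit circle see nothing

    # Orthographic projection: the unit viewing ray is (u, v, sqrt(1 - r^2)).
    z = np.sqrt(np.clip(1.0 - r2, 0.0, None))

    # Ray direction -> longitude / latitude on the sphere.
    lon = np.arctan2(u, z)                    # in [-pi/2, pi/2]
    lat = np.arcsin(np.clip(v, -1.0, 1.0))    # in [-pi/2, pi/2]

    # Longitude / latitude -> equirectangular pixel indices (nearest neighbor).
    cols = ((lon / (2.0 * np.pi) + 0.5) * w).astype(int) % w
    rows = np.clip(((0.5 - lat / np.pi) * h).astype(int), 0, h - 1)

    out = np.zeros((size, size) + pano.shape[2:], dtype=pano.dtype)
    out[valid] = pano[rows[valid], cols[valid]]
    return out, valid
```

Because r = sin θ compresses rays near θ = 90 degrees into a thin ring at the image border, the distortion grows strongly toward the edge of the circle; this is the kind of position-dependent distortion that a distortion-aware convolution would need to account for. Layout annotations could be carried over by applying the same mapping to the annotated corner directions, again as an assumption rather than the paper's documented procedure.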
Pages: 441 - 449
Page count: 9
Related papers
50 in total
  • [31] PSMNet: Position-aware Stereo Merging Network for Room Layout Estimation
    Wang, Haiyan
    Hutchcroft, Will
    Li, Yuguang
    Wan, Zhiqiang
    Boyadzhiev, Ivaylo
    Tian, Yingli
    Kang, Sing Bing
    Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2022, 2022-June : 8606 - 8615
  • [32] LayoutNet: Reconstructing the 3D Room Layout from a Single RGB Image
    Zou, Chuhang
    Colburn, Alex
    Shan, Qi
    Hoiem, Derek
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2051 - 2059
  • [33] Distortion-Aware Self-Supervised Indoor 360° Depth Estimation via Hybrid Projection Fusion and Structural Regularities
    Wang, Xu
    Kong, Weifeng
    Zhang, Qiudan
    Yang, You
    Zhao, Tiesong
    Jiang, Jianmin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3998 - 4011
  • [34] Robust Line-Based Radial Distortion Estimation From a Single Image
    Zhang, Luwei
    Shang, Hongbo
    Wu, Fanlu
    Wang, Rui
    Sun, Tao
    Xie, Jingjiang
    IEEE ACCESS, 2019, 7 : 180373 - 180382
  • [35] Camera Distortion-aware 3D Human Pose Estimation in Video with Optimization-based Meta-Learning
    Cho, Hanbyel
    Cho, Yooshin
    Yu, Jaemyung
    Kim, Junmo
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11149 - 11158
  • [36] Multi-person 3D pose estimation from a single image captured by a fisheye camera
    Zhang, Yahui
    You, Shaodi
    Karaoglu, Sezer
    Gevers, Theo
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 222
  • [37] Automatic Calibration of the Fisheye Camera for Egocentric 3D Human Pose Estimation from a Single Image
    Zhang, Yahui
    You, Shaodi
    Gevers, Theo
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1771 - 1780
  • [38] Rethinking Generic Camera Models for Deep Single Image Camera Calibration to Recover Rotation and Fisheye Distortion
    Wakai, Nobuhiko
    Sato, Satoshi
    Ishii, Yasunori
    Yamashita, Takayoshi
    COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 : 679 - 698
  • [39] Single Image, Context Aware Action Estimation in Sports
    Lanius, Christian
    Kobayashi, Daisuke
    Ouchi, Kazushige
    Aoki, Yoshimitsu
    2018 14TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS), 2018, : 664 - 671
  • [40] Panoramic Image Composed of Multiple Rectilinear Images Generated from a Single Fisheye Image
    Kweon, Gyeong-Il
    JOURNAL OF THE OPTICAL SOCIETY OF KOREA, 2010, 14 (02) : 109 - 120