Monocular depth estimation with multi-view attention autoencoder

被引:0
|
作者
Geunho Jung
Sang Min Yoon
机构
[1] Kookmin University,HCI Lab., College of Computer Science
来源
关键词
Depth estimation; Autoencoder; Attention module;
D O I
暂无
中图分类号
学科分类号
摘要
Depth map estimation from a single RGB image is a fundamental computer vision and image processing task for various applications. Deep learning based depth map estimation has improved prediction accuracy compared with traditional approaches by learning huge numbers of RGB-D images, but challenging issues remain for distorted and blurry reconstruction in object boundaries because the features are not enforced during training. This paper presents a multi-view attention autoencoder embedded in a deep neural network to emphasize self-representative features, which provide robust depth maps by simultaneously accentuating useful features and reducing redundant features to improve depth map estimation performance. Qualitative and quantitative experiments were conducted to verify the proposed network effectiveness, which can be utilized for three-dimensional scene reconstruction and understanding.
引用
收藏
页码:33759 / 33770
页数:11
相关论文
共 50 条
  • [41] Depth Detection in Multi-View Displays
    Yang-Mao, Shys-Fan
    Lin, Yu-Ting
    Wang, Te-Mei
    Ho, Chia-Hang
    Chen, Hsiao-Wei
    Wu, Chang-Shuo
    [J]. IDW/AD '12: PROCEEDINGS OF THE INTERNATIONAL DISPLAY WORKSHOPS, PT 1, 2012, 19 : 627 - 630
  • [42] Multi-View Masked Autoencoder for General Image Representation
    Ji, Seungbin
    Han, Sangkwon
    Rhee, Jongtae
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (22):
  • [43] Unsupervised multi-view stereo network based on multi-stage depth estimation
    Qi, Shuai
    Sang, Xinzhu
    Yan, Binbin
    Wang, Peng
    Chen, Duo
    Wang, Huachun
    Ye, Xiaoqian
    [J]. IMAGE AND VISION COMPUTING, 2022, 122
  • [44] Combining Sentiment Lexica with a Multi-View Variational Autoencoder
    Hoyle, Alexander
    Wolf-Sonkinf, Lawrence
    Wallach, Hanna
    Cotterellfi, Ryan
    Augenstein, Isabelle
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 635 - 640
  • [45] Multi-view representation model based on graph autoencoder
    Li, Jingci
    Lu, Guangquan
    Wu, Zhengtian
    Ling, Fuqing
    [J]. INFORMATION SCIENCES, 2023, 632 : 439 - 453
  • [46] Multi-view depth video coding using depth view synthesis
    Na, Sang-Tae
    Oh, Kwan-Jung
    Lee, Cheon
    Ho, Yo-Sung
    [J]. PROCEEDINGS OF 2008 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-10, 2008, : 1400 - 1403
  • [47] Elastic deep multi-view autoencoder with diversity embedding
    Daneshfar, Fatemeh
    Saifee, Bahar Sar
    Soleymanbaigi, Sayvan
    Aeini, Mohammad
    [J]. INFORMATION SCIENCES, 2025, 689
  • [48] Deep cross-view autoencoder network for multi-view learning
    Mi, Jian-Xun
    Fu, Chang-Qing
    Chen, Tao
    Gou, Tingting
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24645 - 24664
  • [49] Deep cross-view autoencoder network for multi-view learning
    Jian-Xun Mi
    Chang-Qing Fu
    Tao Chen
    Tingting Gou
    [J]. Multimedia Tools and Applications, 2022, 81 : 24645 - 24664
  • [50] Depth map-based disparity estimation technique using multi-view and depth camera
    Um, Gi-Mun
    Kim, Seung-Man
    Hur, Namho
    Lee, Kwan Hang
    Lee, Soo In
    [J]. STEREOSCOPIC DISPLAYS AND VIRTUAL REALITY SYSTEMS XIII, 2006, 6055