Deep Bingham Networks: Dealing with Uncertainty and Ambiguity in Pose Estimation

Cited by: 0
Authors
Haowen Deng
Mai Bui
Nassir Navab
Leonidas Guibas
Slobodan Ilic
Tolga Birdal
Affiliations
[1] Informatics at Technische Universität München, Computer Science Department
[2] Corporate Technology Siemens AG
[3] Stanford University
Source
Keywords
3D computer vision; Point clouds; Camera relocalization; 6D; Camera pose; Object pose; Rotation; Bingham distribution; Posterior distribution; Ambiguity; Uncertainty; Uncertainty estimation
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
In this work, we introduce Deep Bingham Networks (DBN), a generic framework that can naturally handle pose-related uncertainties and ambiguities arising in almost all real-life applications concerning 3D data. While existing works strive to find a single solution to the pose estimation problem, we make peace with the ambiguities that cause high uncertainty about which solution to identify as the best. Rather than a single estimate, we report a family of poses that captures the nature of the solution space. DBN extends state-of-the-art direct pose regression networks with (i) a multi-hypothesis prediction head that can yield different distribution modes; and (ii) novel loss functions that benefit from Bingham distributions on rotations. This way, DBN can work both in unambiguous cases, where it provides uncertainty information, and in ambiguous scenes, where an uncertainty per mode is desired. On the technical front, our network regresses continuous Bingham mixture models and is applicable both to 2D data such as images and to 3D data such as point clouds. We propose new training strategies to avoid mode or posterior collapse during training and to improve numerical stability. Our methods are thoroughly tested on two applications exploiting two different modalities: (i) 6D camera relocalization from images; and (ii) object pose estimation from 3D point clouds, demonstrating decent advantages over the state of the art. For the former, we contribute our own dataset composed of five indoor scenes in which it is unavoidable to capture images corresponding to views that are hard to identify uniquely. For the latter, we achieve the top results, especially for symmetric objects of the ModelNet dataset (Wu et al., in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 1912–1920, 2015). The code and dataset accompanying this paper are provided at https://multimodal3dvision.github.io.
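To make the role of the Bingham distribution on rotations more concrete, the following is a minimal sketch of an unnormalized Bingham log-density over unit quaternions. It is not the authors' implementation. It assumes the parameterization common in the Bingham literature (four orthonormal axes with concentrations, the first fixed at 0 and the rest non-positive), assumes a (w, x, y, z) quaternion ordering, and omits the normalizing constant, which a trained model would need. All names (`bingham_log_density_unnormalized`, `AXES`, `CONCENTRATIONS`) are hypothetical.

```python
import math


def normalize(q):
    """Scale a 4-vector to unit length so it represents a rotation."""
    norm = math.sqrt(sum(c * c for c in q))
    return tuple(c / norm for c in q)


def bingham_log_density_unnormalized(q, axes, concentrations):
    """Return sum_i z_i * (m_i . q)^2, i.e. log p(q) up to an additive constant.

    axes: four orthonormal 4-vectors m_i (assumed; the first one is the mode).
    concentrations: z_i <= 0 with z_0 = 0 (assumed convention).
    """
    q = normalize(q)
    total = 0.0
    for axis, z in zip(axes, concentrations):
        dot = sum(a * b for a, b in zip(axis, q))
        total += z * dot * dot
    return total


# Hypothetical parameters: mode at the identity rotation, tightly concentrated
# about the x and y directions, loosely about z (e.g. a near-symmetric object).
AXES = (
    (1.0, 0.0, 0.0, 0.0),
    (0.0, 1.0, 0.0, 0.0),
    (0.0, 0.0, 1.0, 0.0),
    (0.0, 0.0, 0.0, 1.0),
)
CONCENTRATIONS = (0.0, -50.0, -50.0, -2.0)

angle = math.radians(30.0)
rot_about_x = (math.cos(angle / 2), math.sin(angle / 2), 0.0, 0.0)
rot_about_z = (math.cos(angle / 2), 0.0, 0.0, math.sin(angle / 2))

score_x = bingham_log_density_unnormalized(rot_about_x, AXES, CONCENTRATIONS)
score_z = bingham_log_density_unnormalized(rot_about_z, AXES, CONCENTRATIONS)
score_z_flipped = bingham_log_density_unnormalized(
    tuple(-c for c in rot_about_z), AXES, CONCENTRATIONS
)
```

Because the density depends on the quaternion only through squared dot products, `q` and `-q` receive the same score, which matches the fact that both encode the same rotation. With the assumed concentrations, a 30-degree rotation about z scores higher than the same rotation about x, illustrating how the concentrations can express direction-dependent uncertainty.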
Pages: 1627-1654
Page count: 27