Deep Bingham Networks: Dealing with Uncertainty and Ambiguity in Pose Estimation

被引:0
|
作者
Haowen Deng
Mai Bui
Nassir Navab
Leonidas Guibas
Slobodan Ilic
Tolga Birdal
机构
[1] Informatics at Technische Universität München,Computer Science Department
[2] Corporate Technology Siemens AG,undefined
[3] Stanford University,undefined
来源
关键词
3D computer vision; Point clouds; Camera relocalization; 6D; Camera pose; Object pose; Rotation; Bingham distribution; Posterior distribution; Ambiguity; Uncertainty; Uncertainty estimation;
D O I
暂无
中图分类号
学科分类号
摘要
In this work, we introduce Deep Bingham Networks (DBN), a generic framework that can naturally handle pose-related uncertainties and ambiguities arising in almost all real life applications concerning 3D data. While existing works strive to find a single solution to the pose estimation problem, we make peace with the ambiguities causing high uncertainty around which solutions to identify as the best. Instead, we report a family of poses which capture the nature of the solution space. DBN extends the state of the art direct pose regression networks by (i) a multi-hypotheses prediction head which can yield different distribution modes; and (ii) novel loss functions that benefit from Bingham distributions on rotations. This way, DBN can work both in unambiguous cases providing uncertainty information, and in ambiguous scenes where an uncertainty per mode is desired. On a technical front, our network regresses continuous Bingham mixture models and is applicable to both 2D data such as images and to 3D data such as point clouds. We proposed new training strategies so as to avoid mode or posterior collapse during training and to improve numerical stability. Our methods are thoroughly tested on two different applications exploiting two different modalities: (i) 6D camera relocalization from images; and (ii) object pose estimation from 3D point clouds, demonstrating decent advantages over the state of the art. For the former we contributed our own dataset composed of five indoor scenes where it is unavoidable to capture images corresponding to views that are hard to uniquely identify. For the latter we achieve the top results especially for symmetric objects of ModelNet dataset (Wu et al., in: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1912–1920, 2015). The code and dataset accompanying this paper is provided under https://multimodal3dvision.github.io.
引用
收藏
页码:1627 / 1654
页数:27
相关论文
共 50 条
  • [21] Pose estimation and behavior classification of broiler chickens based on deep neural networks
    Fang, Cheng
    Zhang, Tiemin
    Zheng, Haikun
    Huang, Junduan
    Cuan, Kaixuan
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 180
  • [22] Vehicle Pose Estimation in WAMI Imagery via Deep Convolutional Neural Networks
    Yi, Meng
    Wang, Dong
    Yang, Fan
    Xu, Jonathan
    Cai, Yiran
    Blasch, Erik
    Sheaff, Carolyn
    Chen, Genshe
    Ling, Haibin
    PROCEEDINGS OF THE 2016 IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON) AND OHIO INNOVATION SUMMIT (OIS), 2016, : 233 - 240
  • [23] Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation
    Ning, Guanghan
    Zhang, Zhi
    He, Zhiquan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (05) : 1246 - 1259
  • [24] Probabilistic pose estimation using a Bingham distribution-based linear filter
    Srivatsan, Rangaprasad Arun
    Xu, Mengyun
    Zevallos, Nicolas
    Choset, Howie
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2018, 37 (13-14): : 1610 - 1631
  • [25] Online deep Bingham network for probabilistic orientation estimation
    Li, Wenjie
    Liu, Jia
    Hao, Wei
    Liu, Haisong
    Ren, Dayong
    Wang, Yanyan
    Chen, Lijun
    IET COMPUTER VISION, 2023, 17 (06) : 663 - 675
  • [26] Techniques for Dealing with Uncertainty in Cognitive Radio Networks
    Salahdine, Fatima
    Kaabouch, Naima
    El Ghazi, Hassan
    2017 IEEE 7TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE IEEE CCWC-2017, 2017,
  • [27] A deep structure for human pose estimation
    Zhao, Lin
    Gao, Xinbo
    Tao, Dacheng
    Li, Xuelong
    SIGNAL PROCESSING, 2015, 108 : 36 - 45
  • [28] Aneurysm Pose Estimation with Deep Learning
    Assis, Youssef
    Liao, Liang
    Pierre, Fabien
    Anxionnat, Rene
    Kerrien, Erwan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT II, 2023, 14221 : 543 - 553
  • [29] Deep probabilistic human pose estimation
    Petrov, Ilia
    Shakhuro, Vlad
    Konushin, Anton
    IET COMPUTER VISION, 2018, 12 (05) : 578 - 585
  • [30] Computer Vision Approaches based on Deep Learning and Neural Networks: Deep Neural Networks for Video Analysis of Human Pose Estimation
    Nishani, Eralda
    Cico, Betim
    2017 6TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2017, : 242 - 245