Bimodal Camera Pose Prediction for Endoscopy

被引:4
|
作者
Rau, Anita [1 ,2 ]
Bhattarai, Binod [1 ,3 ]
Agapito, Lourdes [1 ]
Stoyanov, Danail [1 ]
机构
[1] UCL, Comp Sci Dept, London WC1E 6BT, England
[2] Stanford Univ, Biomed Data Sci Dept, Stanford, CA 94305 USA
[3] Univ Aberdeen, Sch Nat & Comp Sci, Aberdeen AB24 3FX, Scotland
来源
基金
英国工程与自然科学研究理事会; 欧盟地平线“2020”;
关键词
3D reconstruction; camera pose estimation; endoscopy; SLAM; surgical AI; COLONOSCOPY;
D O I
10.1109/TMRB.2023.3320267
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Deducing the 3D structure of endoscopic scenes from images is exceedingly challenging. In addition to deformation and view-dependent lighting, tubular structures like the colon present problems stemming from their self-occluding and repetitive anatomical structure. In this paper, we propose SimCol, a synthetic dataset for camera pose estimation in colonoscopy, and a novel method that explicitly learns a bimodal distribution to predict the endoscope pose. Our dataset replicates real colonoscope motion and highlights the drawbacks of existing methods. We publish 18k RGB images from simulated colonoscopy with corresponding depth and camera poses and make our data generation environment in Unity publicly available. We evaluate different camera pose prediction methods and demonstrate that, when trained on our data, they generalize to real colonoscopy sequences, and our bimodal approach outperforms prior unimodal work. Our project and dataset can be found here: https://www.github.com/anitarau/simcol.
引用
收藏
页码:978 / 989
页数:12
相关论文
共 50 条
  • [1] Camera Pose Matters: Improving Depth Prediction by Mitigating Pose Distribution Bias
    Zhao, Yunhan
    Kong, Shu
    Fowlkes, Charless
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15754 - 15763
  • [2] Camera Pose Estimation Based on Feature Extraction and Description for Robotic Gastrointestinal Endoscopy
    Xu, Yuwei
    Feng, Lijuan
    Xia, Zeyang
    Xiong, Jing
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2021, PT III, 2021, 13015 : 113 - 122
  • [3] Domain adaptation of networks for camera pose estimation: Learning camera pose estimation without pose labels
    Langerman, Jack
    Qiu, Ziming
    Sörös, Gábor
    Sebok, Dávid
    Wang, Yao
    Huang, Howard
    arXiv, 2021,
  • [4] CAMERA CALIBRATION WITH POSE GUIDANCE
    Ren, Yuzhuo
    Hu, Feng
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2180 - 2184
  • [5] Hybrid Camera Pose Estimation
    Camposeco, Federico
    Cohen, Andrea
    Pollefeys, Marc
    Sattler, Torsten
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 136 - 144
  • [6] Distributed Consensus on Camera Pose
    Jorstad, Anne
    DeMenthon, Daniel
    Wang, I-Jeng
    Burlina, Philippe
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (09) : 2396 - 2407
  • [7] OnionNet: Single-View Depth Prediction and Camera Pose Estimation for Unlabeled Video
    Gu, Tianhao
    Wang, Zhe
    Li, Dongdong
    Yang, Hai
    Du, Wenli
    Zhou, Yangming
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2021, 13 (04) : 995 - 1009
  • [8] Vision-Based Neurosurgical Guidance: Unsupervised Localization and Camera-Pose Prediction
    Sarwin, Gary
    Carretta, Alessandro
    Staartjes, Victor
    Zoli, Matteo
    Mazzatenta, Diego
    Regli, Luca
    Serra, Carlo
    Konukoglu, Ender
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VI, 2024, 15006 : 736 - 746
  • [9] Camera Pose Estimation using Human Head Pose Estimation
    Fischer, Robert
    Hoedlmoser, Michael
    Gelautz, Margrit
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4, 2022, : 877 - 886
  • [10] Camera Pose Auto-encoders for Improving Pose Regression
    Shavit, Yoli
    Keller, Yosi
    COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 140 - 157