Image-to-Point Registration via Cross-Modality Correspondence Retrieval

被引:0
|
作者
Bie, Lin [1 ]
Li, Siqi [1 ]
Cheng, Kai [2 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
[2] Army Engn Univ, Command Control Coll, Nanjing, Peoples R China
关键词
Image-to-Point Cloud registration; cross-modality correspondence retrieval; frustum point retrieval; combined correspondence retrieval; virtual point cloud;
D O I
10.1145/3652583.3658074
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image-to-Point Cloud registration between 2D images and 3D LiDAR point clouds is a significant task in computer vision. The traditional registration pipeline first establishes correspondences between images and point clouds and then performs pose estimation based on the generated matches. However, 2D-3D correspondences are inherently difficult to be established due to the large modality gap between images and LiDAR point clouds. To this end, we build a bridge to alleviate the 2D-3D modality gap, which aligns LiDAR point clouds to the virtual points generated by images. In this way, the modality gap can be alleviated to the domain gap of different types of point clouds, i.e. original point clouds and virtual point clouds. Concretely, our framework conducts feature fusion from the LiDAR and virtual point cloud by utilizing the Transformer layer. To relieve the domain gap, a frustum points retrieval module and a combined correspondences retrieval module are proposed based on the consistency of the feature and position descriptor to select the correct correspondences among the candidates, which are generated from the simultaneous retrieval of features and position descriptors. In the implementation procedure, we design a frustum retrieval loss and a combined correspondence retrieval loss for cross-modality correspondence retrieval. Experimental results and comparison with state-of-the-art Image-to-Point Cloud methods on KITTI and nuScenes datasets demonstrate our proposed method has achieved superior performance.
引用
收藏
页码:266 / 274
页数:9
相关论文
共 50 条
  • [1] Boosting Cross-Modality Image Registration
    Barbu, Adrian
    Ionasec, Razvan
    [J]. 2009 JOINT URBAN REMOTE SENSING EVENT, VOLS 1-3, 2009, : 89 - +
  • [2] CorrI2P: Deep Image-to-Point Cloud Registration via Dense Correspondence
    Ren, Siyu
    Zeng, Yiming
    Hou, Junhui
    Chen, Xiaodong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (03) : 1198 - 1208
  • [3] Correlative techniques for cross-modality medical image registration
    Richardson, DB
    Bury, EA
    [J]. MEDICAL IMAGING 1996: IMAGE PROCESSING, 1996, 2710 : 368 - 375
  • [4] Cross-Modality Medical Image Retrieval with Deep Features
    Mbilinyi, Ashery
    Schuldt, Heiko
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2632 - 2639
  • [5] Cross-Modality Personalization for Retrieval
    Murrugarra-Llerena, Nils
    Kovashka, Adriana
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6422 - 6431
  • [6] Accurate Registration of Cross-Modality Geometry via Consistent Clustering
    Zhao, Mingyang
    Huang, Xiaoshui
    Jiang, Jingen
    Mou, Luntian
    Yan, Dong-Ming
    Ma, Lei
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (07) : 4055 - 4067
  • [7] CROSS-MODALITY HASHING WITH PARTIAL CORRESPONDENCE
    Gu, Yun
    Xue, Haoyang
    Yang, Jie
    Shi, Pengfei
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 1925 - 1929
  • [8] Cross-Modality Image Registration Using a Training-Time Privileged Third Modality
    Yang, Qianye
    Atkinson, David
    Fu, Yunguan
    Syer, Tom
    Yan, Wen
    Punwani, Shonit
    Clarkson, Matthew J.
    Barratt, Dean C.
    Vercauteren, Tom
    Hu, Yipeng
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (11) : 3421 - 3431
  • [9] A metric for testing the accuracy of cross-modality image registration: Validation and application
    Black, KJ
    Videen, TO
    Perlmutter, JS
    [J]. JOURNAL OF COMPUTER ASSISTED TOMOGRAPHY, 1996, 20 (05) : 855 - 861
  • [10] General cross-modality registration framework for visible and infrared UAV target image registration
    Yu Luo
    Hao Cha
    Lei Zuo
    Peng Cheng
    Qing Zhao
    [J]. Scientific Reports, 13