Adaptation via Proxy: Building Instance-Aware Proxy for Unsupervised Domain Adaptive 3D Object Detection

被引:1
|
作者
Li, Ziyu [1 ,2 ]
Yao, Yuncong [1 ,2 ]
Quan, Zhibin [1 ,2 ]
Qi, Lei [3 ]
Feng, Zhen-Hua [4 ]
Yang, Wankou [1 ,2 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[2] Southeast Univ, Key Lab Measurement & Control Complex Syst Engn, Minist Educ, Nanjing 210096, Peoples R China
[3] Southeast Univ, Sch Comp Sci & Engn, Nanjing 210096, Peoples R China
[4] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, England
来源
基金
中国国家自然科学基金;
关键词
Object detection; intelligent vehicle perception; domain adaptation; point cloud; instance-aware; unsupervised learning; autonomous vehicles;
D O I
10.1109/TIV.2023.3343878
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D detection task plays a crucial role in the perception system of intelligent vehicles. LiDAR-based 3D detectors perform well on particular autonomous driving benchmarks, but may poorly generalize to other domains. Existing 3D domain adaptive detection methods usually require annotation-related statistics or continuous refinement of pseudo-labels. The former is not always feasible for practical applications, while the latter lacks sufficient accurate supervision. In this work, we propose a novel unsupervised domain adaptive framework, namely Adaptation Via Proxy (AVP), that explicitly leverages cross-domain relationships to generate adequate high-quality samples, thus mitigating domain shifts for existing LiDAR-based 3D detectors. Specifically, we first train the detector on source domain with the curriculum example mining (CEM) strategy to enhance its generalization capability. Then, we integrate the profitable instance knowledge from the source domain with the contextual information from the target domain, to construct the instance-aware proxy, which is a data collection with diverse training scenes and stronger supervision. Finally, we fine-tune the pre-trained detector on the proxy data for further optimizing the detector to overcome domain gaps. To build the instance-aware proxy, two components are proposed, i.e., the multi-view multi-scale aggregation (MMA) method for producing high-quality pseudo-labels, and the hybrid instance augmentation (HIA) technique for integrating the knowledge from source annotations to enhance supervision. Note that AVP is architecture-agnostic thus it can be easily injected with any LiDAR-based 3D detectors. Extensive experiments on Waymo, nuScenes, KITTI and Lyft demonstrate the superiority of the proposed method over the state-of-the-art approaches for different adaptation scenarios.
引用
下载
收藏
页码:3478 / 3492
页数:15
相关论文
共 50 条
  • [41] Weakly supervised object detection from remote sensing images via self-attention distillation and instance-aware mining
    Yang, Peng
    Zhou, Shi
    Wang, Linlin
    Yang, Guowei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (13) : 39073 - 39095
  • [42] Weakly supervised object detection from remote sensing images via self-attention distillation and instance-aware mining
    Yang, Peng
    Zhou, Shi
    Wang, Linlin
    Yang, Guowei
    Multimedia Tools and Applications, 83 (13): : 39073 - 39095
  • [43] Self-Distillation for Unsupervised 3D Domain Adaptation
    Cardace, Adriano
    Spezialetti, Riccardo
    Ramirez, Pierluigi Zama
    Salti, Samuele
    Di Stefano, Luigi
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 4155 - 4166
  • [44] Unsupervised Domain Adaptation for 3D Human Pose Estimation
    Zhang, Xiheng
    Wong, Yongkang
    Kankanhalli, Mohan S.
    Geng, Weidong
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 926 - 934
  • [45] ConfMix: Unsupervised Domain Adaptation for Object Detection via Confidence-based Mixing
    Mattolin, Giulio
    Zanella, Luca
    Ricci, Elisa
    Wang, Yiming
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 423 - 433
  • [46] Joint deep feature learning and unsupervised visual domain adaptation for cross-domain 3D object retrieval
    Li, Wen-Hui
    Xiang, Shu
    Nie, Wei-Zhi
    Song, Dan
    Liu, An-An
    Li, Xuan-Ya
    Hao, Tong
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (05)
  • [47] Distribution Aware VoteNet for 3D Object Detection
    Liang, Junxiong
    An, Pei
    Ma, Jie
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1583 - 1591
  • [48] Unsupervised Domain Adaptation for Vertebrae Detection and Identification in 3D CT Volumes Using a Domain Sanity Loss
    Sager, Pascal
    Salzmann, Sebastian
    Burn, Felice
    Stadelmann, Thilo
    JOURNAL OF IMAGING, 2022, 8 (08)
  • [49] FocalFormer3D: Focusing on Hard Instance for 3D Object Detection
    Chen, Yilun
    Yu, Zhiding
    Chen, Yukang
    Lan, Shiyi
    Anandkumar, Anima
    Jia, Jiaya
    Alvarez, Jose M.
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8360 - 8371
  • [50] Source-free unsupervised multi-source domain adaptation via proxy task for person re-identification
    Yi Ding
    Zhikui Duan
    Shiren Li
    The Visual Computer, 2022, 38 : 1871 - 1882