Adaptation via Proxy: Building Instance-Aware Proxy for Unsupervised Domain Adaptive 3D Object Detection

被引:1
|
作者
Li, Ziyu [1 ,2 ]
Yao, Yuncong [1 ,2 ]
Quan, Zhibin [1 ,2 ]
Qi, Lei [3 ]
Feng, Zhen-Hua [4 ]
Yang, Wankou [1 ,2 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[2] Southeast Univ, Key Lab Measurement & Control Complex Syst Engn, Minist Educ, Nanjing 210096, Peoples R China
[3] Southeast Univ, Sch Comp Sci & Engn, Nanjing 210096, Peoples R China
[4] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, England
来源
基金
中国国家自然科学基金;
关键词
Object detection; intelligent vehicle perception; domain adaptation; point cloud; instance-aware; unsupervised learning; autonomous vehicles;
D O I
10.1109/TIV.2023.3343878
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D detection task plays a crucial role in the perception system of intelligent vehicles. LiDAR-based 3D detectors perform well on particular autonomous driving benchmarks, but may poorly generalize to other domains. Existing 3D domain adaptive detection methods usually require annotation-related statistics or continuous refinement of pseudo-labels. The former is not always feasible for practical applications, while the latter lacks sufficient accurate supervision. In this work, we propose a novel unsupervised domain adaptive framework, namely Adaptation Via Proxy (AVP), that explicitly leverages cross-domain relationships to generate adequate high-quality samples, thus mitigating domain shifts for existing LiDAR-based 3D detectors. Specifically, we first train the detector on source domain with the curriculum example mining (CEM) strategy to enhance its generalization capability. Then, we integrate the profitable instance knowledge from the source domain with the contextual information from the target domain, to construct the instance-aware proxy, which is a data collection with diverse training scenes and stronger supervision. Finally, we fine-tune the pre-trained detector on the proxy data for further optimizing the detector to overcome domain gaps. To build the instance-aware proxy, two components are proposed, i.e., the multi-view multi-scale aggregation (MMA) method for producing high-quality pseudo-labels, and the hybrid instance augmentation (HIA) technique for integrating the knowledge from source annotations to enhance supervision. Note that AVP is architecture-agnostic thus it can be easily injected with any LiDAR-based 3D detectors. Extensive experiments on Waymo, nuScenes, KITTI and Lyft demonstrate the superiority of the proposed method over the state-of-the-art approaches for different adaptation scenarios.
引用
下载
收藏
页码:3478 / 3492
页数:15
相关论文
共 50 条
  • [31] SF-UDA3D: Source-Free Unsupervised Domain Adaptation for LiDAR-Based 3D Object Detection
    Saltori, Cristiano
    Lathuiliere, Stephane
    Sebe, Nicu
    Ricci, Elisa
    Galasso, Fabio
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 771 - 780
  • [32] SRDAN: Scale-aware and Range-aware Domain Adaptation Network for Cross-dataset 3D Object Detection
    Zhang, Weichen
    Li, Wen
    Xu, Dong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6765 - 6775
  • [33] CL3D: Unsupervised Domain Adaptation for Cross-LiDAR 3D Detection
    Peng, Xidong
    Zhu, Xinge
    Ma, Yuexin
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2047 - 2055
  • [34] MPPNet: Multi-frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection
    Chen, Xuesong
    Shi, Shaoshuai
    Zhu, Benjin
    Cheung, Ka Chun
    Xu, Hang
    Li, Hongsheng
    COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 680 - 697
  • [35] Objformer: Boosting 3D object detection via instance-wise interaction
    Tao, Manli
    Zhao, Chaoyang
    Tang, Ming
    Wang, Jinqiao
    PATTERN RECOGNITION, 2024, 146
  • [36] SC-UDA: Style and Content Gaps aware Unsupervised Domain Adaptation for Object Detection
    Yu, Fuxun
    Wang, Di
    Chen, Yinpeng
    Karianakis, Nikolaos
    Shen, Tong
    Yu, Pei
    Lymberopoulos, Dimitrios
    Lu, Sidi
    Shi, Weisong
    Chen, Xiang
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1061 - 1070
  • [37] Unsupervised Domain-Adaptive Object Detection via Localization Regression Alignment
    Piao, Zhengquan
    Tang, Linbo
    Zhao, Baojun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 35 (11) : 1 - 12
  • [38] Weighted Unsupervised Learning for 3D Object Detection
    Kowsari, Kamran
    Alassaf, Manal H.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (01) : 584 - 593
  • [39] TEMI-MOT: Towards Efficient Multi-Modality Instance-Aware Feature Learning for 3D Multi-Object Tracking
    Hu, Yufeng
    Zhou, Sanping
    Dong, Jinpeng
    Zheng, Nanning
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [40] Weakly supervised object detection from remote sensing images via self-attention distillation and instance-aware mining
    Peng Yang
    Shi Zhou
    Linlin Wang
    Guowei Yang
    Multimedia Tools and Applications, 2024, 83 : 39073 - 39095