SAN: Selective Alignment Network for Cross-Domain Pedestrian Detection

被引:14
|
作者
Jiao, Yifan [1 ]
Yao, Hantao [2 ]
Xu, Changsheng [2 ,3 ]
机构
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Proposals; Feature extraction; Detectors; Visualization; Training; Image color analysis; Adaptation models; Cross-domain pedestrian detection; instance-level adaptation network; image-level adaptation network; pedestrian detection;
D O I
10.1109/TIP.2021.3049948
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-domain pedestrian detection, which has been attracting much attention, assumes that the training and test images are drawn from different data distributions. Existing methods focus on aligning the descriptions of whole candidate instances between source and target domains. Since there exists a giant visual difference among the candidate instances, aligning whole candidate instances between two domains cannot overcome the inter-instance difference. Compared with aligning the whole candidate instances, we consider that aligning each type of instances separately is a more reasonable manner. Therefore, we propose a novel Selective Alignment Network for cross-domain pedestrian detection, which consists of three components: a Base Detector, an Image-Level Adaptation Network, and an Instance-Level Adaptation Network. The Image-Level Adaptation Network and Instance-Level Adaptation Network can be regarded as the global-level and local-level alignments, respectively. Similar to the Faster R-CNN, the Base Detector, which is composed of a Feature module, an RPN module and a Detection module, is used to infer a robust pedestrian detector with the annotated source data. Once obtaining the image description extracted by the Feature module, the Image-Level Adaptation Network is proposed to align the image description with an adversarial domain classifier. Given the candidate proposals generated by the RPN module, the Instance-Level Adaptation Network firstly clusters the source candidate proposals into several groups according to their visual features, and thus generates the pseudo label for each candidate proposal. After generating the pseudo labels, we align the source and target domains by maximizing and minimizing the discrepancy between the prediction of two classifiers iteratively. Extensive evaluations on several benchmarks demonstrate the effectiveness of the proposed approach for cross-domain pedestrian detection.
引用
收藏
页码:2155 / 2167
页数:13
相关论文
共 50 条
  • [1] Cross-domain pedestrian detection via feature alignment and image quality assessment
    Yao, Jun
    Guo, Zhilin
    Yu, Junjie
    Yan, Nan
    Wang, Qiong
    Yu, Wei
    [J]. ISCIENCE, 2024, 27 (04)
  • [2] AFAN: Augmented Feature Alignment Network for Cross-Domain Object Detection
    Wang, Hongsong
    Liao, Shengcai
    Shao, Ling
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4046 - 4056
  • [3] Multi-Level Alignment Network for Cross-Domain Ship Detection
    Xu, Chujie
    Zheng, Xiangtao
    Lu, Xiaoqiang
    [J]. REMOTE SENSING, 2022, 14 (10)
  • [4] Cross-Domain Defect Detection Network
    Zhou, Zhenkang
    Lan, Chuwen
    Gao, Zehua
    [J]. 2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, : 272 - 279
  • [5] Adapting Object Detectors via Selective Cross-Domain Alignment
    Zhu, Xinge
    Pang, Jiangmiao
    Yang, Ceyuan
    Shi, Jianping
    Lin, Dahua
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 687 - 696
  • [6] Adversarial Category Alignment Network for Cross-domain Sentiment Classification
    Qu, Xiaoye
    Zou, Zhikang
    Cheng, Yu
    Yang, Yang
    Zhou, Pan
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2496 - 2508
  • [7] A deep selective learning network for cross-domain recommendation
    Liu, Huiting
    Liu, Qian
    Li, Peipei
    Zhao, Peng
    Wu, Xindong
    [J]. APPLIED SOFT COMPUTING, 2022, 125
  • [8] GCCD: A Generative Cross-Domain Change Detection Network
    Zhang, Mengmeng
    Guo, Chengwang
    Zhang, Yuxiang
    Liu, Huan
    Li, Wei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [9] DIVERGENCE-GUIDED FEATURE ALIGNMENT FOR CROSS-DOMAIN OBJECT DETECTION
    Li, Zongyao
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2240 - 2244
  • [10] DOMAIN-INVARIANT REGION PROPOSAL NETWORK FOR CROSS-DOMAIN DETECTION
    Yang, Xuebin
    Wan, Shouhong
    Jin, Peiquan
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,