End-to-End Object Detection with Enhanced Positive Sample Filter

被引:1
|
作者
Song, Xiaolin [1 ]
Chen, Binghui
Li, Pengyu
Wang, Biao
Zhang, Honggang [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 03期
基金
中国国家自然科学基金;
关键词
end-to-end object detection; Enhanced Positive Sample Filter; Dual-stream Feature Enhancement; Disentangled Max Pooling Filter;
D O I
10.3390/app13031232
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Discarding Non-Maximum Suppression (NMS) post-processing and realizing fully end-to-end object detection is a recent research focus. Previous works have proved that the one-to-one label assignment strategy provides the chance to eliminate NMS during inference. However, this strategy might also result in multiple predictions with high scores due to the inconsistency of label assignment during training. Thus, how to adaptively identify only one positive sample as a final prediction for each Ground-Truth instance remains important. In this paper, we propose an Enhanced Positive Sample Filter (EPSF) to filter out the single positive sample for each Ground-Truth instance and lower the confidence of other negative samples. This is mainly achieved with two components: a Dual-stream Feature Enhancement module (DsFE) and a Disentangled Max Pooling Filter (DeMF). DsFE makes full use of representations trained with different targets so as to provide rich information clues for positive sample selection, while DeMF enhances the feature discriminability in potential foreground regions with disentangled pooling. With the proposed methods, our end-to-end detector achieves a better performances against existing NMS-free object detectors on COCO, PASCAL VOC, CrowdHuman and Caltech datasets.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Small-Object Detection in Remote Sensing Images with End-to-End Edge-Enhanced GAN and Object Detector Network
    Rabbi, Jakaria
    Ray, Nilanjan
    Schubert, Matthias
    Chowdhury, Subir
    Chao, Dennis
    [J]. REMOTE SENSING, 2020, 12 (09)
  • [32] RLSAC: Reinforcement Learning enhanced Sample Consensus for End-to-End Robust Estimation
    Nie, Chang
    Wang, Guangming
    Liu, Zhe
    Cavalli, Luca
    Pollefeys, Marc
    Wang, Hesheng
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9857 - 9866
  • [33] Exploring End-to-End object detection with transformers versus YOLOv8 for enhanced citrus fruit detection within trees
    Jrondi, Zineb
    Moussaid, Abdellatif
    Hadi, Moulay Youssef
    [J]. SYSTEMS AND SOFT COMPUTING, 2024, 6
  • [34] MSFgNet: A Novel Compact End-to-End Deep Network for Moving Object Detection
    Patil, Prashant W.
    Murala, Subrahmanyam
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (11) : 4066 - 4077
  • [35] AN END-TO-END ARCHITECTURE FOR CLASS-INCREMENTAL OBJECT DETECTION WITH KNOWLEDGE DISTILLATION
    Hao, Yu
    Fu, Yanwei
    Jiang, Yu-Gang
    Tian, Qi
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1 - 6
  • [36] Pruning DETR: efficient end-to-end object detection with sparse structured pruning
    Sun, Huaiyuan
    Zhang, Shuili
    Tian, Xve
    Zou, Yuanyuan
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (01) : 129 - 135
  • [37] MPNET: An End-to-End Deep Neural Network for Object Detection in Surveillance Video
    Wang, Hanyu
    Wang, Ping
    Qian, Xueming
    [J]. IEEE ACCESS, 2018, 6 : 30296 - 30308
  • [38] SalNet: Edge Constraint Based End-to-End Model for Salient Object Detection
    Han, Le
    Li, Xuelong
    Dong, Yongsheng
    [J]. PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT IV, 2018, 11259 : 186 - 198
  • [39] Density Map Guided Region Localization for End-to-End Small Object Detection
    Bo LI
    Kai HUANG
    Junhui LI
    Yufu LIAO
    [J]. Journal of Systems Science and Information, 2023, 11 (06) : 776 - 794
  • [40] Pruning DETR: efficient end-to-end object detection with sparse structured pruning
    Huaiyuan Sun
    Shuili Zhang
    Xve Tian
    Yuanyuan Zou
    [J]. Signal, Image and Video Processing, 2024, 18 : 129 - 135