Learning Balance Feature for Object Detection

被引:0
|
作者
Zhang, Zhiqiang [1 ,2 ]
Qiu, Xin [1 ]
Li, Yongzhou [1 ]
机构
[1] Chinese Acad Sci, Inst Microelect, 3 Beitucheng West Rd, Beijing 100029, Peoples R China
[2] Univ Chinese Acad Sci, 19 A Yuquan Rd, Beijing 100049, Peoples R China
关键词
object detection; Feature Pyramid Network; feature transformer; feature balance; FPN;
D O I
10.3390/electronics11172765
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the field of studying scale variation, the Feature Pyramid Network (FPN) replaces the image pyramid and has become one of the most popular object detection methods for detecting multi-scale objects. State-of-the-art methods have FPN inserted into a pipeline between the backbone and the detection head to enable shallow features with more semantic information. However, FPN is insufficient for object detection on various scales, especially for small-scale object detection. One of the reasons is that the features are extracted at different network depths, which introduces gaps between features. That is, as the network becomes deeper and deeper, the high-level features have more semantics but less content description. This paper proposes a new method that includes a multi-scale receptive fields extraction module, a feature constructor module, and an attention module to improve the detection efficiency of FPN for objects of various scales and to bridge the gap in content description and semantics between different layers. Together, these three modules make the detector capable of selecting the most suitable feature for objects. Especially for the attention module, this paper chooses to use a parallel structure to simultaneously extract channel and spatial attention from the same features. When we use Adopting Adaptive Training Sample Selection (ATSS) and FreeAnchor as the baseline and ResNet50 as the backbone, the experimental results on the MS COCO dataset show that our algorithm can enhance the mean average precision (mAP) by 3.7% and 2.4% compared to FPN, respectively.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] EFLDet: enhanced feature learning for object detection
    Liao, Yongwei
    Zhang, Guipeng
    Yang, Zhenguo
    Liu, Wenyin
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (02): : 1033 - 1045
  • [2] EFLDet: enhanced feature learning for object detection
    Yongwei Liao
    Guipeng Zhang
    Zhenguo Yang
    Wenyin Liu
    [J]. Neural Computing and Applications, 2022, 34 : 1033 - 1045
  • [3] Adaptive learning feature pyramid for object detection
    Wong, Fukoeng
    Hu, Haifeng
    [J]. IET COMPUTER VISION, 2019, 13 (08) : 742 - 748
  • [4] Learning a discriminative feature for object detection based on feature fusing and context learning
    You Lei
    Wang Hongpeng
    Wang Yuan
    [J]. 2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 543 - 547
  • [5] Object Detection Based on Feature Balance Pyramid in UAV Imagery
    Xu, Jiao
    Xu, Jian
    Xu, Zeming
    Xie, Zhengguang
    [J]. 6TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE, ICIAI2022, 2022, : 119 - 124
  • [6] Local to Global Feature Learning for Salient Object Detection
    Feng, Xuelu
    Zhou, Sanping
    Zhu, Zixin
    Wang, Le
    Hua, Gang
    [J]. PATTERN RECOGNITION LETTERS, 2022, 162 : 81 - 88
  • [7] Joint Pose Estimator and Feature Learning for Object Detection
    Ali, Karim
    Fleuret, Francois
    Hasler, David
    Fua, Pascal
    [J]. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 1373 - 1380
  • [8] Employing feature mixture for active learning of object detection
    Zhang, Licheng
    Lam, Siew-Kei
    Luo, Dingsheng
    Wu, Xihong
    [J]. NEUROCOMPUTING, 2024, 594
  • [9] Small object detection using deep feature learning and feature fusion network
    Tong, Kang
    Wu, Yiquan
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 132
  • [10] Object detection based on visual memory: a feature learning and feature imagination process
    Dai, Houde
    Lin, Mingqiang
    Jiang, Wei
    [J]. ENTERPRISE INFORMATION SYSTEMS, 2020, 14 (04) : 515 - 531