Object Detection Algorithm Based on Adaptive Feature Fusion and Cosine Similarity IoU-NMS

被引:0
|
作者
Ma S. [1 ,2 ]
Li N. [1 ]
Peng G. [3 ]
Yang X. [1 ,4 ]
Hou Z. [1 ]
机构
[1] School of Computer Science and Technology, Xi’an University of Posts and Telecommunications, Xi’an
[2] Shaanxi Key Laboratory of Network Data Analysis and Intelligent Processing, Xi’an University of Posts and Telecommunications, Xi’an
[3] School of Computer Science and Technology, XIDIAN University, Xi’an
[4] Xi’an Key Laboratory of Big Data and Intelligent Computing, Xi’an University of Posts and Telecommunications, Xi’an
关键词
cosine similarity; deep learning; intersection over union; multi-level feature fusion; non-maximum suppression; object detection;
D O I
10.3724/SP.J.1089.2024.19786
中图分类号
学科分类号
摘要
To address the problem of missing or repeating detection in the classical anchor-based RetinaNet, anchor-free FCOS, and other object detection algorithms, this paper proposes a novel object detection algorithm based on adaptive feature fusion and cosIoU-NMS. Firstly, the algorithm leverages an adaptive feature fusion module to obtain rich context and spatial information by weighted fusion of adjacent three-layer features in multi-scale features. Then, the cosIoU, which measures the cosine similarity and overlap area between detection boxes, is calculated to locate the target more precisely. Finally, by replacing Greedy-NMS with our cosIoU-NMS, redundant boxes with high confidence scores can be effectively suppressed, and thus retaining more accurate detection results. Based on RetinaNet and FCOS, the experimental results on the PASCAL VOC dataset demonstrate the detection accuracy of our proposed algorithm achieves 81.3% and 82.3%, with relative gains of 2.8 and 1.2 percentage points, respectively. On the MS COCO dataset, the accuracy reaches 36.8% and 38.0%, which is increased by 1.0 and 0.7 percentage points, respectively. The algorithm can improve the capability of feature representation, remove redundant detection boxes, and significantly boost the detection performance. © 2024 Institute of Computing Technology. All rights reserved.
引用
收藏
页码:112 / 121
页数:9
相关论文
共 34 条
  • [1] He Zewen, Zhang Wensheng, High resolution information reserved anchor-free detection algorithm, Journal of Computer-Aided Design & Computer Graphics, 33, 4, pp. 580-589, (2021)
  • [2] Yuan Shuai, Wang Kang, Shan Yi, Et al., Multi-scale object detection method based on multi-branch parallel dilated convolution, Journal of Computer-Aided Design & Computer Graphics, 33, 6, pp. 864-872, (2021)
  • [3] Liu L, Ouyang W L, Wang X G, Et al., Deep learning for generic object detection: A survey, International Journal of Computer Vision, 128, 2, pp. 261-318, (2020)
  • [4] Chen Keqi, Zhu Zhiliang, Deng Xiaoming, Et al., Deep learning for multi-scale object detection: a survey, Journal of Software, 32, 4, pp. 1201-1227, (2021)
  • [5] Cheng Xu, Song Chen, Shi Jingang, Et al., A survey of generic object detection methods based on deep learning, Acta Electronica Sinica, 49, 7, pp. 1428-1438, (2021)
  • [6] Chen Hongkun, Luo Huilan, Multi-scale semantic information fusion for object detection, Journal of Electronics & Information Technology, 43, 7, pp. 2087-2095, (2021)
  • [7] Girshick R, Donahue J, Darrell T, Et al., Rich feature hierarchies for accurate object detection and semantic segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, (2014)
  • [8] Girshick R., Fast R-CNN, Proceedings of the IEEE International Conference on Computer Vision, pp. 1440-1448, (2015)
  • [9] Ren S Q, He K M, Girshick R, Et al., Faster R-CNN: towards real-time object detection with region proposal networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 6, pp. 1137-1149, (2017)
  • [10] Zou Z X, Chen K Y, Shi Z W, Et al., Object detection in 20 years: a survey, Proceedings of the IEEE, 111, 3, pp. 257-276, (2023)