Improved YOLO V3 Algorithm and Its Application in Small Target Detection

被引:47
|
作者
Ju Moran [1 ,2 ,3 ,4 ,5 ]
Luo Haibo [1 ,2 ,4 ,5 ]
Wang Zhongbo [1 ,2 ,3 ,4 ,5 ]
He Miao [1 ,2 ,3 ,4 ,5 ]
Chang Zheng [1 ,2 ,3 ,4 ,5 ]
Hui Bin [1 ,2 ,3 ,4 ,5 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Liaoning, Peoples R China
[2] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110016, Liaoning, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[4] Chinese Acad Sci, Key Lab Opt Elect Informat Proc, Shenyang 110016, Liaoning, Peoples R China
[5] Key Lab Image Understanding & Comp Vis, Shenyang 110016, Liaoning, Peoples R China
关键词
machine vision; small target detection; YOLOV3; VEDAI dataset; K-means clustering algorithm;
D O I
10.3788/AOS201939.0715004
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
This study proposes an improved detection algorithm of YOLO V3 specially applied in small target detection to solve the problems of low detection and high false alarm rates of small targets in an image. The resolution of small targets is low, and their features arc not obvious; thus, this study proposes 2X upsampling for the feature map down-sampled by 8 X of the previous network, and the feature map upsampled by 2 X is concatenated with the output of the second ResNet block unit. A feature fusion target detection layer, whose feature map is down-sampled by 3 X, is established. Two ResNet units in the second ResNet block unit of Darknet53 in the YOLO V3 network structure arc added to obtain more features of the small target. The K-means clustering algorithm is used to select the number of candidate anchor boxes and aspect ratio dimensions. A comparative experiment is performed based on the improved YOLO V3 algorithm on the VEDAI dataset and YOLO V3 algorithm. The results show that the improved YOLO V3 algorithm can efficiently detect small targets and improve the mean average precision and recall rate of small targets.
引用
收藏
页数:8
相关论文
共 17 条
  • [1] Aerial Target Detection Based on Improved Faster R-CNN
    Feng Xiaoyu
    Mei Wei
    Hu Dashuai
    [J]. ACTA OPTICA SINICA, 2018, 38 (06)
  • [2] Fu C., 2017, ARXIV, P1
  • [3] Girshick R., 2015, P IEEE INT C COMPUTE, DOI [DOI 10.1109/ICCV.2015.169, 10.1109/ICCV.2015.169]
  • [4] He K., 2016, IEEE C COMPUT VIS PA, DOI [10.1007/978-3-319-46493-0_38, DOI 10.1007/978-3-319-46493-0_38, DOI 10.1109/CVPR.2016.90]
  • [5] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
  • [6] Multi-Objective Detection of Traffic Scenes Based on Improved SSD
    Hua Xia
    Wang Xinqing
    Wang Dong
    Ma Zhaoye
    Shao Faming
    [J]. ACTA OPTICA SINICA, 2018, 38 (12)
  • [7] Joseph RK, 2016, CRIT POL ECON S ASIA, P1
  • [8] SSD: Single Shot MultiBox Detector
    Liu, Wei
    Anguelov, Dragomir
    Erhan, Dumitru
    Szegedy, Christian
    Reed, Scott
    Fu, Cheng-Yang
    Berg, Alexander C.
    [J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 21 - 37
  • [9] Vehicle detection in aerial imagery : A small target detection benchmark
    Razakarivony, Sebastien
    Jurie, Frederic
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 34 : 187 - 203
  • [10] REDMON J, 2016, PROC CVPR IEEE, P779, DOI [DOI 10.1109/CVPR.2016.91, 10.1109/CVPR.2016.91]