Improved YOLO V3 Algorithm and Its Application in Small Target Detection

被引：47

作者：

Ju Moran ^{[1
,2
,3
,4
,5
]}

Luo Haibo ^{[1
,2
,4
,5
]}

Wang Zhongbo ^{[1
,2
,3
,4
,5
]}

He Miao ^{[1
,2
,3
,4
,5
]}

Chang Zheng ^{[1
,2
,3
,4
,5
]}

Hui Bin ^{[1
,2
,3
,4
,5
]}

机构：

[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Liaoning, Peoples R China

[2] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110016, Liaoning, Peoples R China

[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[4] Chinese Acad Sci, Key Lab Opt Elect Informat Proc, Shenyang 110016, Liaoning, Peoples R China

[5] Key Lab Image Understanding & Comp Vis, Shenyang 110016, Liaoning, Peoples R China

来源：

ACTA OPTICA SINICA | 2019年 / 39卷 / 07期

关键词：

machine vision; small target detection; YOLOV3; VEDAI dataset; K-means clustering algorithm;

D O I：

10.3788/AOS201939.0715004

中图分类号：

O43 [光学];

学科分类号：

070207 ; 0803 ;

摘要：

This study proposes an improved detection algorithm of YOLO V3 specially applied in small target detection to solve the problems of low detection and high false alarm rates of small targets in an image. The resolution of small targets is low, and their features arc not obvious; thus, this study proposes 2X upsampling for the feature map down-sampled by 8 X of the previous network, and the feature map upsampled by 2 X is concatenated with the output of the second ResNet block unit. A feature fusion target detection layer, whose feature map is down-sampled by 3 X, is established. Two ResNet units in the second ResNet block unit of Darknet53 in the YOLO V3 network structure arc added to obtain more features of the small target. The K-means clustering algorithm is used to select the number of candidate anchor boxes and aspect ratio dimensions. A comparative experiment is performed based on the improved YOLO V3 algorithm on the VEDAI dataset and YOLO V3 algorithm. The results show that the improved YOLO V3 algorithm can efficiently detect small targets and improve the mean average precision and recall rate of small targets.

引用

页数：8

共 17 条

[1] Aerial Target Detection Based on Improved Faster R-CNN
Feng Xiaoyu
Mei Wei
Hu Dashuai
[J]. ACTA OPTICA SINICA, 2018, 38 (06)
[2] Fu C., 2017, ARXIV, P1
[3] Girshick R., 2015, P IEEE INT C COMPUTE, DOI [DOI 10.1109/ICCV.2015.169, 10.1109/ICCV.2015.169]
[4] He K., 2016, IEEE C COMPUT VIS PA, DOI [10.1007/978-3-319-46493-0_38, DOI 10.1007/978-3-319-46493-0_38, DOI 10.1109/CVPR.2016.90]
[5] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[6] Multi-Objective Detection of Traffic Scenes Based on Improved SSD
Hua Xia
Wang Xinqing
Wang Dong
Ma Zhaoye
Shao Faming
[J]. ACTA OPTICA SINICA, 2018, 38 (12)
[7] Joseph RK, 2016, CRIT POL ECON S ASIA, P1
[8] SSD: Single Shot MultiBox Detector
Liu, Wei
Anguelov, Dragomir
Erhan, Dumitru
Szegedy, Christian
Reed, Scott
Fu, Cheng-Yang
Berg, Alexander C.
[J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 21 - 37
[9] Vehicle detection in aerial imagery : A small target detection benchmark
Razakarivony, Sebastien
Jurie, Frederic
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 34 : 187 - 203
[10] REDMON J, 2016, PROC CVPR IEEE, P779, DOI [DOI 10.1109/CVPR.2016.91, 10.1109/CVPR.2016.91]

← 1 2 →