A Coarse-to-Fine Multi-class Object Detection in Drone Images Using Convolutional Neural Networks

被引：0

作者：

Aburasain, R. Y. ^{[1
]}

Edirisinghe, E. A. ^{[2
]}

Zamim, M. Y. ^{[3
]}

机构：

[1] Jazan Univ, Dept Comp Sci, Jazan, Saudi Arabia

[2] Univ Loughborough, Dept Comp Sci, Loughborough, Leics, England

[3] Minist Educ, Jazan, Saudi Arabia

来源：

DIGITAL INTERACTION AND MACHINE INTELLIGENCE, MIDI 2021 | 2022年 / 440卷

关键词：

Drones; Multiclass object detection; Convolution neural networks; Unmanned aerial vehicles; PALM TREE DETECTION; IDENTIFICATION;

D O I：

10.1007/978-3-031-11432-8_2

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi-class object detection has a rapid evolution in the last few years with the rise of deep Convolutional Neural Networks (CNNs) learning based, in particular. However, the success approaches are based on high resolution ground level images and extremely large volume of data as in COCO and VOC datasets. On the other hand, the availability of the drones has been increased in the last few years and hence several new applications have been established. One of such is understanding drone footage by analysing, detecting, recognizing different objects in the covered area. In this study conducted, a collection of large images captured by a drone flying at a fixed altitude in a desert area located within the United Arab Emirates (UAE) is given and it is utilised for training and evaluating the CNN networks to be investigated. Three state-of-the-art CNN architectures, namely SSD500 with VGGNet-16 meta-architecture, SSD-500 with ResNet meta-architecture and YOLO-V3 with Darknet-53 are optimally configured, re-trained, tested and evaluated for the detection of three different classes of objects in the captured footage, namely, palm trees, group-of-animals/cattle and animal sheds in farms. Our preliminary experiments revealed that YOLO-V3 outperformed SSD-500 with VGGNet-16 by a large margin and has a considerable improvement as compared to using SSD-500 with ResNet. Therefore, it has been selected for further investigation, aiming to propose an efficient coarse-to-fine object detection model for multi-class object detection in drone images. To this end, the impact of changing the activation function of the hidden units and the pooling type in the pooling layer has been investigated in detail. In addition, the impact of tuning the learning rate and the selection of the most effective optimization method for general hyperparameters tuning is also investigated. The result demonstrated that the multi-class object detector developed has precision of 0.99, a recall of 0.94 and an F-score of 0.96, proving the efficiency of the multi-class object detection network developed.

引用

页码：12 / 33

页数：22

共 50 条

[1] Object Detection Using Convolutional Neural Networks in a Coarse-to-Fine Manner
Li, Xiaobin
Wang, Shengjin
[J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (11) : 2037 - 2041
[2] A Coarse-to-Fine Taxonomy of Constellations for Fast Multi-class Object Detection
Fidler, Sanja
Boben, Marko
Leonardis, Ales
[J]. COMPUTER VISION-ECCV 2010, PT V, 2010, 6315 : 687 - 700
[3] Coarse-to-fine salient object detection based on deep convolutional neural networks
Li, Ying
Cui, Fan
Xue, Xizhe
Chan, Jonathan Cheung-Wai
[J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 64 : 21 - 32
[4] Airplane Detection Using Convolutional Neural Networks in a Coarse-to-fine Manner
Li, Xiaobin
Wang, Shengjin
Jiang, Bitao
Chan, Xiaobing
[J]. PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2017, : 235 - 239
[5] Unsupervised feature selection for multi-class object detection using convolutional neural networks
Matsugu, M
Cardon, P
[J]. ADVANCES IN NEURAL NETWORKS - ISNN 2004, PT 1, 2004, 3173 : 864 - 869
[6] Convolutional neural networks for multi-class brain disease detection using MRI images
Talo, Muhammed
Yildirim, Ozal
Baloglu, Ulas Baran
Aydin, Galip
Acharya, U. Rajendra
[J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2019, 78
[7] Coarse-to-Fine Image Super-Resolution Using Convolutional Neural Networks
Zhou, Liguo
Wang, Zhongyuan
Wang, Shu
Luo, Yimin
[J]. MULTIMEDIA MODELING, MMM 2018, PT II, 2018, 10705 : 73 - 81
[8] Coarse-to-Fine Trained Multi-Scale Convolutional Neural Networks for Image Classification
Dou, Haobin
Wu, Xihong
[J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
[9] Convolutional Neural Networks for Multi-class Intrusion Detection System
Potluri, Sasanka
Ahmed, Shamim
Diedrich, Christian
[J]. MINING INTELLIGENCE AND KNOWLEDGE EXPLORATION, MIKE 2018, 2018, 11308 : 225 - 238
[10] Multi-class object detection system using hybrid convolutional neural network architecture
Jay Laxman Borade
Muddana A Lakshmi
[J]. Multimedia Tools and Applications, 2022, 81 : 31727 - 31751

← 1 2 3 4 5 →