A Coarse-to-Fine Multi-class Object Detection in Drone Images Using Convolutional Neural Networks

被引:0
|
作者
Aburasain, R. Y. [1 ]
Edirisinghe, E. A. [2 ]
Zamim, M. Y. [3 ]
机构
[1] Jazan Univ, Dept Comp Sci, Jazan, Saudi Arabia
[2] Univ Loughborough, Dept Comp Sci, Loughborough, Leics, England
[3] Minist Educ, Jazan, Saudi Arabia
关键词
Drones; Multiclass object detection; Convolution neural networks; Unmanned aerial vehicles; PALM TREE DETECTION; IDENTIFICATION;
D O I
10.1007/978-3-031-11432-8_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-class object detection has a rapid evolution in the last few years with the rise of deep Convolutional Neural Networks (CNNs) learning based, in particular. However, the success approaches are based on high resolution ground level images and extremely large volume of data as in COCO and VOC datasets. On the other hand, the availability of the drones has been increased in the last few years and hence several new applications have been established. One of such is understanding drone footage by analysing, detecting, recognizing different objects in the covered area. In this study conducted, a collection of large images captured by a drone flying at a fixed altitude in a desert area located within the United Arab Emirates (UAE) is given and it is utilised for training and evaluating the CNN networks to be investigated. Three state-of-the-art CNN architectures, namely SSD500 with VGGNet-16 meta-architecture, SSD-500 with ResNet meta-architecture and YOLO-V3 with Darknet-53 are optimally configured, re-trained, tested and evaluated for the detection of three different classes of objects in the captured footage, namely, palm trees, group-of-animals/cattle and animal sheds in farms. Our preliminary experiments revealed that YOLO-V3 outperformed SSD-500 with VGGNet-16 by a large margin and has a considerable improvement as compared to using SSD-500 with ResNet. Therefore, it has been selected for further investigation, aiming to propose an efficient coarse-to-fine object detection model for multi-class object detection in drone images. To this end, the impact of changing the activation function of the hidden units and the pooling type in the pooling layer has been investigated in detail. In addition, the impact of tuning the learning rate and the selection of the most effective optimization method for general hyperparameters tuning is also investigated. The result demonstrated that the multi-class object detector developed has precision of 0.99, a recall of 0.94 and an F-score of 0.96, proving the efficiency of the multi-class object detection network developed.
引用
收藏
页码:12 / 33
页数:22
相关论文
共 50 条
  • [1] Object Detection Using Convolutional Neural Networks in a Coarse-to-Fine Manner
    Li, Xiaobin
    Wang, Shengjin
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (11) : 2037 - 2041
  • [2] A Coarse-to-Fine Taxonomy of Constellations for Fast Multi-class Object Detection
    Fidler, Sanja
    Boben, Marko
    Leonardis, Ales
    [J]. COMPUTER VISION-ECCV 2010, PT V, 2010, 6315 : 687 - 700
  • [3] Coarse-to-fine salient object detection based on deep convolutional neural networks
    Li, Ying
    Cui, Fan
    Xue, Xizhe
    Chan, Jonathan Cheung-Wai
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 64 : 21 - 32
  • [4] Airplane Detection Using Convolutional Neural Networks in a Coarse-to-fine Manner
    Li, Xiaobin
    Wang, Shengjin
    Jiang, Bitao
    Chan, Xiaobing
    [J]. PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2017, : 235 - 239
  • [5] Unsupervised feature selection for multi-class object detection using convolutional neural networks
    Matsugu, M
    Cardon, P
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2004, PT 1, 2004, 3173 : 864 - 869
  • [6] Convolutional neural networks for multi-class brain disease detection using MRI images
    Talo, Muhammed
    Yildirim, Ozal
    Baloglu, Ulas Baran
    Aydin, Galip
    Acharya, U. Rajendra
    [J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2019, 78
  • [7] Coarse-to-Fine Image Super-Resolution Using Convolutional Neural Networks
    Zhou, Liguo
    Wang, Zhongyuan
    Wang, Shu
    Luo, Yimin
    [J]. MULTIMEDIA MODELING, MMM 2018, PT II, 2018, 10705 : 73 - 81
  • [8] Coarse-to-Fine Trained Multi-Scale Convolutional Neural Networks for Image Classification
    Dou, Haobin
    Wu, Xihong
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [9] Convolutional Neural Networks for Multi-class Intrusion Detection System
    Potluri, Sasanka
    Ahmed, Shamim
    Diedrich, Christian
    [J]. MINING INTELLIGENCE AND KNOWLEDGE EXPLORATION, MIKE 2018, 2018, 11308 : 225 - 238
  • [10] Multi-class object detection system using hybrid convolutional neural network architecture
    Jay Laxman Borade
    Muddana A Lakshmi
    [J]. Multimedia Tools and Applications, 2022, 81 : 31727 - 31751