mSODANet: A network for multi-scale object detection in aerial images using hierarchical dilated convolutions *

被引:73
|
作者
Chalavadi, Vishnu [1 ]
Jeripothula, Prudviraj [1 ]
Datla, Rajeshreddy [1 ,2 ]
Babu, Sobhan Ch [1 ]
Mohan, Krishna C. [1 ]
机构
[1] Indian Inst Technol Hyderabad, Dept Comp Sci & Engn, Visual Learning & Intelligence Grp VIGIL, Kandi 502285, Sangareddy, India
[2] Adv Data Proc Res Inst ADRIN, Dept Space, Akbar Rd, Manovikas Nagar 500009, Secunderabad, India
关键词
Multi-scale object detection; Contextual features; Dilated convolutions; Aerial images;
D O I
10.1016/j.patcog.2022.108548
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A B S T R A C T The object detection in aerial images is one of the most commonly used tasks in the wide-range of computer vision applications. However, the object detection is more challenging due to the following issues: (a) the pixel occupancy vary among the different scales of objects, (b) the distribution of objects is not uniform in aerial images, (c) the appearance of an object varies with different view-points and illumination conditions, and (d) the number of objects, even though they belong to same type, vary across the images. To address these issues, we propose a novel network for multi-scale object detection in aerial images using hierarchical dilated convolutions, called as mSODANet. In particular, we probe hierarchical dilated network using parallel dilated convolutions to learn the contextual information of different types of objects at multiple scales and multiple field-of-views. The introduced hierarchical dilated network captures the visual information of aerial image more effectively and enhances the detection capability of the model. Further, the extensive experiments conducted on three challenging publicly available datasets, i.e., Visdrone2019, DOTA (OBB & HBB), NWPU VHR-10, demonstrate the effectiveness of the proposed mSODANet and achieve the state-of-the-art performance on all three datasets. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Multi-scale Dilated Convolutional Neural Network for Object Detection in UAV Images
    Zhang R.
    Shao Z.
    Aleksei P.
    Wang J.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2020, 45 (06): : 895 - 903
  • [2] SDMNet: Spatially dilated multi-scale network for object detection for drone aerial imagery
    Battish, Neeraj
    Kaur, Dapinder
    Chugh, Moksh
    Poddar, Shashi
    IMAGE AND VISION COMPUTING, 2024, 150
  • [3] Multi-Scale Cross Distillation for Object Detection in Aerial Images
    Wang, Kun
    Wang, Zi
    Li, Zhang
    Teng, Xichao
    Li, Yang
    COMPUTER VISION - ECCV 2024, PT XLIX, 2025, 15107 : 452 - 471
  • [4] MFEFNet: A Multi-Scale Feature Information Extraction and Fusion Network for Multi-Scale Object Detection in UAV Aerial Images
    Zhou, Liming
    Zhao, Shuai
    Wan, Ziye
    Liu, Yang
    Wang, Yadi
    Zuo, Xianyu
    DRONES, 2024, 8 (05)
  • [5] Multi-Scale Neural Network With Dilated Convolutions for Image Deblurring
    Ople, Jose Jaena Mari
    Yeh, Pin-Yi
    Sun, Shih-Wei
    Tsai, I-Te
    Hua, Kai-Lung
    IEEE ACCESS, 2020, 8 : 53942 - 53952
  • [6] A new multi-scale backbone network for object detection based on asymmetric convolutions
    Ma, Xianghua
    Yang, Zhenkun
    SCIENCE PROGRESS, 2021, 104 (02)
  • [7] Semantic image segmentation using fully convolutional neural networks with multi-scale images and multi-scale dilated convolutions
    Duc My Vo
    Sang-Woong Lee
    Multimedia Tools and Applications, 2018, 77 : 18689 - 18707
  • [8] Semantic image segmentation using fully convolutional neural networks with multi-scale images and multi-scale dilated convolutions
    Duc My Vo
    Lee, Sang-Woong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (14) : 18689 - 18707
  • [9] Small object detection in unmanned aerial vehicle images using multi-scale hybrid attention
    Song, Gang
    Du, Hongwei
    Zhang, Xinyue
    Bao, Fangxun
    Zhang, Yunfeng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 128
  • [10] YOLO-DroneMS: Multi-Scale Object Detection Network for Unmanned Aerial Vehicle (UAV) Images
    Zhao, Xueqiang
    Chen, Yangbo
    DRONES, 2024, 8 (11)