mSODANet: A network for multi-scale object detection in aerial images using hierarchical dilated convolutions *

被引:73
|
作者
Chalavadi, Vishnu [1 ]
Jeripothula, Prudviraj [1 ]
Datla, Rajeshreddy [1 ,2 ]
Babu, Sobhan Ch [1 ]
Mohan, Krishna C. [1 ]
机构
[1] Indian Inst Technol Hyderabad, Dept Comp Sci & Engn, Visual Learning & Intelligence Grp VIGIL, Kandi 502285, Sangareddy, India
[2] Adv Data Proc Res Inst ADRIN, Dept Space, Akbar Rd, Manovikas Nagar 500009, Secunderabad, India
关键词
Multi-scale object detection; Contextual features; Dilated convolutions; Aerial images;
D O I
10.1016/j.patcog.2022.108548
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A B S T R A C T The object detection in aerial images is one of the most commonly used tasks in the wide-range of computer vision applications. However, the object detection is more challenging due to the following issues: (a) the pixel occupancy vary among the different scales of objects, (b) the distribution of objects is not uniform in aerial images, (c) the appearance of an object varies with different view-points and illumination conditions, and (d) the number of objects, even though they belong to same type, vary across the images. To address these issues, we propose a novel network for multi-scale object detection in aerial images using hierarchical dilated convolutions, called as mSODANet. In particular, we probe hierarchical dilated network using parallel dilated convolutions to learn the contextual information of different types of objects at multiple scales and multiple field-of-views. The introduced hierarchical dilated network captures the visual information of aerial image more effectively and enhances the detection capability of the model. Further, the extensive experiments conducted on three challenging publicly available datasets, i.e., Visdrone2019, DOTA (OBB & HBB), NWPU VHR-10, demonstrate the effectiveness of the proposed mSODANet and achieve the state-of-the-art performance on all three datasets. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] DMA-YOLO: multi-scale object detection method with attention mechanism for aerial images
    Li, Ya-ling
    Feng, Yong
    Zhou, Ming-liang
    Xiong, Xian-cai
    Wang, Yong-heng
    Qiang, Bao-hua
    VISUAL COMPUTER, 2024, 40 (06): : 4505 - 4518
  • [22] A Novel Multi-Scale Transformer for Object Detection in Aerial Scenes
    Lu, Guanlin
    He, Xiaohui
    Wang, Qiang
    Shao, Faming
    Wang, Hongwei
    Wang, Jinkang
    DRONES, 2022, 6 (08)
  • [23] Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images
    Zhang, Wang
    Liu, Chunsheng
    Chang, Faliang
    Song, Ye
    REMOTE SENSING, 2020, 12 (11)
  • [24] CONTEXT-AWARE HIERARCHICAL FEATURE ATTENTION NETWORK FOR MULTI-SCALE OBJECT DETECTION
    Xu, Xuelong
    Luo, Xiangfeng
    Ma, Liyan
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2011 - 2015
  • [25] Hierarchical alignment network for domain adaptive object detection in aerial images
    Ma, You
    Chai, Lin
    Jin, Lizuo
    Yan, Jun
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2024, 208 : 39 - 52
  • [26] Multi-Scale Object Detection Using Feature Fusion Recalibration Network
    Guo, Ziyuan
    Zhang, Weimin
    Liang, Zhenshuo
    Shi, Yongliang
    Huang, Qiang
    IEEE ACCESS, 2020, 8 : 51664 - 51673
  • [27] MEL-YOLO: A Novel YOLO Network With Multi-Scale, Effective, and Lightweight Methods for Small Object Detection in Aerial Images
    Yang, Yang
    Feng, Fangtao
    Liu, Guisuo
    Di, Juxing
    IEEE ACCESS, 2024, 12 : 194280 - 194295
  • [28] Multi-scale Context Enhancement Network for Object Detection
    Wang, Yanan
    Ma, Yingdong
    2022 2ND IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE (SEAI 2022), 2022, : 6 - 11
  • [29] Multi-scale semantic enhancement network for object detection
    Guo, Dongen
    Wu, Zechen
    Feng, Jiangfan
    Zou, Tao
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [30] StairsNet: Mixed Multi-scale Network for Object Detection
    Gao, Weiyi
    Cao, Wenlong
    Zhai, Jian
    Rui, Jianwu
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 303 - 314