A lightweight object detection method based on fine-grained information extraction and exchange in UAV aerial images

被引:0
|
作者
Zhou, Liming
Zhao, Shuai
Li, Shilong
Wang, Yadi [1 ]
Liu, Yang
Zuo, Xianyu
机构
[1] Henan Univ, Henan Key Lab Big Data Anal & Proc, Kaifeng 475000, Peoples R China
关键词
UAV aerial image object detection; Feature extraction; Feature aggregation; Feature recombination; NETWORKS;
D O I
10.1016/j.knosys.2025.113253
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objects in unmanned aerial vehicle (UAV) images are easily disturbed by complex backgrounds, and objects different positions in these images often have notable differences in size because of different shooting angles. To effectively detect objects, current mainstream methods improve the detection accuracy through complex iterative convolution operations and attention mechanisms. However, these methods not only improve the accuracy but also result in a high memory overhead and feature redundancy, which brings unbearable load pressure to the UAV platform. Therefore, to refine the multi-granularity object information of UAV aerial images via a lightweight manner and improve the precision of multi-scale object detection, we design an portable lightweight multi-scale UAV image object detection network (UAVDNet) based on MConvBottleNet. First, to overcome the challenge that the conventional convolution receptive field is unitary and easily loses the fine-grained information described above, we design a multifunctional convolution (MConv) module achieve multi-receptive field information aggregation and feature weighting through hierarchical mechanism. Second, we propose MConvBottleNet to simultaneously aggregate local and global information using residual connections and channel shuffling operations on the basis of the diverse information provided by MConv. Third, to effectively exploit the context information in high-level semantic feature maps and preserve the original fine-grained details to the maximum possible extent, we design an inter-layer cascaded information aggregation pooling (ICIAP) module, which, together with MConvBottleNet, constitutes the feature extraction network. Finally, we propose a fusion network based on the feature recombination and enhancement (FRE) module, denoted as FRENet, which can take advantage of the information-complementary characteristic different channel layers to obtain overall channel enhancement results and effectively improve the ability to detect multi-scale objects. Experiments on the VisDrone dataset show that UAVDNet achieves an average detection accuracy of 48.1% with only 4.4M parameters.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] A fine-grained image classification method based on information interaction
    Zhu, Shuo
    Zhang, Xukang
    Wang, Yu
    Wang, Zongyang
    Sun, Jiahao
    IET IMAGE PROCESSING, 2024, 18 (14) : 4852 - 4861
  • [22] PCLDet: Prototypical Contrastive Learning for Fine-Grained Object Detection in Remote Sensing Images
    Ouyang, Lihan
    Guo, Guangmiao
    Fang, Leyuan
    Ghamisi, Pedram
    Yue, Jun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [23] FOF: a fine-grained object detection and feature extraction end-to-end network
    Wenzhong Shen
    Jinpeng Chen
    Jie Shao
    International Journal of Multimedia Information Retrieval, 2023, 12
  • [24] FOF: a fine-grained object detection and feature extraction end-to-end network
    Shen, Wenzhong
    Chen, Jinpeng
    Shao, Jie
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2023, 12 (02)
  • [25] Efficient object detection and segmentation for fine-grained recognition
    Angelova, Anelia
    Zhu, Shenghuo
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 811 - 818
  • [26] Transmission Line Image Object Detection Method Considering Fine-Grained Contexts
    Wan, Neng
    Tang, Xuming
    Liu, Siyan
    Chen, Jiangqi
    Guo, Kegui
    Li, Luyao
    Liu, Shuai
    PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 499 - 502
  • [27] YOLO-ERF: lightweight object detector for UAV aerial images
    Wang, Xin
    He, Ning
    Hong, Chen
    Sun, Fengxi
    Han, Wenjing
    Wang, Qi
    MULTIMEDIA SYSTEMS, 2023, 29 (06) : 3329 - 3339
  • [28] YOLO-ERF: lightweight object detector for UAV aerial images
    Xin Wang
    Ning He
    Chen Hong
    Fengxi Sun
    Wenjing Han
    Qi Wang
    Multimedia Systems, 2023, 29 (6) : 3329 - 3339
  • [29] Towards Nested and Fine-Grained Open Information Extraction
    Wang, Jiawei
    Zheng, Xin
    Yang, Qiang
    Qu, Jianfeng
    Xu, Jiajie
    Chen, Zhigang
    Li, Zhixu
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE GRAPH EMPOWERS NEW INFRASTRUCTURE CONSTRUCTION, 2021, 1466 : 185 - 197
  • [30] Exploiting Temporal Information for DCNN-based Fine-Grained Object Classification
    Ge, ZongYuan
    McCool, Chris
    Sanderson, Conrad
    Wang, Peng
    Liu, Lingqiao
    Reid, Ian
    Corke, Peter
    2016 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2016, : 442 - 447