High-Quality Object Detection Method for UAV Images Based on Improved DINO and Masked Image Modeling

被引:1
|
作者
Lu, Wanjie [1 ]
Niu, Chaoyang [1 ]
Lan, Chaozhen [1 ]
Liu, Wei [1 ]
Wang, Shiju [1 ]
Yu, Junming [2 ]
Hu, Tao [1 ]
机构
[1] PLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450052, Peoples R China
[2] China Elect Technol Grp Corp, Res Inst 27, Zhengzhou 450047, Peoples R China
关键词
UAV image; object detection; masked image modeling; global-local hybrid; NETWORK;
D O I
10.3390/rs15194740
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The extensive application of unmanned aerial vehicle (UAV) technology has increased academic interest in object detection algorithms for UAV images. Nevertheless, these algorithms present issues such as low accuracy, inadequate stability, and insufficient pre-training model utilization. Therefore, a high-quality object detection method based on a performance-improved object detection baseline and pretraining algorithm is proposed. To fully extract global and local feature information, a hybrid backbone based on the combination of convolutional neural network (CNN) and vision transformer (ViT) is constructed using an excellent object detection method as the baseline network for feature extraction. This backbone is then combined with a more stable and generalizable optimizer to obtain high-quality object detection results. Because the domain gap between natural and UAV aerial photography scenes hinders the application of mainstream pre-training models to downstream UAV image object detection tasks, this study applies the masked image modeling (MIM) method to aerospace remote sensing datasets with a lower volume than mainstream natural scene datasets to produce a pre-training model for the proposed method and further improve UAV image object detection accuracy. Experimental results for two UAV imagery datasets show that the proposed method achieves better object detection performance compared to state-of-the-art (SOTA) methods with fewer pre-training datasets and parameters.
引用
下载
收藏
页数:24
相关论文
共 50 条
  • [1] Foreign Object Detection Method for Railway Contact Network Based on Improved DINO
    Shi, Tianyun
    Hou, Bo
    Li, Guohua
    Dai, Mingrui
    Zhongguo Tiedao Kexue/China Railway Science, 2024, 45 (04): : 158 - 167
  • [2] PREDATOR UAV PRODUCES HIGH-QUALITY IMAGES
    FULGHUM, DA
    AVIATION WEEK & SPACE TECHNOLOGY, 1994, 141 (22): : 62 - 62
  • [3] Predator UAV produces high-quality images
    Aviation Week and Space Technology (New York), 1994, 141 (22):
  • [4] Research on Object Detection and Recognition Method for UAV Aerial Images Based on Improved YOLOv5
    Zhang, Heng
    Shao, Faming
    He, Xiaohui
    Zhang, Zihan
    Cai, Yonggen
    Bi, Shaohua
    DRONES, 2023, 7 (06)
  • [5] Effective mmWave Radar Object Detection Pretraining Based on Masked Image Modeling
    Zhuang, Long
    Jiang, Tiezhen
    Wang, Jianhua
    An, Qi
    Xiao, Kai
    Wang, Anqi
    IEEE SENSORS JOURNAL, 2024, 24 (03) : 3999 - 4010
  • [6] PYRAMID MASKED IMAGE MODELING FOR TRANSFORMER-BASED AERIAL OBJECT DETECTION
    Zhang, Cong
    Liu, Tianshan
    Ju, Yakun
    Lam, Kin-Man
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1675 - 1679
  • [7] High-quality facial-expression image generation for UAV pedestrian detection
    Tang, Yumin
    Fan, Jing
    Qu, Jinshuai
    FRONTIERS IN SPACE TECHNOLOGIES, 2022, 3
  • [8] Object Detection Based on Improved YOLOv7 for UAV Aerial Image
    Cui, Liqun
    Cao, Huawei
    Computer Engineering and Applications, 60 (20): : 189 - 197
  • [9] Small object detection in UAV image based on improved YOLOv5
    Zhang, Jian
    Wan, Guoyang
    Jiang, Ming
    Lu, Guifu
    Tao, Xiuwen
    Huang, Zhiyuan
    SYSTEMS SCIENCE & CONTROL ENGINEERING, 2023, 11 (01)
  • [10] High-Quality Angle Prediction for Oriented Object Detection in Remote Sensing Images
    Wang, Guanchun
    Zhang, Xiangrong
    Zhu, Peng
    Tang, Xu
    Chen, Puhua
    Jiao, Licheng
    Zhou, Huiyu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61