High-Quality Object Detection Method for UAV Images Based on Improved DINO and Masked Image Modeling

被引:4
|
作者
Lu, Wanjie [1 ]
Niu, Chaoyang [1 ]
Lan, Chaozhen [1 ]
Liu, Wei [1 ]
Wang, Shiju [1 ]
Yu, Junming [2 ]
Hu, Tao [1 ]
机构
[1] PLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450052, Peoples R China
[2] China Elect Technol Grp Corp, Res Inst 27, Zhengzhou 450047, Peoples R China
关键词
UAV image; object detection; masked image modeling; global-local hybrid; NETWORK;
D O I
10.3390/rs15194740
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The extensive application of unmanned aerial vehicle (UAV) technology has increased academic interest in object detection algorithms for UAV images. Nevertheless, these algorithms present issues such as low accuracy, inadequate stability, and insufficient pre-training model utilization. Therefore, a high-quality object detection method based on a performance-improved object detection baseline and pretraining algorithm is proposed. To fully extract global and local feature information, a hybrid backbone based on the combination of convolutional neural network (CNN) and vision transformer (ViT) is constructed using an excellent object detection method as the baseline network for feature extraction. This backbone is then combined with a more stable and generalizable optimizer to obtain high-quality object detection results. Because the domain gap between natural and UAV aerial photography scenes hinders the application of mainstream pre-training models to downstream UAV image object detection tasks, this study applies the masked image modeling (MIM) method to aerospace remote sensing datasets with a lower volume than mainstream natural scene datasets to produce a pre-training model for the proposed method and further improve UAV image object detection accuracy. Experimental results for two UAV imagery datasets show that the proposed method achieves better object detection performance compared to state-of-the-art (SOTA) methods with fewer pre-training datasets and parameters.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] An industrial product surface anomaly detection method based on masked image modeling
    Tang, Shancheng
    Li, Heng
    Dai, Fenghua
    Yang, Jiqing
    Jin, Zicheng
    Lu, Jianhui
    Zhang, Ying
    NONDESTRUCTIVE TESTING AND EVALUATION, 2024,
  • [22] DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment
    Bai, Yunpeng
    Wang, Xintao
    Cao, Yan-Pei
    Gee, Yixiao
    Yuan, Chun
    Shane, Ying
    COMPUTER VISION - ECCV 2024, PT XXXI, 2025, 15089 : 472 - 488
  • [23] A Hyperparameter Quality Assessment Method for UAV Object Detection Based on IER Rule
    Kang, Xiao
    Mu, Quanqi
    Han, Wence
    Zhu, Hailong
    He, Wei
    Huang, Zhipeng
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 3876 - 3883
  • [24] Cross-Modal Oriented Object Detection of UAV Aerial Images Based on Image Feature
    Wang, Huiying
    Wang, Chunping
    Fu, Qiang
    Zhang, Dongdong
    Kou, Renke
    Yu, Ying
    Song, Jian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 21
  • [25] AUTOMATIC BANDWIDTH ESTIMATION STRATEGY FOR HIGH-QUALITY NON-PARAMETRIC MODELING BASED MOVING OBJECT DETECTION
    Cuevas, Carlos
    Garcia, Narciso
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 1757 - 1760
  • [26] Small-Object Detection for UAV-Based Images
    Yu, Mingrui
    Leung, Henry
    2023 IEEE INTERNATIONAL SYSTEMS CONFERENCE, SYSCON, 2023,
  • [27] High-Quality Proposals for Weakly Supervised Object Detection
    Cheng, Gong
    Yang, Junyu
    Gao, Decheng
    Guo, Lei
    Han, Junwei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 5794 - 5804
  • [28] Efficient, High-Quality Image Contour Detection
    Catanzaro, Bryan
    Su, Bor-Yiing
    Sundaram, Narayanan
    Lee, Yunsup
    Murphy, Mark
    Keutzer, Kurt
    2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 2381 - 2388
  • [29] A Wheat Spike Detection Method in UAV Images Based on Improved YOLOv5
    Zhao, Jianqing
    Zhang, Xiaohu
    Yan, Jiawei
    Qiu, Xiaolei
    Yao, Xia
    Tian, Yongchao
    Zhu, Yan
    Cao, Weixing
    REMOTE SENSING, 2021, 13 (16)
  • [30] Object Detection of UAV Images from Orthographic Perspective Based on Improved YOLOv5s
    Lu, Feng
    Li, Kewei
    Nie, Yunfeng
    Tao, Yejia
    Yu, Yihao
    Huang, Linbo
    Wang, Xing
    SUSTAINABILITY, 2023, 15 (19)