Robust and Discriminative Feature Learning via Mutual Information Maximization for Object Detection in Aerial Images

被引:0
|
作者
Sun, Xu [1 ]
Yu, Yinhui [1 ]
Cheng, Qing [1 ]
机构
[1] Jilin Univ, Sch Commun Engn, Changchun 130012, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 80卷 / 03期
基金
中国国家自然科学基金;
关键词
Aerial images; object detection; mutual information; contrast learning; attention mechanism;
D O I
10.32604/cmc.2024.052725
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Object detection in unmanned aerial vehicle (UAV) aerial images has become increasingly important in military and civil applications. General object detection models are not robust enough against interclass similarity and intraclass variability of small objects, and UAV-specific nuisances such as uncontrolled weather conditions. Unlike previous approaches focusing on high-level semantic information, we report the importance of underlying features to improve detection accuracy and robustness from the information-theoretic perspective. Specifically, we propose a robust and discriminative feature learning approach through mutual information maximization (RD-MIM), which can be integrated into numerous object detection methods for aerial images. Firstly, we present the rank sample mining method to reduce underlying feature differences between the natural image domain and the aerial image domain. Then, we design a momentum contrast learning strategy to make object features similar to the same category and dissimilar to different categories. Finally, we construct a transformer-based global attention mechanism to boost object location semantics by leveraging the high interrelation of different receptive fields. We conduct extensive experiments on the VisDrone and Unmanned Aerial Vehicle Benchmark Object Detection and Tracking (UAVDT) datasets to prove the effectiveness of the proposed method. The experimental results show that our approach brings considerable robustness gains to basic detectors and advanced detection methods, achieving relative growth rates of 51.0% and 39.4% in corruption robustness, respectively. Our code is available at https:// github.com/cq100/RD-MIM (accessed on 2 August 2024).
引用
收藏
页码:4149 / 4171
页数:23
相关论文
共 50 条
  • [1] Video Infringement Detection via Feature Disentanglement and Mutual Information Maximization
    Liu, Zhenguang
    Yu, Xinyang
    Wang, Ruili
    Ye, Shuai
    Ma, Zhe
    Dong, Jianfeng
    He, Sifeng
    Qian, Feng
    Zhang, Xiaobo
    Zimmermann, Roger
    Yang, Lei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 144 - 152
  • [2] Learning to Reduce Information Bottleneck for Object Detection in Aerial Images
    Shen, Yuchen
    Zhang, Dong
    Song, Zhihao
    Jiang, Xuesong
    Ye, Qiaolin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [3] Learning Discriminative Features for Ground-Based Cloud Classification via Mutual Information Maximization
    Liu, Shuang
    Zhang, Zhong
    Xiao, Baihua
    Cao, Xiaozhong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (07): : 1422 - 1425
  • [4] Anchor Retouching via Model Interaction for Robust Object Detection in Aerial Images
    Liang, Dong
    Geng, Qixiang
    Wei, Zongqi
    Vorontsov, Dmitry A.
    Kim, Ekaterina L.
    Wei, Mingqiang
    Zhou, Huiyu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [5] OrtDet: An Orientation Robust Detector via Transformer for Object Detection in Aerial Images
    Zhao, Ling
    Liu, Tianhua
    Xie, Shuchun
    Huang, Haoze
    Qi, Ji
    REMOTE SENSING, 2022, 14 (24)
  • [6] Graph matching vs mutual information maximization for object detection
    Shams, LB
    Brady, MJ
    Schaal, S
    NEURAL NETWORKS, 2001, 14 (03) : 345 - 354
  • [7] Learning a discriminative feature for object detection based on feature fusing and context learning
    You Lei
    Wang Hongpeng
    Wang Yuan
    2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 543 - 547
  • [8] Community detection in hypergraphs via mutual information maximization
    Kritschgau, Juergen
    Kaiser, Daniel
    Rodriguez, Oliver Alvarado
    Amburg, Ilya
    Bolkema, Jessalyn
    Grubb, Thomas
    Lan, Fangfei
    Maleki, Sepideh
    Chodrow, Phil
    Kay, Bill
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [9] Loop Closure Detection via Maximization of Mutual Information
    Zhang, Ge
    Yan, Xiaoqiang
    Ye, Yangdong
    IEEE ACCESS, 2019, 7 : 124217 - 124232
  • [10] Detection of neovascularization in retinal images using mutual information maximization
    Kar, Sudeshna Sil
    Maity, Santi P.
    COMPUTERS & ELECTRICAL ENGINEERING, 2017, 62 : 194 - 208