A CNN-Transformer Hybrid Model Based on CSWin Transformer for UAV Image Object Detection

Cited by: 24

Authors:

Lu, Wanjie [1]

Lan, Chaozhen [2]

Niu, Chaoyang [1]

Liu, Wei [1]

Lyu, Liang [2]

Shi, Qunshan [2]

Wang, Shiju [1]

Affiliations:

[1] PLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450001, Peoples R China

[2] PLA Strateg Support Force Informat Engn Univ, Inst Geospatial Informat, Zhengzhou 450001, Peoples R China

Source:

IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING | 2023 / Vol. 16

Funding:

National Natural Science Foundation of China;

Keywords:

Object detection; Transformers; Feature extraction; Detectors; Autonomous aerial vehicles; Computational modeling; Training; Convolutional neural network (CNN); hybrid network; object detection; transformer; unmanned aerial vehicle (UAV) image; NETWORK;

DOI:

10.1109/JSTARS.2023.3234161

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Object detection in unmanned aerial vehicle (UAV) images has widespread applications in numerous fields; however, the complex background, diverse scales, and uneven distribution of objects in UAV images make object detection a challenging task. This study proposes a convolutional neural network (CNN)-transformer hybrid model to achieve efficient object detection in UAV images, which has three advantages that contribute to improving object detection performance. First, the efficient and effective cross-shaped window (CSWin) transformer is used as a backbone to obtain image features at different levels, and the obtained features are input into the feature pyramid network to achieve multiscale representation, which contributes to multiscale object detection. Second, a hybrid patch embedding module is constructed to extract and utilize low-level information such as the edges and corners of the image. Finally, a slicing-based inference method is constructed to fuse the inference results of the original image and sliced images, which improves small object detection accuracy without modifying the original network. Experimental results on public datasets illustrate that the proposed method improves performance more effectively than several popular and state-of-the-art object detection methods.
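The slicing-based inference idea in the abstract can be illustrated with a minimal sketch. The abstract does not state the tile size, the overlap, or the fusion rule, so the values below and the use of greedy non-maximum suppression (NMS) to fuse detections are assumptions, not the authors' method. The `detect` callable is a hypothetical stand-in for any detector; it receives a window `(x0, y0, x1, y1)` and returns boxes in window-local coordinates. Class labels are omitted for brevity.

```python
from typing import Callable, List, Tuple

Box = Tuple[float, float, float, float, float]  # (x1, y1, x2, y2, score)
Window = Tuple[int, int, int, int]              # (x0, y0, x1, y1)


def slice_windows(width: int, height: int, tile: int, overlap: float) -> List[Window]:
    """Return windows that cover the image, with neighbours overlapping."""
    step = max(1, int(tile * (1.0 - overlap)))

    def starts(size: int) -> List[int]:
        if size <= tile:
            return [0]
        s = list(range(0, size - tile, step))
        s.append(size - tile)  # last window sits flush with the border
        return s

    return [(x, y, min(x + tile, width), min(y + tile, height))
            for y in starts(height) for x in starts(width)]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: List[Box], iou_thr: float) -> List[Box]:
    """Greedy NMS: keep the highest-scoring box among overlapping ones."""
    kept: List[Box] = []
    for box in sorted(boxes, key=lambda b: b[4], reverse=True):
        if all(iou(box, k) < iou_thr for k in kept):
            kept.append(box)
    return kept


def sliced_inference(detect: Callable[[Window], List[Box]],
                     width: int, height: int,
                     tile: int = 512, overlap: float = 0.2,
                     iou_thr: float = 0.5) -> List[Box]:
    """Fuse full-image detections with detections from each slice."""
    merged = list(detect((0, 0, width, height)))  # pass over the original image
    for x0, y0, x1, y1 in slice_windows(width, height, tile, overlap):
        for bx1, by1, bx2, by2, score in detect((x0, y0, x1, y1)):
            # Shift slice-local boxes back into original image coordinates.
            merged.append((bx1 + x0, by1 + y0, bx2 + x0, by2 + y0, score))
    return nms(merged, iou_thr)
```

Because the detector is called unchanged on each window, this kind of wrapper leaves the network itself untouched, which matches the abstract's claim that small-object accuracy improves without modifying the original network.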


Pages: 1211-1231

Number of pages: 21
