A CNN-Transformer Hybrid Model Based on CSWin Transformer for UAV Image Object Detection
被引:24
|
作者:
Lu, Wanjie
论文数: 0引用数: 0
h-index: 0
机构:
PLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450001, Peoples R ChinaPLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450001, Peoples R China
Lu, Wanjie
[1
]
Lan, Chaozhen
论文数: 0引用数: 0
h-index: 0
机构:
PLA Strateg Support Force Informat Engn Univ, Inst Geospatial Informat, Zhengzhou 450001, Peoples R ChinaPLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450001, Peoples R China
Lan, Chaozhen
[2
]
Niu, Chaoyang
论文数: 0引用数: 0
h-index: 0
机构:
PLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450001, Peoples R ChinaPLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450001, Peoples R China
Niu, Chaoyang
[1
]
Liu, Wei
论文数: 0引用数: 0
h-index: 0
机构:
PLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450001, Peoples R ChinaPLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450001, Peoples R China
Liu, Wei
[1
]
Lyu, Liang
论文数: 0引用数: 0
h-index: 0
机构:
PLA Strateg Support Force Informat Engn Univ, Inst Geospatial Informat, Zhengzhou 450001, Peoples R ChinaPLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450001, Peoples R China
Lyu, Liang
[2
]
Shi, Qunshan
论文数: 0引用数: 0
h-index: 0
机构:
PLA Strateg Support Force Informat Engn Univ, Inst Geospatial Informat, Zhengzhou 450001, Peoples R ChinaPLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450001, Peoples R China
Shi, Qunshan
[2
]
Wang, Shiju
论文数: 0引用数: 0
h-index: 0
机构:
PLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450001, Peoples R ChinaPLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450001, Peoples R China
Wang, Shiju
[1
]
机构:
[1] PLA Strateg Support Force Informat Engn Univ, Inst Data & Target Engn, Zhengzhou 450001, Peoples R China
[2] PLA Strateg Support Force Informat Engn Univ, Inst Geospatial Informat, Zhengzhou 450001, Peoples R China
The object detection of unmanned aerial vehicle (UAV) images has widespread applications in numerous fields; however, the complex background, diverse scales, and uneven distribution of objects in UAV images make object detection a challenging task. This study proposes a convolution neural network transformer hybrid model to achieve efficient object detection in UAV images, which has three advantages that contribute to improving object detection performance. First, the efficient and effective cross-shaped window (CSWin) transformer can be used as a backbone to obtain image features at different levels, and the obtained features can be input into the feature pyramid network to achieve multiscale representation, which will contribute to multiscale object detection. Second, a hybrid patch embedding module is constructed to extract and utilize low-level information such as the edges and corners of the image. Finally, a slicing-based inference method is constructed to fuse the inference results of the original image and sliced images, which will improve the small object detection accuracy without modifying the original network. Experimental results on public datasets illustrate that the proposed method can improve performance more effectively than several popular and state-of-the-art object detection methods.
机构:
College of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan,030024, ChinaCollege of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan,030024, China
Zhang, Yingjun
Bai, Xiaohui
论文数: 0引用数: 0
h-index: 0
机构:
College of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan,030024, ChinaCollege of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan,030024, China
Bai, Xiaohui
Xie, Binhong
论文数: 0引用数: 0
h-index: 0
机构:
College of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan,030024, ChinaCollege of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan,030024, China
机构:University of Shanghai for Science and Technology,Shanghai Engineering Research Center of Assistive Devices, School of Medical Instrument and Food Engineering
Zhihong Yu
Feifei Lee
论文数: 0引用数: 0
h-index: 0
机构:University of Shanghai for Science and Technology,Shanghai Engineering Research Center of Assistive Devices, School of Medical Instrument and Food Engineering
Feifei Lee
Qiu Chen
论文数: 0引用数: 0
h-index: 0
机构:University of Shanghai for Science and Technology,Shanghai Engineering Research Center of Assistive Devices, School of Medical Instrument and Food Engineering
机构:
Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R ChinaHuazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
Guo, Xiayu
Lin, Xian
论文数: 0引用数: 0
h-index: 0
机构:
Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R ChinaHuazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
Lin, Xian
Yang, Xin
论文数: 0引用数: 0
h-index: 0
机构:
Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R ChinaHuazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
Yang, Xin
Yu, Li
论文数: 0引用数: 0
h-index: 0
机构:
Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R ChinaHuazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
Yu, Li
Cheng, Kwang-Ting
论文数: 0引用数: 0
h-index: 0
机构:
Hong Kong Univ Sci & Technol, Sch Engn, Kowloon, Hong Kong, Peoples R ChinaHuazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
Cheng, Kwang-Ting
Yan, Zengqiang
论文数: 0引用数: 0
h-index: 0
机构:
Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R ChinaHuazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China