Improved YOLOv5 for Aerial Images Based on Attention Mechanism
Cited by: 1
Authors:
Li, Zebin [1,2,3]
Fan, Bangkui [1,2]
Xu, Yulong [2]
Sun, Renwu [2]
Affiliations:
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[2] Intelligent Percept & Proc Technol Lab, Beijing 100124, Peoples R China
[3] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
Object detection on unmanned aerial vehicle (UAV) platforms is essential for both engineering and research. Complex scale problems in UAV application scenarios demand strong regression localization capability from object detection algorithms. However, the constraints of the UAV platform make it difficult to increase accuracy by deepening the network. This paper therefore presents an improved YOLOv5 with an attention mechanism, consisting of a Convolution-Swin Transformer Block (CSTB), which uses the Swin Transformer, and a Convolutional Block Attention Module (CBAM), to improve the network's localization accuracy. In addition, the paper incorporates a Bidirectional Feature Pyramid Network (BiFPN), Spatial Pyramid Pooling-Fast (SPPF), and other network components to increase average precision while keeping the model size limited. Experiments on the VisDrone2019 dataset show that the proposed approach raises the mean Average Precision (mAP) by 5.4% compared to YOLOv5, with only an 18% increase in model size.
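The abstract names BiFPN as one of the added components but does not say how the features are fused. As an illustration only, the sketch below shows the "fast normalized fusion" rule from the EfficientDet work that introduced BiFPN: each input feature gets a learnable non-negative weight, and the output is the weighted sum divided by the sum of the weights plus a small epsilon. Whether this paper uses exactly this rule is an assumption. The function name `fast_normalized_fusion`, the flat-list feature representation, and the sample values are hypothetical and chosen to keep the example dependency-free.

```python
from typing import List, Sequence


def fast_normalized_fusion(
    features: Sequence[Sequence[float]],
    weights: Sequence[float],
    eps: float = 1e-4,
) -> List[float]:
    """Fuse same-shaped feature vectors with non-negative normalized weights.

    Hypothetical helper: real BiFPN layers operate on multi-channel feature
    maps after resizing; flat lists stand in for those tensors here.
    """
    if not features or len(features) != len(weights):
        raise ValueError("need one weight per feature, and at least one feature")
    length = len(features[0])
    if any(len(f) != length for f in features):
        raise ValueError("all features must have the same length")

    # ReLU keeps every weight non-negative, so the normalization is stable.
    clipped = [max(0.0, w) for w in weights]
    total = eps + sum(clipped)

    # Weighted sum of the inputs, element by element, divided by the total.
    return [
        sum(w * f[k] for w, f in zip(clipped, features)) / total
        for k in range(length)
    ]


if __name__ == "__main__":
    # Illustrative values only, not taken from the paper.
    shallow = [1.0, 2.0, 3.0]
    deep = [3.0, 4.0, 5.0]
    print(fast_normalized_fusion([shallow, deep], weights=[0.7, 0.3]))
```

Because the weights are normalized by a simple sum rather than a softmax, the fusion adds very little computation, which fits the abstract's stated goal of raising precision while limiting model size.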