Attention-based scale sequence network for small object detection

被引:2
|
作者
Lee, Young-Woon [1 ]
Kim, Byung-Gyu [2 ]
机构
[1] Sunmoon Univ, Dept Comp Engn, Asan, South Korea
[2] Sookmyung Womens Univ, Div Artificial Intelligence Engn, Seoul, South Korea
关键词
Small object detection; Feature pyramid network; Scale sequence; Attention mechanism; Deep learning;
D O I
10.1016/j.heliyon.2024.e32931
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recently, with the remarkable development of deep learning technology, achievements are being updated in various computer vision fields. In particular, the object recognition field is receiving the most attention. Nevertheless, recognition performance for small objects is still challenging. Its performance is of utmost importance in realistic applications such as searching for missing persons through aerial photography. The core structure of the object recognition neural network is the feature pyramid network (FPN). You Only Look Once (YOLO) is the most widely used representative model following this structure. In this study, we proposed an attention-based scale sequence network (ASSN) that improves the scale sequence feature pyramid network (ssFPN), enhancing the performance of the FPN-based detector for small objects. ASSN is a lightweight attention module optimized for FPN-based detectors and has the versatility to be applied to any model with a corresponding structure. The proposed ASSN demonstrated performance improvements compared to the baselines (YOLOv7 and YOLOv8) in average precision (AP) of up to 0.6%. Additionally, the AP for small objects (AP(s)) showed also improvements of up to 1.9%. Furthermore, ASSN exhibits higher performance than ssFPN while achieving lightweightness and optimization, thereby improving computational complexity and processing speed. ASSN is open-source based on YOLO version 7 and 8. This can be found in our public repository: https://github.com/smu-ivpl/ASSN.git
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Channel and spatial attention-based Siamese network for visual object tracking
    Tian, Shishun
    Chen, Zixi
    Chen, Bolin
    Zou, Wenbin
    Li, Xia
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (03)
  • [32] Pay "Attention" to Adverse Weather: Weather-aware Attention-based Object Detection
    Chaturvedi, Saket S.
    Zhang, Lan
    Yuan, Xiaoyong
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4573 - 4579
  • [33] RETRACTED: A Novel Attention-Based Lightweight Network for Multiscale Object Detection in Underwater Images (Retracted Article)
    Wang, Jinkang
    He, Xiaohui
    Shao, Faming
    Lu, Guanlin
    Jiang, Qunyan
    Hu, Ruizhe
    Li, Jinxin
    JOURNAL OF SENSORS, 2022, 2022
  • [34] Attention-based Proposals Refinement for 3D Object Detection
    Minh-Quan Dao
    Hery, Elwan
    Fremont, Vincent
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 197 - 205
  • [35] Attention-based object detection with saliency loss in remote sensing images
    Wu, Qin
    Yuan, Xingxing
    Yao, Zikang
    Chai, Zhilei
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (01)
  • [36] Scale-Aware Attention-Based PillarsNet (SAPN) Based 3D Object Detection for Point Cloud
    Song, Xiang
    Zhan, Weiqin
    Che, Xiaoyu
    Jiang, Huilin
    Yang, Biao
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [37] Indoor Climate Prediction Using Attention-Based Sequence-to- Sequence Neural Network
    Setiawan, Karli Eka
    Elwirehardja, Gregorius N.
    Pardamean, Bens
    CIVIL ENGINEERING JOURNAL-TEHRAN, 2023, 9 (05): : 1105 - 1120
  • [38] Tagging Malware Intentions by Using Attention-Based Sequence-to-Sequence Neural Network
    Huang, Yi-Ting
    Chen, Yu-Yuan
    Yang, Chih-Chun
    Sun, Yeali
    Hsiao, Shun-Wen
    Chen, Meng Chang
    INFORMATION SECURITY AND PRIVACY, ACISP 2019, 2019, 11547 : 660 - 668
  • [39] Attention-Based Network for Weak Labels in Neonatal Seizure Detection
    Isaev, Dmitry Yu.
    Tchapyjnikov, Dmitry
    Cotten, C. Michael
    Tanaka, David
    Martinez, Natalia
    Bertran, Martin
    Sapiro, Guillermo
    Carlson, David
    MACHINE LEARNING FOR HEALTHCARE CONFERENCE, VOL 126, 2020, 126 : 479 - 506
  • [40] Automatic epilepsy detection with an attention-based multiscale residual network
    Wang X.
    Li M.
    Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2024, 41 (02): : 253 - 261