Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis

Cited by: 8
Authors
Yan, Jiawei [1, 2]
Zhao, Jianqing [1, 2]
Cai, Yucheng [1, 2]
Wang, Suwan [1, 2]
Qiu, Xiaolei [1, 2]
Yao, Xia [1, 2, 3]
Tian, Yongchao [1, 4]
Zhu, Yan [1, 2]
Cao, Weixing [1, 2]
Zhang, Xiaohu [1, 2, 4]
Affiliations
[1] Nanjing Agr Univ, Natl Engn & Technol Ctr Informat Agr, Nanjing 210095, Peoples R China
[2] Minist Agr & Rural Affairs, Key Lab Crop Syst Anal & Decis Making, Nanjing 210095, Peoples R China
[3] Jiangsu Key Lab Informat Agr, Nanjing 210095, Peoples R China
[4] Jiangsu Collaborat Innovat Ctr Modern Crop Prod, Nanjing 210095, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Wheat spike detection; Deep learning network; Attention score; Interpretive analysis; Convolutional neural networks; Small object detection; Improved YOLOv5
DOI
10.1186/s13007-023-01020-2
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Background: Detecting and counting wheat spikes is essential for predicting and measuring wheat yield. However, current wheat spike detection studies often apply new network structures directly. Few studies use prior knowledge of wheat spike size characteristics to design a suitable detection model, and it remains unclear whether the complex detection layers of the network play their intended role.

Results: This study proposes an interpretive analysis method for quantitatively evaluating the role of the three-scale detection layers in a deep learning-based wheat spike detection model. The attention scores of each detection layer in the YOLOv5 network are calculated with the Gradient-weighted Class Activation Mapping (Grad-CAM) algorithm by comparing the labeled wheat spike bounding boxes with the attention areas of the network. Refining the multi-scale detection layers according to these attention scores yields a better wheat spike detection network. Experiments on the Global Wheat Head Detection (GWHD) dataset show that the large-scale detection layer performs poorly, while the medium-scale detection layer performs best of the three. Consequently, the large-scale detection layer is removed, a micro-scale detection layer is added, and the feature extraction ability of the medium-scale detection layer is enhanced. The refined model increases detection accuracy and reduces network complexity by decreasing the number of network parameters.

Conclusion: The proposed interpretive analysis method evaluates the contribution of different detection layers in the wheat spike detection network and provides a correct network improvement scheme. The findings of this study offer a useful reference for future applications of deep network refinement in this field.
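
The idea of scoring a detection layer by how well its attention area agrees with the labeled boxes can be illustrated with a small sketch. The abstract does not give the exact formula, so the definition used here (the share of heatmap activation that falls inside the labeled bounding boxes) is an assumption, as are the function name `attention_score`, the box convention, and the toy heatmaps, which merely stand in for per-layer Grad-CAM outputs.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical box convention: (x_min, y_min, x_max, y_max), max edges exclusive.
Box = Tuple[int, int, int, int]


def attention_score(heatmap: Sequence[Sequence[float]], boxes: List[Box]) -> float:
    """Assumed score: fraction of total heatmap activation inside any labeled box."""
    height = len(heatmap)
    width = len(heatmap[0]) if height else 0
    # Mark every pixel covered by at least one box, clipping boxes to the image.
    inside = [[False] * width for _ in range(height)]
    for x0, y0, x1, y1 in boxes:
        for y in range(max(0, y0), min(height, y1)):
            for x in range(max(0, x0), min(width, x1)):
                inside[y][x] = True
    total = sum(sum(row) for row in heatmap)
    if total <= 0:
        return 0.0  # no activation at all, so nothing to attribute
    covered = sum(
        heatmap[y][x] for y in range(height) for x in range(width) if inside[y][x]
    )
    return covered / total


# Toy 4x4 heatmaps standing in for the Grad-CAM output of two detection layers.
toy_heatmaps: Dict[str, List[List[float]]] = {
    "layer_a": [[0, 0, 0, 0], [0, 1, 1, 0], [0, 1, 1, 0], [0, 0, 0, 0]],
    "layer_b": [[1, 1, 1, 1], [1, 1, 1, 1], [1, 1, 1, 1], [1, 1, 1, 1]],
}
toy_boxes: List[Box] = [(1, 1, 3, 3)]

scores = {name: attention_score(h, toy_boxes) for name, h in toy_heatmaps.items()}
for name, score in scores.items():
    print(f"{name}: {score:.2f}")
```

Under this assumed definition, a layer whose activation is concentrated on the labeled spikes scores close to 1, while a layer with diffuse attention scores lower, which is the kind of signal that could motivate removing or strengthening a detection layer.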
Pages: 13
Related papers
50 in total
  • [1] Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
    Jiawei Yan
    Jianqing Zhao
    Yucheng Cai
    Suwan Wang
    Xiaolei Qiu
    Xia Yao
    Yongchao Tian
    Yan Zhu
    Weixing Cao
    Xiaohu Zhang
    [J]. Plant Methods, 19
  • [2] MDFN: Multi-scale deep feature learning network for object detection
    Ma, Wenchi
    Wu, Yuanwei
    Cen, Feng
    Wang, Guanghui
    [J]. PATTERN RECOGNITION, 2020, 100
  • [3] Improving YOLOX network for multi-scale fire detection
    Wang, Taofang
    Wang, Jun
    Wang, Chao
    Lei, Yi
    Cao, Rui
    Wang, Li
    [J]. VISUAL COMPUTER, 2024, 40 (09): : 6493 - 6505
  • [4] MULTI-SCALE ENHANCED DEEP NETWORK FOR ROAD DETECTION
    Lu, Xiaoyan
    Zhong, Yanfei
    Zhao, Ji
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 3947 - 3950
  • [5] Adaptive aerial object detection based on multi-scale deep learning
    Liu, Fang
    Han, Xiao
    [J]. Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2022, 43 (05):
  • [6] Deep Learning for Multi-scale Object Detection: A Survey
    Chen, Ke-Qi
    Zhu, Zhi-Liang
    Deng, Xiao-Ming
    Ma, Cui-Xia
    Wang, Hong-An
    [J]. Ruan Jian Xue Bao/Journal of Software, 2021, 32 (04): : 1201 - 1227
  • [7] Multi-scale Deep Learning for Gesture Detection and Localization
    Neverova, Natalia
    Wolf, Christian
    Taylor, Graham W.
    Nebout, Florian
    [J]. COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 474 - 490
  • [8] Multi-scale Deep Representation Learning for Face Detection
    Han, Jifei
    Lu, Jiwen
    Feng, Jianjiang
    Zhou, Jie
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [9] A Network Intrusion Detection Method Based on Deep Multi-scale Convolutional Neural Network
    Wang, Xiaowei
    Yin, Shoulin
    Li, Hang
    Wang, Jiachi
    Teng, Lin
    [J]. INTERNATIONAL JOURNAL OF WIRELESS INFORMATION NETWORKS, 2020, 27 (04) : 503 - 517