Fine-Grained Classification of Wild Mushrooms Based on Feature Fusion and Attention Mechanism

被引:0
|
作者
Qian Jiaxin [1 ]
Yu Pengfei [1 ]
Li Haiyan [1 ]
Li Hongsong [1 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650500, Yunnan, Peoples R China
关键词
image recognition; fine-grained classification; feature fusion; attention mechanism; transfer learning;
D O I
10.3788/LOP212774
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Identifying the species of wild mushrooms is important to prevent mistaking the toxic type of mushrooms for nontoxic ones. Therefore, to improve the accuracy of the fine-grained classification of wild mushrooms, a parallel addition convolutional block attention module (PA_CBAM), which is improved from the convolutional block attention module (CBAM), is proposed. PA_CBAM changes the connections of the channel and spatial attention modules from serial to parallel and adds their results together. Consequently, the interference caused by cascading these attention modules is solved. In addition, the proposed method improves the performance of ResNet50 by referring to the concept of a feature pyramid, whose accuracies of the Top-1 and Top-5 are 86. 03% and 97. 19%, which are 0. 86 and 0. 73 percentage points higher than those of the original method, respectively. Furthermore, the Top-1 and Top-5 reach 88. 52% and 97. 58% using PA_CBAM, which are 3. 03 and 0. 69 percentage points higher, respectively. Moreover, to adapt the model for mobile terminals, combined with migration learning, the MobileNet_v2+PA_CBAM recognition method is proposed, obtaining an accuracy of 94. 87%, which is 0. 66 percentage points higher than that previously obtained. The results show that PA_CBAM has a better recognition and generalization effect in the fine-grained classification of wild mushrooms. Meanwhile, the size of MobileNet_v2+PA_CBAM is only 27. 8 MB, and the recognition time required for a picture is only 1. 3 ms, which is an ideal model for deploying wild mushrooms classification on mobile devices.
引用
收藏
页数:10
相关论文
共 28 条
  • [1] Real-Time Semantic Segmentation Network Based on Regional Self-Attention
    Bao Hailong
    Wan Min
    Liu Zhongxian
    Qin Mian
    Cui Haoyu
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (08)
  • [2] Mask-Wearing Detection Method Based on YOLO-Mask
    Cao Chengshuo
    Yuan Jie
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (08)
  • [3] CatoDogo, 2020, Mushrooms classification-Common genus's im-ages[EB/ OL]
  • [4] Research on Identification of Wild Mushroom Species Based on Improved Xception Transfer Learning
    Chen Degang
    Azragul
    Yin Pengbo
    Lu Yanuo
    Li Shunping
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (08)
  • [5] Image Super-Resolution Reconstruction Method Based on Self-Attention Deep Network
    Chen Zihan
    Wu Haobo
    Pei Haodong
    Chen Rong
    Hu Jiaxin
    Shi Hengtong
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (04)
  • [6] Fan Shuaichang, 2020, Chinese Journal of Sensors and Actuators, V33, P74, DOI 10.3969/j.issn.1004-1699.2020.01.014
  • [7] Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-grained Image Recognition
    Fu, Jianlong
    Zheng, Heliang
    Mei, Tao
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4476 - 4484
  • [8] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [9] Howard J., 2020, Deep Learning for Coders with fastai and PyTorch, VVolume 66
  • [10] Fastai: A Layered API for Deep Learning
    Howard, Jeremy
    Gugger, Sylvain
    [J]. INFORMATION, 2020, 11 (02)