Fine-Grained Image Recognition of Wild Mushroom Based on Multiscale Feature Guide

被引:5
|
作者
Zhang Zhigang [1 ]
Yu Pengfei [1 ]
Li Haiyan [1 ]
Li Hongsong [1 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650500, Yunnan, Peoples R China
基金
美国国家航空航天局;
关键词
image recognition; fine-grained; multi-scale; feature guide; attention mechanism; joint feature;
D O I
10.3788/LOP202259.1210016
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep learning technology is proposed to solve the social problem of the frequent occurrences of wild mushroom poisoning in China. However, due to the small difference between classes and complex image backgrounds, fine-grained recognition accuracy is low. To solve this problem, this paper proposes an improved ResNeXt50 network. First, a multiscale feature guide (MSFG) module is designed, which guides the network to learn and use low and high-level features fully through short connections. Then, the improved attention mechanism module is used to reduce the network's learning for complex backgrounds. Finally, the different hierarchical features in the model are fused, and the obtained joint features are used for recognition. Experimental results show that the accuracy of the proposed network on the test set can reach 96.47%, which is 2.64 percentage points higher than the unimproved ResNeXt50 network. Comparison results show that the accuracy of the improved network model is 8.10 percentage points, 5.13 percentage points, 3.24 percentage points, 3.30 percentage points, and 4.25 percentage points better than VGG19, DenseNet121, Inception_v3, ResNet50, and ShuffleNet_v2, respectively.
引用
收藏
页数:10
相关论文
共 26 条
  • [1] Chen D G, 2021, LASER OPTOELECTRON P, V58
  • [2] Recognition of Waterborne Pathogens Based on Spectral Similarity Analysis
    Feng Chun
    Zhao Nanjing
    Yin Gaofang
    Gan Tingting
    Chen Min
    Yang Jinqiang
    Liu Jianguo
    Liu Wenqing
    [J]. ACTA OPTICA SINICA, 2020, 40 (03)
  • [3] Rare earth element composition of Paleogene vertebrate fossils from Toadstool Geologic Park, Nebraska, USA
    Grandstaff, D. E.
    Terry, D. O., Jr.
    [J]. APPLIED GEOCHEMISTRY, 2009, 24 (04) : 733 - 745
  • [4] Gui M Y, 2014, EDIBLE FUNGI, V36, P14
  • [5] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [6] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
  • [7] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
  • [8] Li Y X, 2021, LASER OPTOELECTRON P, V58
  • [9] Liu B, 2015, Software Guide, V14, P60
  • [10] A Simple Pooling-Based Design for Real-Time Salient Object Detection
    Liu, Jiang-Jiang
    Hou, Qibin
    Cheng, Ming-Ming
    Feng, Jiashi
    Jiang, Jianmin
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3912 - 3921