Crop Disease Diagnosis with Deep Learning-Based Image Captioning and Object Detection

Cited by: 8
Authors
Lee, Dong In [1]
Lee, Ji Hwan [2]
Jang, Seung Ho [3]
Oh, Se Jong [4]
Doo, Ill Chul [4]
Affiliations
[1] Hankuk Univ Foreign Studies, Comp & Elect Syst Engn, Yongin 17035, South Korea
[2] Hankuk Univ Foreign Studies, Artificial Intelligence Convergence, Yongin 17035, South Korea
[3] Hankuk Univ Foreign Studies, Stat, Yongin 17035, South Korea
[4] Hankuk Univ Foreign Studies, Artificial Intelligence Educ, Yongin 17035, South Korea
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 05
Funding
National Research Foundation, Singapore;
Keywords
crop diseases diagnosis; farm-tech; deep learning; Inceptionv3; transformer; image captioning; YOLOv5; object detection;
DOI
10.3390/app13053148
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
The number of people participating in urban farming, and the size of its market, have been increasing recently. However, the technologies that assist novice farmers are still limited. Several deep learning-based crop disease diagnosis solutions have been researched previously, but these techniques focus only on CNN-based disease detection and do not explain the characteristics of disease symptoms according to severity. To prevent the spread of diseases in crops, it is important to identify the characteristics of these symptoms in advance and respond to them as soon as possible. Therefore, we propose an improved crop disease diagnosis solution that can give practical help to novice farmers. The proposed solution consists of two representative deep learning-based methods: Image Captioning and Object Detection. The Image Captioning model describes the prominent symptoms of the disease in detail, according to severity, by generating diagnostic sentences that are grammatically correct and semantically comprehensible, and it also presents the accurate name of the disease. Meanwhile, the Object Detection model detects the infected area to help farmers recognize which part is damaged and to assure them of the accuracy of the diagnostic sentence generated by the Image Captioning model. The Image Captioning model employs the InceptionV3 model as an encoder and the Transformer model as a decoder, while the Object Detection model employs the YOLOv5 model. The average BLEU score of the Image Captioning model is 64.96%, which can be considered high sentence-generation performance, while the mAP50 of the Object Detection model is 0.382, which requires further improvement. These results indicate that the proposed solution provides precise and detailed information about crop diseases, thereby increasing the overall reliability of the diagnosis.
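The mAP50 figure quoted in the abstract is a mean average precision computed at an intersection-over-union (IoU) threshold of 0.5. As a rough illustration only, and not the authors' evaluation code, the sketch below shows the IoU matching test that underlies such a metric. The function names `iou` and `is_match`, and the (x1, y1, x2, y2) corner format for boxes, are assumptions made for this example.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes.

    Boxes are assumed to be (x1, y1, x2, y2) with x1 < x2 and y1 < y2.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap region, clamped at zero when disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter

    return inter / union if union > 0 else 0.0


def is_match(pred_box, truth_box, threshold=0.5):
    """True when a predicted box overlaps a ground-truth box enough
    to count as a correct detection at the given IoU threshold."""
    return iou(pred_box, truth_box) >= threshold
```

For instance, a predicted lesion box of (0, 0, 4, 4) against a labeled box of (0, 0, 4, 3) has an IoU of 0.75 and would count as a match, whereas two equally sized boxes that overlap by only half their width would not.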
Pages: 19
Related papers
50 in total
  • [1] Deep learning-based solar image captioning
    Baek, Ji-Hye
    Kim, Sujin
    Choi, Seonghwan
    Park, Jongyeob
    Kim, Dongil
    [J]. ADVANCES IN SPACE RESEARCH, 2024, 73 (06) : 3270 - 3281
  • [2] Deep Learning-based Object Detection for Crop Monitoring in Soybean Fields
    Pratama, Muhammad Taufiq
    Kim, Sangwook
    Ozawa, Seiichi
    Ohkawa, Takenao
    Chona, Yuya
    Tsuji, Hiroyuki
    Murakami, Noriyuki
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [3] A survey of deep learning-based object detection methods in crop counting
    Huang, Yuning
    Qian, Yurong
    Wei, Hongyang
    Lu, Yiguo
    Ling, Bowen
    Qin, Yugang
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 215
  • [4] Deep Learning-Based Thermal Image Reconstruction and Object Detection
    Batchuluun, Ganbayar
    Kang, Jin Kyu
    Nguyen, Dat Tien
    Pham, Tuyen Danh
    Arsalan, Muhammad
    Park, Kang Ryoung
    [J]. IEEE ACCESS, 2021, 9 : 5951 - 5971
  • [5] Deep Reinforcement Learning-based Image Captioning with Embedding Reward
    Ren, Zhou
    Wang, Xiaoyu
    Zhang, Ning
    Lv, Xutao
    Li, Li-Jia
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017 : 1151 - 1159
  • [6] Deep Learning-Based Object Detection Improvement for Tomato Disease
    Zhang, Yang
    Song, Chenglong
    Zhang, Dongwen
    [J]. IEEE ACCESS, 2020, 8 : 56607 - 56614
  • [7] Transformer based Multitask Learning for Image Captioning and Object Detection
    Basak, Debolena
    Srijith, P. K.
    Desarkar, Maunendra Sankar
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT II, PAKDD 2024, 2024, 14646 : 260 - 272
  • [8] Oppositional Harris Hawks Optimization with Deep Learning-Based Image Captioning
    Kavitha, V. R.
    Nimala, K.
    Beno, A.
    Ramya, K. C.
    Kadry, Seifedine
    Kang, Byeong-Gwon
    Nam, Yunyoung
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (01) : 579 - 593
  • [9] A Survey of Deep Learning-Based Object Detection
    Jiao, Licheng
    Zhang, Fan
    Liu, Fang
    Yang, Shuyuan
    Li, Lingling
    Feng, Zhixi
    Qu, Rong
    [J]. IEEE ACCESS, 2019, 7 : 128837 - 128868
  • [10] From Show to Tell: A Survey on Deep Learning-Based Image Captioning
    Stefanini, Matteo
    Cornia, Marcella
    Baraldi, Lorenzo
    Cascianelli, Silvia
    Fiameni, Giuseppe
    Cucchiara, Rita
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 539 - 559