High-Accuracy Tomato Leaf Disease Image-Text Retrieval Method Utilizing LAFANet

被引:1
|
作者
Xu, Jiaxin [1 ]
Zhou, Hongliang [1 ]
Hu, Yufan [1 ]
Xue, Yongfei [1 ]
Zhou, Guoxiong [1 ]
Li, Liujun [2 ]
Dai, Weisi [1 ]
Li, Jinyang [1 ]
机构
[1] Cent South Univ Forestry & Technol, Coll Comp & Informat Engn, Changsha 410004, Peoples R China
[2] Univ Idaho, Dept Soil & Water Syst, Moscow, ID 83844 USA
来源
PLANTS-BASEL | 2024年 / 13卷 / 09期
关键词
LAFANet; TLDITRD; LFA; FNE-ANS; AR; image-text retrieval; cross-modal; SYSTEM;
D O I
10.3390/plants13091176
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Tomato leaf disease control in the field of smart agriculture urgently requires attention and reinforcement. This paper proposes a method called LAFANet for image-text retrieval, which integrates image and text information for joint analysis of multimodal data, helping agricultural practitioners to provide more comprehensive and in-depth diagnostic evidence to ensure the quality and yield of tomatoes. First, we focus on six common tomato leaf disease images and text descriptions, creating a Tomato Leaf Disease Image-Text Retrieval Dataset (TLDITRD), introducing image-text retrieval into the field of tomato leaf disease retrieval. Then, utilizing ViT and BERT models, we extract detailed image features and sequences of textual features, incorporating contextual information from image-text pairs. To address errors in image-text retrieval caused by complex backgrounds, we propose Learnable Fusion Attention (LFA) to amplify the fusion of textual and image features, thereby extracting substantial semantic insights from both modalities. To delve further into the semantic connections across various modalities, we propose a False Negative Elimination-Adversarial Negative Selection (FNE-ANS) approach. This method aims to identify adversarial negative instances that specifically target false negatives within the triplet function, thereby imposing constraints on the model. To bolster the model's capacity for generalization and precision, we propose Adversarial Regularization (AR). This approach involves incorporating adversarial perturbations during model training, thereby fortifying its resilience and adaptability to slight variations in input data. Experimental results show that, compared with existing ultramodern models, LAFANet outperformed existing models on TLDITRD dataset, with top1, top5, and top10 reaching 83.3% and 90.0%, and top1, top5, and top10 reaching 80.3%, 93.7%, and 96.3%. LAFANet offers fresh technical backing and algorithmic insights for the retrieval of tomato leaf disease through image-text correlation.
引用
收藏
页数:29
相关论文
共 39 条
  • [11] Automated method for feature-based image registration with high-accuracy
    Wen, Gong-Jian
    Lu, Jin-Jian
    Wang, Ji-Yang
    Ruan Jian Xue Bao/Journal of Software, 2008, 19 (09): : 2293 - 2301
  • [12] High-accuracy method for holographic image projection with suppressed speckle noise
    Pang, Hui
    Wang, Jiazhou
    Cao, Axiu
    Deng, Qiling
    OPTICS EXPRESS, 2016, 24 (20): : 22766 - 22776
  • [13] A TRANSFORMER-BASED CROSS-MODAL IMAGE-TEXT RETRIEVAL METHOD USING FEATURE DECOUPLING AND RECONSTRUCTION
    Zhang, Huan
    Sun, Yingzhi
    Liao, Yu
    Xu, SiYuan
    Yang, Rui
    Wang, Shuang
    Hou, Biao
    Jiao, Licheng
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1796 - 1799
  • [14] A High-Accuracy and Fast Retrieval Method of Atmospheric Parameters Based on Genetic-BP
    Tian, Jiasheng
    Shi, Jian
    IEEE ACCESS, 2022, 10 : 19458 - 19468
  • [15] DDR-Unet: A High-Accuracy and Efficient Ore Image Segmentation Method
    Li, Fei
    Liu, Xiaoyan
    Yin, Yufeng
    Li, Zongping
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [16] Simulation of cross-modal image-text retrieval algorithm under convolutional neural network structure and hash method
    Yang, XianBen
    Zhang, Wei
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (05): : 7106 - 7132
  • [17] High-Accuracy, High-Efficiency Compensation Method in Two-Dimensional Digital Image Correlation
    Xu, X.
    Zhang, Q.
    Su, Y.
    Cai, Y.
    Xue, W.
    Gao, Z.
    Xue, Y.
    Lv, Z.
    Fu, S.
    EXPERIMENTAL MECHANICS, 2017, 57 (06) : 831 - 846
  • [18] High-Accuracy and High-Efficiency Compensation Method in Two-Dimensional Digital Image Correlation
    Xu, Xiaohai
    Zhang, Qingchuan
    INTERNATIONAL DIGITAL IMAGING CORRELATION SOCIETY, 2017, : 63 - 65
  • [19] High-Accuracy, High-Efficiency Compensation Method in Two-Dimensional Digital Image Correlation
    X. Xu
    Q. Zhang
    Y. Su
    Y. Cai
    W. Xue
    Z. Gao
    Y. Xue
    Z. Lv
    S. Fu
    Experimental Mechanics, 2017, 57 : 831 - 846
  • [20] A high-accuracy method for simulating the XCO2 global distribution using GOSAT retrieval data
    MingWei Zhao
    XingYing Zhang
    TianXiang Yue
    Chun Wang
    Ling Jiang
    JingLu Sun
    Science China Earth Sciences, 2017, 60 : 143 - 155