Synthetic data augmentation by diffusion probabilistic models to enhance weed recognition

被引:15
|
作者
Chen, Dong [1 ]
Qi, Xinda [1 ]
Zheng, Yu [1 ]
Lu, Yuzhen [2 ]
Huang, Yanbo [3 ]
Li, Zhaojian [4 ]
机构
[1] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48824 USA
[2] Michigan State Univ, Dept Biosyst & Agr Engn, E Lansing, MI 48824 USA
[3] USDA ARS, Genet & Sustainable Agr Res Unit, Starkville, MS 39762 USA
[4] Michigan State Univ, Dept Mech Engn, E Lansing, MI 48824 USA
关键词
Computer vision; Data augmentation; Deep learning; Generative modeling; Precision weed management; Site-specific weed control; GENERATIVE ADVERSARIAL NETWORKS;
D O I
10.1016/j.compag.2023.108517
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Weed management plays an important role in crop yield and quality protection. Conventional weed control methods largely rely on intensive, blanket herbicide application, which incurs significant management costs and poses hazards to the environment and human health. Machine vision-based automated weeding has gained increasing attention for sustainable weed management through weed recognition and site-specific treatments. However, it remains a challenging task to reliably recognize weeds in variable field conditions, in part due to the difficulty curating large-scale, expert-labeled weed image datasets for supervised training of weed recognition algorithms. Data augmentation methods, including traditional geometric/color transformations and more advanced generative adversarial networks (GANs) can supplement data collection and labeling efforts by algorithmically expanding the scale of datasets. Recently, diffusion models have emerged in the field of image synthesis, providing a new means for augmenting image datasets to power machine vision systems. This study presents a novel investigation of the efficacy of diffusion models for generating weed images to enhance weed identification. Experiments on two public multi-class large weed datasets showed that diffusion models yielded the best trade-off between sample fidelity and diversity and obtained the highest Fre ' chet Inception Distance, compared to GANs (BigGAN, StyleGAN2, StyleGAN3). For instance, on a ten-class weed dataset (CottonWeedID10), the inclusion of synthetic weed images led to improvements by 1.17% (97.30% to 98.47), 1.21% (97.92% to 99.13%), and 2.30% (96.06% to 98.27%) in accuracy, precision, and recall, respectively, in weed classification by four deep learning models (i.e., VGG16, Inception-v3, Inception-v3, and ResNet50). Models trained using only 10% of real images with the remainder being synthetic data resulted in testing accuracy exceeding 94%.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Data Augmentation for Training Dialog Models Robust to Speech Recognition Errors
    Wang, Longshaokan
    Fazel-zarandi, Maryam
    Tiwari, Aditya
    Matsoukas, Spyros
    Polymenakos, Lazaros
    NLP FOR CONVERSATIONAL AI, 2020, : 63 - 70
  • [22] Erasing-inpainting-based data augmentation using denoising diffusion probabilistic models with limited samples for generalized surface defect inspection
    Tao, Huanjie
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2024, 208
  • [23] On Calibrating Diffusion Probabilistic Models
    Pang, Tianyu
    Lu, Cheng
    Du, Chao
    Lin, Min
    Yan, Shuicheng
    Deng, Zhijie
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [24] Weed Image Augmentation by ControlNet-Added Stable Diffusion
    Deng, Boyang
    Lu, Yuzhen
    SYNTHETIC DATA FOR ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING: TOOLS, TECHNIQUES, AND APPLICATIONS II, 2024, 13035
  • [25] Hedging as Reward Augmentation in Probabilistic Graphical Models
    Bhattacharjya, Debarun
    Marinescu, Radu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [26] Upsampling Aggregated Network Traffic Data with Denoising Diffusion Probabilistic Models
    Dupuis, Nicolas
    Van Damme, Axel
    Dierickx, Philippe
    Delaby, Olivier
    PROCEEDINGS OF 2024 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, NOMS 2024, 2024,
  • [27] Data augmentation for face recognition
    Lv, Jiang-Jing
    Shao, Xiao-Hu
    Huang, Jia-Shui
    Zhou, Xiang-Dong
    Zhou, Xi
    NEUROCOMPUTING, 2017, 230 : 184 - 196
  • [28] A Novel Data Augmentation Method Based on Denoising Diffusion Probabilistic Model for Fault Diagnosis Under Imbalanced Data
    Yang, Xiongyan
    Ye, Tianyi
    Yuan, Xianfeng
    Zhu, Weijie
    Mei, Xiaoxue
    Zhou, Fengyu
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (05) : 7820 - 7831
  • [29] Using denoising diffusion probabilistic models to enhance quality of limited-view photoacoustic tomography
    De Santi, Bruno
    Awasthi, Navchetan
    Manohar, Srirang
    PHOTONS PLUS ULTRASOUND: IMAGING AND SENSING 2024, 2024, 12842
  • [30] Improving Art Style Classification Through Data Augmentation Using Diffusion Models
    Moyano, Miguel angel Martin
    Garcia-Aguilar, Ivan
    Lopez-Rubio, Ezequiel
    Luque-Baena, Rafael M.
    ELECTRONICS, 2024, 13 (24):