Human-Guided Data Augmentation via Diffusion Model for Surface Defect Recognition Under Limited Data

被引:0
|
作者
Fang, Tiyu [1 ]
Zhang, Mingxin [1 ]
Song, Ran [1 ]
Li, Xiaolei [1 ]
Wei, Zhiyuan [1 ]
Zhang, Wei [1 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
基金
中国国家自然科学基金;
关键词
Diffusion models; Generative adversarial networks; Training; Data augmentation; Image segmentation; Training data; Reinforcement learning; Image synthesis; Data models; Tires; Diffusion model; reinforcement learning (RL) from human feedback; surface defect recognition (SDR); INSPECTION;
D O I
10.1109/TIM.2025.3541684
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Surface defect recognition (SDR) with limited data is a common challenge in industrial production. Recent methods generally utilize generative adversarial networks (GANs) to generate defect samples as training data for improving the performance of SDR. However, the instability of GAN training often results in uncontrollable and low-quality samples under severe data constraints, making it difficult for the existing methods to effectively handle SDR tasks with different granularities. To address the issue, this article proposes a human-guided data augmentation method under extremely limited data. Its core idea is to introduce human feedback into a diffusion model for synthesizing controllable and high-quality defect samples by reinforcement learning (RL), aiming to improve various-granularity SDR tasks such as defect classification and segmentation. First, a conditional diffusion model (CDM) is constructed to generate controllable defect samples using semantic labels, which learn defect distribution from a small number of annotated defect samples. Then, a reward model is designed to evaluate the outcome of the CDM by human feedback. Next, based on the trained reward model, the CDM is further optimized by proximal policy optimization (PPO). Finally, the refined CDM is used to generate high-quality defect samples as training data for enhancing defect classification and segmentation. Extensive experiments on NEU-Seg, magnetic-tile (MT), and the collected Tire datasets demonstrate that our method outperforms the state-of-the-art generative methods in terms of generated image quality. Furthermore, the performance of defect classification and segmentation has also shown significant enhancements based on the generated samples, with a maximum improvement of 16.90% in accuracy and 12.85% in mean intersection over union (mIoU) compared to results obtained without data augmentation.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Generative Data Augmentation Guided by Triplet Loss for Speech Emotion Recognition
    Wang, Shijun
    Hemati, Hamed
    Gudnason, Jon
    Borth, Damian
    INTERSPEECH 2022, 2022, : 391 - 395
  • [32] Label-Guided Data Augmentation for Chinese Named Entity Recognition
    Jiang, Miao
    Chen, Honghui
    APPLIED SCIENCES-BASEL, 2025, 15 (05):
  • [33] Data Augmentation in Earth Observation: A Diffusion Model Approach
    Sousa, Tiago
    Ries, Benoit
    Guelfi, Nicolas
    INFORMATION, 2025, 16 (02)
  • [34] Phased Data Augmentation for Training a Likelihood-Based Generative Model with Limited Data
    Mimura, Yuta
    ITE TRANSACTIONS ON MEDIA TECHNOLOGY AND APPLICATIONS, 2025, 13 (01): : 126 - 135
  • [35] Voice Conversion Based Data Augmentation to Improve Children's Speech Recognition in Limited Data Scenario
    Shahnawazuddin, S.
    Adiga, Nagaraj
    Kumar, Kunal
    Poddar, Aayushi
    Ahmad, Waquar
    INTERSPEECH 2020, 2020, : 4382 - 4386
  • [36] Synthetic data augmentation by diffusion probabilistic models to enhance weed recognition
    Chen, Dong
    Qi, Xinda
    Zheng, Yu
    Lu, Yuzhen
    Huang, Yanbo
    Li, Zhaojian
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 216
  • [37] An Interpretable Latent Denoising Diffusion Probabilistic Model for Fault Diagnosis Under Limited Data
    Zhang, Tian
    Lin, Jing
    Jiao, Jinyang
    Zhang, Han
    Li, Hao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (08) : 10354 - 10365
  • [38] A GAN-based data augmentation method for human activity recognition via the caching ability
    Shi, Junhao
    Zuo, Decheng
    Zhang, Zhan
    INTERNET TECHNOLOGY LETTERS, 2021, 4 (05)
  • [39] Cross-domain attention-guided generative data augmentation for medical image analysis with limited data
    Xu, Zhenghua
    Tang, Jiaqi
    Qi, Chang
    Yao, Dan
    Liu, Caihua
    Zhan, Yuefu
    Lukasiewicz, Thomas
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 168
  • [40] Unsupervised Surface Defect Detection Using Deep Autoencoders and Data Augmentation
    Mujeeb, Abdul
    Dai, Wenting
    Erdt, Marius
    Sourin, Alexei
    2018 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2018, : 391 - 398