Improving machine learning-based bitewing segmentation with synthetic data

被引:0
|
作者
Tolstaya, Ekaterina [1 ]
Tichy, Antonin [1 ,2 ,3 ]
Paris, Sebastian [4 ]
Schwendicke, Falk [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, LMU Univ Hosp, Dept Conservat Dent & Periodontol, Goethestr 70, D-80336 Munich, Germany
[2] Charles Univ Prague, Inst Dent Med, Fac Med 1, Karlovo Namesti 32, Prague 12111, Czech Republic
[3] Gen Univ Hosp Prague, Karlovo Namesti 32, Prague 12111, Czech Republic
[4] Charite Univ Med Berlin, Operat & Prevent Dent, Assmannshauser Str 4-6, D-14197 Berlin, Germany
关键词
Artificial intelligence; Dentistry; Generative adversarial network; Diffusion model; Dataset imbalance; Synthetic medical data; NETWORKS;
D O I
10.1016/j.jdent.2025.105679
中图分类号
R78 [口腔科学];
学科分类号
1003 ;
摘要
Objectives: Class imbalance in datasets is one of the challenges of machine learning (ML) in medical image analysis. We employed synthetic data to overcome class imbalance when segmenting bitewing radiographs as an exemplary task for using ML. Methods: After segmenting bitewings into classes, i.e. dental structures, restorations, and background, the pixellevel representation of implants in the training set (1543 bitewings) and testing set (177 bitewings) was 0.03 % and 0.07 %, respectively. A diffusion model and a generative adversarial network (pix2pix) were used to generate a dataset synthetically enriched in implants. A U-Net segmentation model was trained on (1) the original dataset, (2) the synthetic dataset, (3) on the synthetic dataset and fine-tuned on the original dataset, or (4) on a dataset which was na & iuml;vely oversampled with images containing implants. Results: U-Net trained on the original dataset was unable to segment implants in the testing set. Model performance was significantly improved by na & iuml;ve over-sampling, achieving the highest precision. The model trained only on synthetic data performed worse than na & iuml;ve over-sampling in all metrics, but with fine-tuning on original data, it resulted in the highest Dice score, recall, F1 score and ROC AUC, respectively. The performance on other classes than implants was similar for all strategies except training only on synthetic data, which tended to perform worse. Conclusions: The use of synthetic data alone may deteriorate the performance of segmentation models. However, fine-tuning on original data could significantly enhance model performance, especially for heavily underrepresented classes. Clinical significance: This study explored the use of synthetic data to enhance segmentation of bitewing radiographs, focusing on underrepresented classes like implants. Pre-training on synthetic data followed by finetuning on original data yielded the best results, highlighting the potential of synthetic data to advance AIdriven dental imaging and ultimately support clinical decision-making.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] On the usability of synthetic data for improving the robustness of deep learning-based segmentation of cardiac magnetic resonance images
    Al Khalil, Yasmina
    Amirrajab, Sina
    Lorenz, Cristian
    Weese, Juergen
    Pluim, Josien
    Breeuwer, Marcel
    MEDICAL IMAGE ANALYSIS, 2023, 84
  • [2] Using Synthetic Training Data for Deep Learning-Based GBM Segmentation
    Lindner, Lydia
    Narnhofer, Dominik
    Weber, Maximilian
    Gsaxner, Christina
    Kolodziej, Malgorzata
    Egger, Jan
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 6724 - 6729
  • [3] Dental bitewing radiographs segmentation using deep learning-based convolutional neural network algorithms
    Bonny, Talal
    Al-Ali, Abdelaziz
    Al-Ali, Mohammed
    Alsaadi, Rashid
    Al Nassan, Wafaa
    Obaideen, Khaled
    AlMallahi, Maryam
    ORAL RADIOLOGY, 2023, 40 (2) : 165 - 177
  • [4] Big Data Landscapes: Improving the Visualization of Machine Learning-based Clustering Algorithms
    Kammer, Dietrich
    Keck, Mandy
    Gruender, Thomas
    Groh, Rainer
    AVI'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON ADVANCED VISUAL INTERFACES, 2018,
  • [5] Dental bitewing radiographs segmentation using deep learning-based convolutional neural network algorithms
    Talal Bonny
    Abdelaziz Al-Ali
    Mohammed Al-Ali
    Rashid Alsaadi
    Wafaa Al Nassan
    Khaled Obaideen
    Maryam AlMallahi
    Oral Radiology, 2024, 40 : 165 - 177
  • [6] Machine learning-based segmentation of aerial LiDAR point cloud data on building roof
    Dey, Emon Kumar
    Awrangjeb, Mohammad
    Kurdi, Fayez Tarsha
    Stantic, Bela
    EUROPEAN JOURNAL OF REMOTE SENSING, 2023, 56 (01)
  • [7] Improving CNV Detection Performance in Microarray Data Using a Machine Learning-Based Approach
    Goh, Chul Jun
    Kwon, Hyuk-Jung
    Kim, Yoonhee
    Jung, Seunghee
    Park, Jiwoo
    Lee, Isaac Kise
    Park, Bo-Ram
    Kim, Myeong-Ji
    Kim, Min-Jeong
    Lee, Min-Seob
    DIAGNOSTICS, 2024, 14 (01)
  • [8] Machine learning-based inversion for acoustic impedance with large synthetic training data: Workflow and data characterization
    Zeng, Hongliu
    He, Yawen
    Olariu, Mariana
    Trevino, Ramon
    GEOPHYSICS, 2023, 88 (02) : R193 - R207
  • [9] Image data augmentation for improving performance of deep learning-based model in pathological lung segmentation
    Alam, Md Shariful
    Wang, Dadong
    Sowmya, Arcot
    2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 383 - 387
  • [10] A Machine Learning-Based Recommender System for Improving Students Learning Experiences
    Yanes, Nacim
    Mostafa, Ayman Mohamed
    Ezz, Mohamed
    Almuayqil, Saleh Naif
    IEEE ACCESS, 2020, 8 (08): : 201218 - 201235