Vision transformer distillation for enhanced gastrointestinal abnormality recognition in wireless capsule endoscopy images

被引:0
|
作者
Oukdach, Yassine [1 ]
Garbaz, Anass [1 ]
Kerkaou, Zakaria [1 ]
El Ansari, Mohamed [2 ]
Koutti, Lahcen [1 ]
Papachrysos, Nikolaos [3 ,4 ]
El Ouafdi, Ahmed Fouad [1 ]
de Lange, Thomas [3 ,4 ]
Distante, Cosimo [5 ]
机构
[1] Ibn Zohr Univ, Fac Sci, Dept Comp Sci, LabSIV, Agadir, Morocco
[2] Moulay Ismail Univ, Fac Sci, Dept Comp Sci, Informat & Applicat Lab, Meknes, Morocco
[3] Univ Gothenburg, Sahlgrenska Acad, Dept Mol & Clin Med, Gothenburg, Sweden
[4] Sahlgrens Univ Hosp, Med Dept, Molndal, Sweden
[5] CNR, Inst Appl Sci & Intelligent Syst Eduardo Caianiell, Lecce, Italy
关键词
wireless capsule endoscopy; vision transformer; convolutional neural network; attention mechanism; knowledge distillation; gastrointestinal abnormality detection; CANCER STATISTICS; SYSTEM; COLON;
D O I
10.1117/1.JMI.12.1.014505
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Purpose: Wireless capsule endoscopy (WCE) is a non-invasive technology used for diagnosing gastrointestinal abnormalities. A single examination generates similar to 55,000 images, making manual review both time-consuming and costly for doctors. Therefore, the development of computer vision-assisted systems is highly desirable to aid in the diagnostic process. Approach: We presents a deep learning approach leveraging knowledge distillation (KD) from a convolutional neural network (CNN) teacher model to a vision transformer (ViT) student model for gastrointestinal abnormality recognition. The CNN teacher model utilizes attention mechanisms and depth-wise separable convolutions to extract features from WCE images, supervising the ViT in learning these representations. Results: The proposed method achieves accuracy of 97% and 96% on the Kvasir and KID datasets, respectively, demonstrating its effectiveness in distinguishing normal from abnormal regions and bleeding from non-bleeding cases. The proposed approach offers computational efficiency and generalization to unseen datasets, outperforming several state-of-the-art methods. Conclusions: We proposed a deep learning approach utilizing CNNs and a ViT with KD to effectively classify gastrointestinal diseases in WCE images. It demonstrates promising performance on public datasets, distinguishing normal from abnormal regions and bleeding from non-bleeding cases while offering optimal computational efficiency compared with existing methods, making it suitable for GI disease applications.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Tumor Recognition in Wireless Capsule Endoscopy Images
    Sawant, Suvidha
    Deshpande
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2015, 15 (05): : 71 - 75
  • [2] Tumor Recognition in Wireless Capsule Endoscopy Images
    Sawant, Suvidha
    Deshpande, M. S.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2015, 15 (04): : 85 - 89
  • [3] RECOGNITION OF HEMORRHAGE IN THE IMAGES OF WIRELESS CAPSULE ENDOSCOPY
    Kukushkin, Alexander
    Dmitry, Mikhaylov
    Ivanova, Ekaterina
    2012 16TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE (MELECON), 2012, : 899 - 902
  • [4] Dilated CNN for abnormality detection in wireless capsule endoscopy images
    Nidhi Goel
    Samarjeet Kaur
    Deepak Gunjan
    S. J. Mahapatra
    Soft Computing, 2022, 26 : 1231 - 1247
  • [5] Dilated CNN for abnormality detection in wireless capsule endoscopy images
    Goel, Nidhi
    Kaur, Samarjeet
    Gunjan, Deepak
    Mahapatra, S. J.
    SOFT COMPUTING, 2022, 26 (03) : 1231 - 1247
  • [6] Detection of abnormality in wireless capsule endoscopy images using fractal features
    Jain, Samir
    Seal, Ayan
    Ojha, Aparajita
    Krejcar, Ondrej
    Bures, Jan
    Tacheci, Ilja
    Yazidi, Anis
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 127
  • [7] Investigating the significance of color space for abnormality detection in wireless capsule endoscopy images
    Goel, Nidhi
    Kaur, Samarjeet
    Gunjan, Deepak
    Mahapatra, S. J.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 75
  • [8] Deep learning for polyp recognition in wireless capsule endoscopy images
    Yuan, Yixuan
    Meng, Max Q. -H.
    MEDICAL PHYSICS, 2017, 44 (04) : 1379 - 1389
  • [9] Gastrointestinal Tract Disease Classification Using Residual-Inception Transformer With Wireless Capsule Endoscopy Images Segmentation
    Ozbay, Erdal
    IEEE Access, 2024, 12 : 197988 - 197998
  • [10] Automatic Polyp recognition of small bowel in Wireless Capsule Endoscopy images
    Xu, Lanmeng
    Fan, Shanhui
    Fan, Yihong
    Li, Lihua
    MEDICAL IMAGING 2018: IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS, 2018, 10579