Vision transformer distillation for enhanced gastrointestinal abnormality recognition in wireless capsule endoscopy images

被引:0
|
作者
Oukdach, Yassine [1 ]
Garbaz, Anass [1 ]
Kerkaou, Zakaria [1 ]
El Ansari, Mohamed [2 ]
Koutti, Lahcen [1 ]
Papachrysos, Nikolaos [3 ,4 ]
El Ouafdi, Ahmed Fouad [1 ]
de Lange, Thomas [3 ,4 ]
Distante, Cosimo [5 ]
机构
[1] Ibn Zohr Univ, Fac Sci, Dept Comp Sci, LabSIV, Agadir, Morocco
[2] Moulay Ismail Univ, Fac Sci, Dept Comp Sci, Informat & Applicat Lab, Meknes, Morocco
[3] Univ Gothenburg, Sahlgrenska Acad, Dept Mol & Clin Med, Gothenburg, Sweden
[4] Sahlgrens Univ Hosp, Med Dept, Molndal, Sweden
[5] CNR, Inst Appl Sci & Intelligent Syst Eduardo Caianiell, Lecce, Italy
关键词
wireless capsule endoscopy; vision transformer; convolutional neural network; attention mechanism; knowledge distillation; gastrointestinal abnormality detection; CANCER STATISTICS; SYSTEM; COLON;
D O I
10.1117/1.JMI.12.1.014505
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Purpose: Wireless capsule endoscopy (WCE) is a non-invasive technology used for diagnosing gastrointestinal abnormalities. A single examination generates similar to 55,000 images, making manual review both time-consuming and costly for doctors. Therefore, the development of computer vision-assisted systems is highly desirable to aid in the diagnostic process. Approach: We presents a deep learning approach leveraging knowledge distillation (KD) from a convolutional neural network (CNN) teacher model to a vision transformer (ViT) student model for gastrointestinal abnormality recognition. The CNN teacher model utilizes attention mechanisms and depth-wise separable convolutions to extract features from WCE images, supervising the ViT in learning these representations. Results: The proposed method achieves accuracy of 97% and 96% on the Kvasir and KID datasets, respectively, demonstrating its effectiveness in distinguishing normal from abnormal regions and bleeding from non-bleeding cases. The proposed approach offers computational efficiency and generalization to unseen datasets, outperforming several state-of-the-art methods. Conclusions: We proposed a deep learning approach utilizing CNNs and a ViT with KD to effectively classify gastrointestinal diseases in WCE images. It demonstrates promising performance on public datasets, distinguishing normal from abnormal regions and bleeding from non-bleeding cases while offering optimal computational efficiency compared with existing methods, making it suitable for GI disease applications.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] An intelligent system for polyp detection in wireless capsule endoscopy images
    Figueiredo, Isabel N.
    Kumar, Sunil
    Figueiredo, Pedro N.
    COMPUTATIONAL VISION AND MEDICAL IMAGE PROCESSING IV, 2014, : 229 - 235
  • [42] Analysis of wireless capsule endoscopy images using chromaticity moments
    Li, Baopu
    Meng, Max Q. -H.
    2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS, VOLS 1-5, 2007, : 87 - 92
  • [43] Wireless capsule endoscopy images enhancement by tensor based diffusion
    Li, Baopu
    Meng, Max Q-H.
    2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006, : 136 - 139
  • [44] A Novel Feature for Polyp Detection in Wireless Capsule Endoscopy images
    Yuan, Yixuan
    Meng, Max Q. -H.
    2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014), 2014, : 5010 - 5015
  • [45] An automatic blood detection algorithm for wireless capsule endoscopy images
    Figueiredo, Isabel N.
    Kumar, Sunil
    Leal, Carlos
    Figueiredo, Pedro N.
    COMPUTATIONAL VISION AND MEDICAL IMAGE PROCESSING IV, 2014, : 237 - 241
  • [46] Detecting Mucosal Abnormalities from Wireless Capsule Endoscopy Images
    Abiko, Aschalew Tirulo
    Vala, Brijesh
    Patel, Satvik
    INTERNATIONAL CONFERENCE ON INTELLIGENT DATA COMMUNICATION TECHNOLOGIES AND INTERNET OF THINGS, ICICI 2018, 2019, 26 : 872 - 878
  • [47] Automatic Bleeding Frame Detection in the Wireless Capsule Endoscopy Images
    Yuan, Yixuan
    Meng, Max Q-H
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 1310 - 1315
  • [48] 3D reconstruction of wireless capsule endoscopy images
    Fan, Yichen
    Meng, Max Q. -H.
    Li, Baopu
    2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, : 5149 - 5152
  • [49] Hookworm Detection in Wireless Capsule Endoscopy Images With Deep Learning
    He, Jun-Yan
    Wu, Xiao
    Jiang, Yu-Gang
    Peng, Qiang
    Jain, Ramesh
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (05) : 2379 - 2392
  • [50] WCE-DCGAN: A data augmentation method based on wireless capsule endoscopy images for gastrointestinal disease detection
    Xiao, Zhiguo
    Lu, Jia
    Wang, Xiaokun
    Li, Nianfeng
    Wang, Yuying
    Zhao, Nan
    IET IMAGE PROCESSING, 2023, 17 (04) : 1170 - 1180