Vision transformer distillation for enhanced gastrointestinal abnormality recognition in wireless capsule endoscopy images

Cited by: 0
Authors
Oukdach, Yassine [1]
Garbaz, Anass [1]
Kerkaou, Zakaria [1]
El Ansari, Mohamed [2]
Koutti, Lahcen [1]
Papachrysos, Nikolaos [3, 4]
El Ouafdi, Ahmed Fouad [1]
de Lange, Thomas [3, 4]
Distante, Cosimo [5]
Affiliations
[1] Ibn Zohr Univ, Fac Sci, Dept Comp Sci, LabSIV, Agadir, Morocco
[2] Moulay Ismail Univ, Fac Sci, Dept Comp Sci, Informat & Applicat Lab, Meknes, Morocco
[3] Univ Gothenburg, Sahlgrenska Acad, Dept Mol & Clin Med, Gothenburg, Sweden
[4] Sahlgrens Univ Hosp, Med Dept, Molndal, Sweden
[5] CNR, Inst Appl Sci & Intelligent Syst Eduardo Caianiell, Lecce, Italy
Keywords
wireless capsule endoscopy; vision transformer; convolutional neural network; attention mechanism; knowledge distillation; gastrointestinal abnormality detection; CANCER STATISTICS; SYSTEM; COLON
DOI
10.1117/1.JMI.12.1.014505
Chinese Library Classification number
R8 [Special Medicine]; R445 [Diagnostic Imaging]
Discipline classification codes
1002; 100207; 1009
Abstract
Purpose: Wireless capsule endoscopy (WCE) is a non-invasive technology used for diagnosing gastrointestinal (GI) abnormalities. A single examination generates approximately 55,000 images, making manual review both time-consuming and costly for doctors. The development of computer vision-assisted systems is therefore highly desirable to aid the diagnostic process. Approach: We present a deep learning approach that uses knowledge distillation (KD) from a convolutional neural network (CNN) teacher model to a vision transformer (ViT) student model for gastrointestinal abnormality recognition. The CNN teacher model uses attention mechanisms and depth-wise separable convolutions to extract features from WCE images and supervises the ViT in learning these representations. Results: The proposed method achieves accuracies of 97% and 96% on the Kvasir and KID datasets, respectively, demonstrating its effectiveness in distinguishing normal from abnormal regions and bleeding from non-bleeding cases. The approach is computationally efficient, generalizes to unseen datasets, and outperforms several state-of-the-art methods. Conclusions: We proposed a deep learning approach that combines CNNs and a ViT through KD to classify gastrointestinal diseases in WCE images. It shows promising performance on public datasets, distinguishing normal from abnormal regions and bleeding from non-bleeding cases while offering favorable computational efficiency compared with existing methods, making it suitable for GI disease applications.
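The abstract does not state which distillation objective the authors use. As a rough illustration of how a teacher can supervise a student, the sketch below implements the widely used soft-target formulation of knowledge distillation: a temperature-softened KL term between teacher and student outputs, blended with an ordinary cross-entropy term on the true label. The function names, the two-class (normal vs. abnormal) logits, and the values of `temperature` and `alpha` are all assumptions made for this example and are not taken from the paper.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax, shifted by the max for numerical stability."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def cross_entropy(probs, label, eps=1e-12):
    """Negative log-likelihood of the true class index."""
    return -math.log(probs[label] + eps)


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=2.0, alpha=0.5):
    """Blend a soft-target term (teacher vs. student) with a hard-label term.

    temperature and alpha are hypothetical hyperparameters for illustration.
    The soft term is scaled by temperature**2, as is conventional, so its
    gradient magnitude stays comparable across temperatures.
    """
    soft_teacher = softmax(teacher_logits, temperature)
    soft_student = softmax(student_logits, temperature)
    soft_term = (temperature ** 2) * kl_divergence(soft_teacher, soft_student)
    hard_term = cross_entropy(softmax(student_logits), label)
    return alpha * soft_term + (1.0 - alpha) * hard_term


if __name__ == "__main__":
    # Hypothetical logits for one image, classes = [normal, abnormal].
    teacher = [4.0, 1.0]       # CNN teacher output (made-up values)
    aligned = [3.5, 0.8]       # ViT student that agrees with the teacher
    misaligned = [0.5, 3.0]    # ViT student that disagrees
    print("aligned student loss:   ", distillation_loss(aligned, teacher, 0))
    print("misaligned student loss:", distillation_loss(misaligned, teacher, 0))
```

In this formulation a student whose outputs match the teacher's incurs no soft-target penalty, so the loss reduces to the weighted cross-entropy on the label; a student that disagrees with both the teacher and the label is penalized by both terms.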
Pages: 20