A multimodal deep learning model for detecting endoscopic images of near-infrared fluorescence capsules

被引:0
|
作者
Wang, Junhao [1 ,2 ]
Zhou, Cheng [1 ]
Wang, Wei [1 ]
Zhang, Hanxiao [2 ]
Zhang, Amin [4 ]
Cui, Daxiang [1 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst & Med Robot, Shanghai 200240, Peoples R China
[3] Henan Univ, Med & Engn Cross Res Inst, Sch Med, Kaifeng 475004, Peoples R China
[4] Shanghai Jiao Tong Univ, Sch Agr & Biol, Dept Food Sci & Technol, Shanghai 200240, Peoples R China
来源
基金
上海市自然科学基金; 中国国家自然科学基金; 国家自然科学基金国际合作与交流项目; 中国博士后科学基金;
关键词
Fluorescence endoscopy; Multimodal deep learning; Disease detection; CANCER;
D O I
10.1016/j.bios.2025.117251
中图分类号
Q6 [生物物理学];
学科分类号
071011 ;
摘要
Early screening for gastrointestinal (GI) diseases is critical for preventing cancer development. With the rapid advancement of deep learning technology, artificial intelligence (AI) has become increasingly prominent in the early detection of GI diseases. Capsule endoscopy is a non-invasive medical imaging technique used to examine the gastrointestinal tract. In our previous work, we developed a near-infrared fluorescence capsule endoscope (NIRF-CE) capable of exciting and capturing near-infrared (NIR) fluorescence images to specifically identify subtle mucosal microlesions and submucosal abnormalities while simultaneously capturing conventional white- light images to detect lesions with significant morphological changes. However, limitations such as low camera resolution and poor lighting within the gastrointestinal tract may lead to misdiagnosis and other medical errors. Manually reviewing and interpreting large volumes of capsule endoscopy images is time-consuming and prone to errors. Deep learning models have shown potential in automatically detecting abnormalities in NIRF-CE images. This study focuses on an improved deep learning model called Retinex-Attention-YOLO (RAY), which is based on single-modality image data and built on the YOLO series of object detection models. RAY enhances the accuracy and efficiency of anomaly detection, especially under low-light conditions. To further improve detection performance, we also propose a multimodal deep learning model, Multimodal-Retinex-Attention-YOLO (MRAY), which combines both white-light and fluorescence image data. The dataset used in this study consists of images of pig stomachs captured by our NIRF-CE system, simulating the human GI tract. In conjunction with a targeted fluorescent probe, which accumulates at lesion sites and releases fluorescent signals for imaging when abnormalities are present, a bright spot indicates a lesion. The MRAY model achieved an impressive precision of 96.3%, outperforming similar object detection models. To further validate the model's performance, ablation experiments were conducted, and comparisons were made with publicly available datasets. MRAY shows great promise for the automated detection of GI cancers, ulcers, inflammations, and other medical conditions in clinical practice.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Near-infrared fluorescence system images the heart
    Robinson, Kevin
    Biophotonics International, 2002, 9 (10): : 55 - 56
  • [2] Near-Infrared Fluorescence from Nanodiamond for Multimodal Bioimaging
    Lin, Y. -C.
    Tsai, L. -W.
    Perevedentseva, E.
    Karmenyan, A.
    Cheng, C. -L.
    SOVREMENNYE TEHNOLOGII V MEDICINE, 2018, 10 (01) : 49 - 55
  • [3] MEIBOMIAN GLANDS SEGMENTATION IN NEAR-INFRARED IMAGES WITH WEAKLY SUPERVISED DEEP LEARNING
    Liu, Xiaoming
    Wang, Shuo
    Zhang, Ying
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 16 - 20
  • [4] Multimodal Chelation Platform for Near-Infrared Fluorescence/Nuclear Imaging
    Ghosh, Sukhen C.
    Ghosh, Pradip
    Wilganowski, Nathaniel
    Robinson, Holly
    Hall, Mary A.
    Dickinson, Gabriel
    Pinkston, Ken L.
    Harvey, Barrett R.
    Sevick-Muraca, Eva M.
    Azhdarinia, Ali
    JOURNAL OF MEDICINAL CHEMISTRY, 2013, 56 (02) : 406 - 416
  • [5] Deep learning for in vivo near-infrared imaging
    Ma, Zhuoran
    Wang, Feifei
    Wang, Weizhi
    Zhong, Yeteng
    Dai, Hongjie
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2021, 118 (01)
  • [6] Generalizability of Deep Learning Models for Caries Detection in Near-Infrared Light Transillumination Images
    Holtkamp, Agnes
    Elhennawy, Karim
    Cejudo Grano de Oro, Jose E.
    Krois, Joachim
    Paris, Sebastian
    Schwendicke, Falk
    JOURNAL OF CLINICAL MEDICINE, 2021, 10 (05) : 1 - 8
  • [7] Learning to colorize near-infrared images with limited data
    Liu, Yu
    Guo, Zhe
    Guo, Haojie
    Xiao, Huaxin
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (27): : 19865 - 19884
  • [8] Near-infrared Fluorescence Imaging for Detecting Pancreatic Liver Metastasis in an Orthotopic Athymic Mouse Model
    Lee, Lucas D.
    Hering, Nina A.
    Zibell, Miriam
    Lobbes, Leonard A.
    Kamphues, Carsten
    Lauscher, Johannes C.
    Margonis, Georgios A.
    Seeliger, Hendrik
    Beyer, Katharina
    Weixler, Benjamin
    Pozios, Ioannis
    IN VIVO, 2023, 37 (02): : 519 - 523
  • [9] Learning to colorize near-infrared images with limited data
    Yu Liu
    Zhe Guo
    Haojie Guo
    Huaxin Xiao
    Neural Computing and Applications, 2023, 35 : 19865 - 19884
  • [10] Detecting the content of the bright blue pigment in cream based on deep learning and near-infrared spectroscopy
    Liu, Jun
    Zhang, Jianxing
    Tan, Zhenglin
    Hou, Qin
    Liu, Ruirui
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2022, 270