Classification of endoscopic image and video frames using distance metric-based learning with interpolated latent features

被引:0
|
作者
Chafjiri, Fatemeh Sedighipour [1 ]
Mohebbian, Mohammad Reza [1 ]
Wahid, Khan A. A. [1 ]
Babyn, Paul [2 ,3 ]
机构
[1] Univ Saskatchewan, Dept Elect & Comp Engn, Saskatoon, SK S7N 5A9, Canada
[2] Univ Saskatchewan, Dept Med Imaging, Saskatoon, SK S7K 0M7, Canada
[3] Saskatchewan Hlth Author, Saskatoon, SK S7K 0M7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Endoscopy; Few shot learning; Manifold mix-up; Siamese neural network; Classification; GI track anatomic locations; CAPSULE ENDOSCOPY; SYSTEMS;
D O I
10.1007/s11042-023-14982-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conventional Endoscopy (CE) and Wireless Capsule Endoscopy (WCE) are well known tools for diagnosing gastrointestinal (GI) tract related disorders. Defining the anatomical location within the GI tract helps clinicians determine appropriate treatment options, which can reduce the need for repetitive endoscopy. Limited research addresses the localization of the anatomical location of WCE and CE images using classification, mainly due to the difficulty in collecting annotated data. In this study, we present a few-shot learning method based on distance metric learning which combines transfer-learning and manifold mixup schemes to localize and classify endoscopic images and video frames. The proposed method allows us to develop a pipeline for endoscopy video sequence localization that can be trained with only a few samples. The use of manifold mixup improves learning by increasing the number of training epochs while reducing overfitting and providing more accurate decision boundaries. A dataset is collected from 10 different anatomical positions of the human GI tract. Two models were trained using only 78 CE and 27 WCE annotated frames to predict the location of 25,700 and 1825 video frames from CE and WCE respectively. We performed subjective evaluation using nine gastroenterologists to validate the need of having such an automated system to localize endoscopic images and video frames. Our method achieved higher accuracy and a higher F1-score when compared with the scores from subjective evaluation. In addition, the results show improved performance with less cross-entropy loss when compared with several existing methods trained on the same datasets. This indicates that the proposed method has the potential to be used in endoscopy image classification.
引用
收藏
页码:36577 / 36598
页数:22
相关论文
共 50 条
  • [1] Classification of endoscopic image and video frames using distance metric-based learning with interpolated latent features
    Fatemeh Sedighipour Chafjiri
    Mohammad Reza Mohebbian
    Khan A. Wahid
    Paul Babyn
    [J]. Multimedia Tools and Applications, 2023, 82 : 36577 - 36598
  • [2] Metric-Based Learning for Nearest-Neighbor Few-Shot Image Classification
    Lee, Min Jun
    So, Jungmin
    [J]. 35TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2021), 2021, : 460 - 464
  • [3] Metric-Based Attention Feature Learning for Video Action Recognition
    Kim, Dae Ha
    Anvarov, Fazliddin
    Lee, Jun Min
    Song, Byung Cheol
    [J]. IEEE ACCESS, 2021, 9 : 39218 - 39228
  • [4] Convex hull-based distance metric learning for image classification
    Zhang, Xue
    Wang, Changzhong
    Fan, Xiaodong
    [J]. COMPUTATIONAL & APPLIED MATHEMATICS, 2021, 40 (04):
  • [5] REGRESSION AND CLASSIFICATION BASED DISTANCE METRIC LEARNING FOR MEDICAL IMAGE RETRIEVAL
    Cai, Weidong
    Song, Yang
    Feng, David Dagan
    [J]. 2012 9TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2012, : 1775 - 1778
  • [6] Convex hull-based distance metric learning for image classification
    Xue Zhang
    Changzhong Wang
    Xiaodong Fan
    [J]. Computational and Applied Mathematics, 2021, 40
  • [7] Eigenvector-Based Distance Metric Learning for Image Classification and Retrieval
    Wang, Zhangcheng
    Li, Ya
    Hong, Richang
    Tian, Xinmei
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (03)
  • [8] Decomposition-Based Transfer Distance Metric Learning for Image Classification
    Luo, Yong
    Liu, Tongliang
    Tao, Dacheng
    Xu, Chao
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (09) : 3789 - 3801
  • [9] Image-to-Class Distance Metric Learning for Image Classification
    Wang, Zhengxiang
    Hu, Yiqun
    Chia, Liang-Tien
    [J]. COMPUTER VISION-ECCV 2010, PT I, 2010, 6311 : 706 - 719
  • [10] Learning Image-to-Class Distance Metric for Image Classification
    Wang, Zhengxiang
    Hu, Yiqun
    Chia, Liang-Tien
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2013, 4 (02)