Contrastive Transformer-Based Multiple Instance Learning for Weakly Supervised Polyp Frame Detection

被引:10
|
作者
Tian, Yu [1 ,2 ,4 ]
Pang, Guansong [3 ]
Liu, Fengbei [1 ]
Liu, Yuyuan [1 ]
Wang, Chong [1 ]
Chen, Yuanhong [1 ]
Verjans, Johan [2 ]
Carneiro, Gustavo [1 ]
机构
[1] Univ Adelaide, Australian Inst Machine Learning, Adelaide, Australia
[2] South Australian Hlth & Med Res Inst, Adelaide, Australia
[3] Singapore Management Univ, Singapore, Singapore
[4] Harvard Med Sch, Boston, MA USA
基金
澳大利亚研究理事会;
关键词
Polyp detection; Colonoscopy; Weakly-supervised learning; Video anomaly detection; Vision transformer;
D O I
10.1007/978-3-031-16437-8_9
中图分类号
R445 [影像诊断学];
学科分类号
100207 ;
摘要
Current polyp detection methods from colonoscopy videos use exclusively normal (i.e., healthy) training images, which i) ignore the importance of temporal information in consecutive video frames, and ii) lack knowledge about the polyps. Consequently, they often have high detection errors, especially on challenging polyp cases (e.g., small, flat, or partially visible polyps). In this work, we formulate polyp detection as a weakly-supervised anomaly detection task that uses video-level labelled training data to detect frame-level polyps. In particular, we propose a novel convolutional transformer-based multiple instance learning method designed to identify abnormal frames (i.e., frames with polyps) from anomalous videos (i.e., videos containing at least one frame with polyp). In our method, local and global temporal dependencies are seamlessly captured while we simultaneously optimise video and snippet-level anomaly scores. A contrastive snippet mining method is also proposed to enable an effective modelling of the challenging polyp cases. The resulting method achieves a detection accuracy that is substantially better than current state-of-the-art approaches on a new large-scale colonoscopy video dataset introduced in this work.
引用
收藏
页码:88 / 98
页数:11
相关论文
共 50 条
  • [1] A FRAME LOSS OF MULTIPLE INSTANCE LEARNING FOR WEAKLY SUPERVISED SOUND EVENT DETECTION
    Wang, Xu
    Zhang, Xiangjinzi
    Zi, Yunfei
    Xiong, Shengwu
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 331 - 335
  • [2] Transformer Based Multiple Instance Learning for Weakly Supervised Histopathology Image Segmentation
    Qian, Ziniu
    Li, Kailu
    Lai, Maode
    Chang, Eric I-Chao
    Wei, Bingzheng
    Fan, Yubo
    Xu, Yan
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 160 - 170
  • [3] Instance-Level Contrastive Learning for Weakly Supervised Object Detection
    Zhang, Ming
    Zeng, Bing
    [J]. SENSORS, 2022, 22 (19)
  • [4] Discrepant multiple instance learning for weakly supervised object detection
    Gao, Wei
    Wan, Fang
    Yue, Jun
    Xu, Songcen
    Ye, Qixiang
    [J]. PATTERN RECOGNITION, 2022, 122
  • [5] Colorectal cancer lymph node metastasis prediction with weakly supervised transformer-based multi-instance learning
    Tan, Luxin
    Li, Huan
    Yu, Jinze
    Zhou, Haoyi
    Wang, Zhi
    Niu, Zhiyong
    Li, Jianxin
    Li, Zhongwu
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (06) : 1565 - 1580
  • [6] Colorectal cancer lymph node metastasis prediction with weakly supervised transformer-based multi-instance learning
    Luxin Tan
    Huan Li
    Jinze Yu
    Haoyi Zhou
    Zhi Wang
    Zhiyong Niu
    Jianxin Li
    Zhongwu Li
    [J]. Medical & Biological Engineering & Computing, 2023, 61 : 1565 - 1580
  • [7] Towards Hierarchical Regional Transformer-based Multiple Instance Learning
    Cersovsky, Josef
    Mohammadi, Sadegh
    Kainmueller, Dagmar
    Hoehne, Johannes
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3954 - 3962
  • [8] Transformer-based contrastive learning framework for image anomaly detection
    Fan, Wentao
    Shangguan, Weimin
    Chen, Yewang
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (10) : 3413 - 3426
  • [9] Transformer-based contrastive learning framework for image anomaly detection
    Wentao Fan
    Weimin Shangguan
    Yewang Chen
    [J]. International Journal of Machine Learning and Cybernetics, 2023, 14 : 3413 - 3426
  • [10] Continuation Multiple Instance Learning for Weakly and Fully Supervised Object Detection
    Ye, Qixiang
    Wan, Fang
    Liu, Chang
    Huang, Qingming
    Ji, Xiangyang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5452 - 5466