No More Training: SAM's Zero-Shot Transfer Capabilities for Cost-Efficient Medical Image Segmentation

被引:2
|
作者
Gutierrez, Juan D. [1 ]
Rodriguez-Echeverria, Roberto [2 ]
Delgado, Emilio [2 ]
Rodrigo, Miguel Angel Suero [3 ]
Sanchez-Figueroa, Fernando [2 ]
机构
[1] Univ Santiago De Compostela, Dept Elect & Comp Sci, Lugo 27002, Spain
[2] Univ Extremadura, Dept Comp Syst Engn & Telemat, i3 Lab Quercus Res Grp, Caceres 10003, Spain
[3] Hosp Univ Caceres, Serv Extremeno Salud, Caceres 10004, Spain
关键词
Image segmentation; deep learning; zero-shot learning; medical imaging; semantic segmentation;
D O I
10.1109/ACCESS.2024.3353142
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic segmentation of medical images presents an enormous potential for diagnosis and surgery. However, achieving precise results involves designing and training complex Deep Learning (DL) models specifically for this task, which is only available to some. SAM is a model developed by Meta capable of segmenting objects present in virtually any type of image. This paper showcases SAM's robustness and exceptional performance in medical image segmentation, even in the absence of direct training on these image types (lung ComputedTomographyComputedTomographies(CTs) and chest X-rays, in particular). Additionally, it achieves this impressive outcome while requiring minimal user intervention. Although the dataset used to train SAM does not contain a single sample of both medical image types, processing a popular dataset comprised of 20 volumes with a total of 3520 slices using the ViT-L version of the model yields an average Jaccard index of 91.45 % and an average Dice score of 94.95 % . The same version of the model achieves a 93.19 % Dice score and a 87.45 % Jaccard index when segmenting a frequently-used chest X-ray dataset. The values obtained are above the 70 % mark recommended in the literature, and close to state-of-the art models developed specifically for medical segmentation. These results are achieved without user interaction by providing the model with positive prompts based on the masks of the dataset used and a negative prompt located in the center of bounding box that contains the masks.
引用
收藏
页码:24205 / 24216
页数:12
相关论文
共 10 条
  • [1] Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero-shot Medical Image Segmentation
    Aleem, Sidra
    Wang, Fangyijie
    Maniparambil, Mayug
    Arazo, Eric
    Dietlmeier, Julia
    Curran, Kathleen
    O'Connor, Noel E.
    Little, Suzanne
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, : 5184 - 5193
  • [2] SIMSAM: ZERO-SHOT MEDICAL IMAGE SEGMENTATION VIA SIMULATED INTERACTION
    Towle, Benjamin
    Chen, Xin
    Zhou, Ke
    IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI 2024, 2024,
  • [3] SIMSAM: ZERO-SHOT MEDICAL IMAGE SEGMENTATION VIA SIMULATED INTERACTION
    Towle, Benjamin
    Chen, Xin
    Zhou, Ke
    arXiv,
  • [4] Language-only Efficient Training of Zero-shot Composed Image Retrieval
    Gu, Geonmo
    Chun, Sanghyuk
    Kim, Wonjae
    Kang, Yoohoon
    Yun, Sangdoo
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 13225 - 13234
  • [5] Domain Adaptation Meets Zero-Shot Learning: An Annotation-Efficient Approach to Multi-Modality Medical Image Segmentation
    Bian, Cheng
    Yuan, Chenglang
    Ma, Kai
    Yu, Shuang
    Wei, Dong
    Zheng, Yefeng
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (05) : 1043 - 1056
  • [6] LGA: A Language Guide Adapter for Advancing the SAM Model's Capabilities in Medical Image Segmentation
    Hu, Jihong
    Li, Yinhao
    Sun, Hao
    Song, Yu
    Zhang, Chujie
    Lin, Lanfen
    Chen, Yen-Wei
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT XII, 2024, 15012 : 610 - 620
  • [7] Zero-Shot Medical Image Retrieval for Emerging Infectious Diseases Based on Meta-Transfer Learning - Worldwide, 2020
    Zhao, Yuying
    Lai, Hanjiang
    Yin, Jian
    Zhang, Yewu
    Yang, Shigui
    Jia, Zhongwei
    Ma, Jiaqi
    CHINA CDC WEEKLY, 2020, 2 (52): : 1004 - 1008
  • [8] TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation
    Jiang, Zekun
    Cheng, Dongjie
    Qin, Ziyuan
    Gao, Jun
    Lao, Qicheng
    Ismoilovich, Abdullaev Bakhrom
    Gayrat, Urazboev
    Elyorbek, Yuldashov
    Habibullo, Bekchanov
    Tang, Defu
    Wei, Linjing
    Li, Kang
    Zhang, Le
    BIG DATA MINING AND ANALYTICS, 2024, 7 (04): : 1199 - 1211
  • [9] Progressive expansion: Cost-efficient medical image analysis model with reversed once-for-all network training paradigm
    Lim, Shin Wei
    Chan, Chee Seng
    Faizal, Erma Rahayu Mohd
    Ewe, Kok Howg
    NEUROCOMPUTING, 2024, 581
  • [10] Trans-SAM: Transfer Segment Anything Model to medical image segmentation with Parameter-Efficient Fine-Tuning
    Wu, Yanlin
    Wang, Zhihong
    Yang, Xiongfeng
    Kang, Hong
    He, Along
    Li, Tao
    KNOWLEDGE-BASED SYSTEMS, 2025, 310