Vision Transformer Based Multi-class Lesion Detection in IVOCT

被引:0
|
作者
Wang, Zixuan [1 ]
Shao, Yifan [2 ]
Sun, Jingyi [2 ]
Huang, Zhili [1 ]
Wang, Su [1 ]
Li, Qiyong [3 ]
Li, Jinsong [3 ]
Yu, Qian [2 ]
机构
[1] Sichuan Univ, Chengdu, Peoples R China
[2] Beihang Univ, Beijing, Peoples R China
[3] Sichuan Prov Peoples Hosp, Chengdu, Peoples R China
关键词
IVOCT; Object Detection; Vision Transformer;
D O I
10.1007/978-3-031-43987-2_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cardiovascular disease is a high-fatality illness. Intravascular Optical Coherence Tomography (IVOCT) technology can significantly assist in diagnosing and treating cardiovascular diseases. However, locating and classifying lesions from hundreds of IVOCT images is time-consuming and challenging, especially for junior physicians. An automatic lesion detection and classification model is desirable. To achieve this goal, in this work, we first collect an IVOCT dataset, including 2,988 images from 69 IVOCT data and 4,734 annotations of lesions spanning over three categories. Based on the newly-collected dataset, we propose a multi-class detection model based on Vision Transformer, called G-Swin Transformer. The essential part of our model is grid attention which is used to model relations among consecutive IVOCT images. Through extensive experiments, we show that the proposed G-Swin Transformer can effectively localize different types of lesions in IVOCT images, significantly outperforming baseline methods in all evaluation metrics. Our code is available via this link. https://github.com/Shao1Fan/G-Swin-Transformer
引用
收藏
页码:327 / 336
页数:10
相关论文
共 50 条
  • [1] Transformer Faults Detection using Inrush Transients based on Multi-class SVM
    Vatsa, Aniket
    Hati, Ananda Shankar
    2022 IEEE 6TH INTERNATIONAL CONFERENCE ON CONDITION ASSESSMENT TECHNIQUES IN ELECTRICAL SYSTEMS, CATCON, 2022, : 24 - 29
  • [2] Multi-Class Skin Lesion Detection and Classification via Teledermatology
    Khan, Muhammad Attique
    Muhammad, Khan
    Sharif, Muhammad
    Akram, Tallha
    de Albuquerque, Victor Hugo C.
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (12) : 4267 - 4275
  • [3] An efficient multi-class classification of skin cancer using optimized vision transformer
    Desale, R. P.
    Patil, P. S.
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, 62 (03) : 773 - 789
  • [4] An efficient multi-class classification of skin cancer using optimized vision transformer
    R. P. Desale
    P. S. Patil
    Medical & Biological Engineering & Computing, 2024, 62 : 773 - 789
  • [5] Vision Based Nighttime Vehicle Detection Using Adaptive Threshold and Multi-Class Classification
    Sakagawa, Yuta
    Nakajima, Kosuke
    Ohashi, Gosuke
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2019, E102A (09) : 1235 - 1245
  • [6] Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection
    Huang, Junjia
    Li, Haofeng
    Wan, Xiang
    Li, Guanbin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21327 - 21336
  • [7] Hierarchical Vector Quantized Transformer for Multi-class Unsupervised Anomaly Detection
    Lu, Ruiying
    Wu, YuJie
    Tian, Long
    Wang, Dongsheng
    Chen, Bo
    Liu, Xiyang
    Hu, Ruimin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] Transformer Fault Diagnosis Based on Multi-Class AdaBoost Algorithm
    Li, Jifang
    Li, Genxu
    Hai, Chen
    Guo, Mengbo
    IEEE ACCESS, 2022, 10 : 1522 - 1532
  • [9] WMC-ViT: Waste Multi-class Classification Using a Modified Vision Transformer
    Kurz, Aidan
    Adams, Ethan
    Depoian, Arthur C.
    Bailey, Colleen P.
    Guturu, Parthasarathy
    2022 IEEE METROCON, 2022, : 13 - 15
  • [10] A Deep Learning Enabled Multi-Class Plant Disease Detection Model Based on Computer Vision
    Roy, Arunabha M.
    Bhaduri, Jayabrata
    AI, 2021, 2 (03) : 413 - 428