Improving Surgical Scene Semantic Segmentation through a Deep Learning Architecture with Attention to Class Imbalance

被引:5
|
作者
Urrea, Claudio [1 ]
Garcia-Garcia, Yainet [1 ]
Kern, John [1 ]
机构
[1] Univ Santiago Chile, Fac Engn, Elect Engn Dept, Las Sophoras 165, Santiago 9170020, Chile
关键词
deep learning for laparoscopic surgery; class imbalance; activation function; loss function; Adam and SGDM optimizers; semantic segmentation of the surgical scene;
D O I
10.3390/biomedicines12061309
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
This article addresses the semantic segmentation of laparoscopic surgery images, placing special emphasis on the segmentation of structures with a smaller number of observations. As a result of this study, adjustment parameters are proposed for deep neural network architectures, enabling a robust segmentation of all structures in the surgical scene. The U-Net architecture with five encoder-decoders (U-Net5ed), SegNet-VGG19, and DeepLabv3+ employing different backbones are implemented. Three main experiments are conducted, working with Rectified Linear Unit (ReLU), Gaussian Error Linear Unit (GELU), and Swish activation functions. The applied loss functions include Cross Entropy (CE), Focal Loss (FL), Tversky Loss (TL), Dice Loss (DiL), Cross Entropy Dice Loss (CEDL), and Cross Entropy Tversky Loss (CETL). The performance of Stochastic Gradient Descent with momentum (SGDM) and Adaptive Moment Estimation (Adam) optimizers is compared. It is qualitatively and quantitatively confirmed that DeepLabv3+ and U-Net5ed architectures yield the best results. The DeepLabv3+ architecture with the ResNet-50 backbone, Swish activation function, and CETL loss function reports a Mean Accuracy (MAcc) of 0.976 and Mean Intersection over Union (MIoU) of 0.977. The semantic segmentation of structures with a smaller number of observations, such as the hepatic vein, cystic duct, Liver Ligament, and blood, verifies that the obtained results are very competitive and promising compared to the consulted literature. The proposed selected parameters were validated in the YOLOv9 architecture, which showed an improvement in semantic segmentation compared to the results obtained with the original architecture.
引用
收藏
页数:30
相关论文
共 50 条
  • [41] Road Scene Segmentation Based on Deep Learning
    Zheng, Ke
    Naji, Hasan Abdullah Hasan
    IEEE ACCESS, 2020, 8 : 140964 - 140971
  • [42] Semantic segmentation with labeling uncertainty and class imbalance applied to vegetation mapping
    Bressan, Patrik Ola
    Marcato Junior, Jose
    Martins, Jose Augusto Correa
    de Melo, Maximilian Jaderson
    Goncalves, Diogo Nunes
    Freitas, Daniel Matte
    Ramos, Ana Paula Marques
    Furuya, Michelle Tais Gracia
    Osco, Lucas Prado
    Silva, Jonathan de Andrare
    Luo, Zhipeng
    Garcia, Raymundo Dordero
    Ma, Lingfei
    Li, Jonathan
    Goncalves, Wesley Nunes
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 108
  • [43] Improving Breast Cancer Detection and Diagnosis through Semantic Segmentation Using the Unet3+ Deep Learning Framework
    Alam, Taukir
    Shia, Wei-Chung
    Hsu, Fang-Rong
    Hassan, Taimoor
    BIOMEDICINES, 2023, 11 (06)
  • [44] Intrinsically explainable deep learning architecture for semantic segmentation of histological structures in heart tissue
    Halinkovic M.
    Fabian O.
    Felsoova A.
    Kveton M.
    Benesova W.
    Computers in Biology and Medicine, 2024, 177
  • [45] Tackling the class imbalance problem of deep learning-based head and neck organ segmentation
    Tappeiner, Elias
    Welk, Martin
    Schubert, Rainer
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022, 17 (11) : 2103 - 2111
  • [46] Tackling the class imbalance problem of deep learning-based head and neck organ segmentation
    Elias Tappeiner
    Martin Welk
    Rainer Schubert
    International Journal of Computer Assisted Radiology and Surgery, 2022, 17 : 2103 - 2111
  • [47] Tackling the class imbalance problem of deep learning-based head and neck organ segmentation
    Tappeiner, Elias
    Welk, Martin
    Schubert, Rainer
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022,
  • [48] Wildfire Segmentation using Deep-RegSeg Semantic Segmentation Architecture
    Ghali, Rafik
    Akhloufi, Moulay A.
    Mseddi, Wided Souidene
    Jmal, Marwa
    19TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2022, 2022, : 149 - 154
  • [49] Exploring the Impact of Deep Learning Models on Lane Detection Through Semantic Segmentation
    Kumar S.
    Pandey A.
    Varshney S.
    SN Computer Science, 5 (1)
  • [50] Deep learning enhanced Watershed for microstructural analysis using a boundary class semantic segmentation
    G. Fotos
    A. Campbell
    P. Murray
    E. Yakushina
    Journal of Materials Science, 2023, 58 : 14390 - 14410