Prostate lesion segmentation based on a 3D end-to-end convolution neural network with deep multi-scale attention

被引:11
|
作者
Song, Enmin [1 ]
Long, Jiaosong [1 ]
Ma, Guangzhi [1 ]
Liu, Hong [1 ]
Hung, Chih-Cheng [2 ]
Jin, Renchao [1 ]
Wang, Peijun [3 ]
Wang, Wei [3 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[2] Kennesaw State Univ, Coll Comp & Software Engn, Atlanta, GA USA
[3] Tongji Univ, Tongji Hosp, Sch Medcine, Dept Radiol, Shanghai 200065, Peoples R China
基金
中国国家自然科学基金;
关键词
Mp-MRI; Prostate cancer segmentation; Convolution neural network; Attention; COMPUTER-AIDED DIAGNOSIS; SUPPORT VECTOR MACHINES; GLEASON SCORE; MR-IMAGES; CANCER;
D O I
10.1016/j.mri.2023.01.015
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Prostate cancer is one of the deadest cancers among human beings. To better diagnose the prostate cancer, prostate lesion segmentation becomes a very important work, but its progress is very slow due to the prostate lesions small in size, irregular in shape, and blurred in contour. Therefore, automatic prostate lesion segmentation from mp-MRI is a great significant work and a challenging task. However, the most existing multi-step segmentation methods based on voxel-level classification are time-consuming, may introduce errors in different steps and lead to error accumulation. To decrease the computation time, harness richer 3D spatial features, and fuse the multi-level contextual information of mp-MRI, we present an automatic segmentation method in which all steps are optimized conjointly as one step to form our end-to-end convolutional neural network. The proposed end-to-end network DMSA-V-Net consists of two parts: (1) a 3D V-Net is used as the backbone network, it is the first attempt in employing 3D convolutional neural network for CS prostate lesion segmentation, (2) a deep multi-scale attention mechanism is introduced into the 3D V-Net which can highly focus on the ROI while suppressing the redundant background. As a merit, the attention can adaptively re-align the context information between the feature maps at different scales and the saliency maps in high-levels. We performed experiments based on five cross-fold validation with data including 97 patients. The results show that the Dice and sensitivity are 0.7014 and 0.8652 respectively, which demonstrates that our segmentation approach is more significant and accurate compared to other methods.
引用
收藏
页码:98 / 109
页数:12
相关论文
共 50 条
  • [21] END-TO-END LEARNING OF DEEP CONVOLUTIONAL NEURAL NETWORK FOR 3D HUMAN ACTION RECOGNITION
    Li, Chao
    Sun, Shouqian
    Min, Xin
    Lin, Wenqian
    Nie, Binling
    Zhang, Xianfu
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [22] 3D mineral prospectivity modeling using multi-scale 3D convolution neural network and spatial attention approaches
    Li, Xiaohui
    Chen, Yuheng
    Yuan, Feng
    Jowitt, Simon M.
    Zhang, Mingming
    Ge, Can
    Wang, Zhiqiang
    Deng, Yufeng
    GEOCHEMISTRY, 2024, 84 (04):
  • [23] SurfaceNet: An End-to-end 3D Neural Network for Multiview Stereopsis
    Ji, Mengqi
    Gall, Juergen
    Zheng, Haitian
    Liu, Yebin
    Fang, Lu
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2326 - 2334
  • [24] SACANet: end-to-end self-attention-based network for 3D clothing animation
    Chen, Yunxi
    Cao, Yuanjie
    Fang, Fei
    Huang, Jin
    Hu, Xinrong
    He, Ruhan
    Zhang, Junjie
    VISUAL COMPUTER, 2024, : 3829 - 3842
  • [25] End-to-end autonomous driving based on the convolution neural network model
    Zhao, Yuanfang
    Chen, Yunli
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 419 - 423
  • [26] Deep Label Fusion: A 3D End-To-End Hybrid Multi-atlas Segmentation and Deep Learning Pipeline
    Xie, Long
    Wisse, Laura E. M.
    Wang, Jiancong
    Ravikumar, Sadhana
    Glenn, Trevor
    Luther, Anica
    Lim, Sydney
    Wolk, David A.
    Yushkevich, Paul A.
    INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2021, 2021, 12729 : 428 - 439
  • [27] An end-to-end multi-scale network based on autoencoder for infrared and visible image fusion
    Liu, Hongzhe
    Yan, Hua
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (13) : 20139 - 20156
  • [28] An end-to-end multi-scale network based on autoencoder for infrared and visible image fusion
    Hongzhe Liu
    Hua Yan
    Multimedia Tools and Applications, 2023, 82 : 20139 - 20156
  • [29] A novel end-to-end deep convolutional neural network based skin lesion classification framework
    A, Razia Sulthana
    Chamola, Vinay
    Hussain, Zain
    Albalwy, Faisal
    Hussain, Amir
    Expert Systems with Applications, 2024, 246
  • [30] A Convolutional Network With Multi-Scale and Attention Mechanisms for End-to-End Single-Channel Speech Enhancement
    Xiang, Xiaoxiao
    Zhang, Xiaojuan
    Chen, Haozhe
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1455 - 1459