PMED-Net: Pyramid Based Multi-Scale Encoder-Decoder Network for Medical Image Segmentation

被引:15
|
作者
Khan, Abbas [1 ,2 ]
Kim, Hyongsuk [1 ,2 ]
Chua, Leon [3 ]
机构
[1] Jeonbuk Natl Univ, Div Elect & Informat Engn, Jeonju 54896, South Korea
[2] Jeonbuk Natl Univ, Core Res Inst Intelligent Robots, Jeonju 54896, South Korea
[3] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
基金
新加坡国家研究基金会;
关键词
Image segmentation; Decoding; Feature extraction; Medical diagnostic imaging; Training; Diseases; Deep learning; Convolutional neural networks; encoder-decoder architecture; medical image processing; semantic segmentation;
D O I
10.1109/ACCESS.2021.3071754
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A pyramidical multi-scale encoder-decoder network, namely PMED-Net, is proposed for medical image segmentation. Different variants of encoder-decoder networks are in practice for segmenting the medical images and U-Net is the most widely used one. However, the existing architectures for segmenting medical images have millions of parameters that require enormous computations which results in memory and cost-inefficiency. To overcome such limitations, we come up with the idea of training small networks in a cascaded form for coarse-to-fine prediction. The proposed adaptive network is extended up to six pyramid levels, and at each level, features are extracted at different scales of the input image. Each lightweight encoder-decoder network is trained independently to minimize loss, where succeeding level networks further refine the prior predictions. Evaluation and comparison of our architecture were performed on four different publicly available medical image segmentation datasets: International Skin Imaging Collaboration (ISIC) challenge 2018 dataset, brain tumor dataset, nuclei dataset, and X-ray dataset. The experimental results of the PMED-Net are either better or on par with other state-of-the-art networks in terms of IoU, F1-Score, and sensitivity metrics. Moreover, PMED-Net is efficient in terms of parameterized complexity as it has 1/21.3, 1/21.1, 1/14.0, 1/11.6, 1/11.2, 1/6.64, and 1/4.95 times fewer parameters than SegNet, U-Net, BCDU-Net, CU-Net, FCN-8s, ORED-Net, and MultiResUNet respectively. The pre-trained models, datasets information, and implementation details are available at https://github.com/kabbas570/Pyramid-Based-Encoder-Decoder.
引用
收藏
页码:55988 / 55998
页数:11
相关论文
共 50 条
  • [1] Semantic Segmentation of Remote Sensing Image Based on Multi-Scale Semantic Encoder-Decoder Network
    Liang Y.
    Yi C.-X.
    Wang G.-Y.
    Hu Y.-H.
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (11): : 3199 - 3214
  • [2] MEDU-Net plus : a novel improved U-Net based on multi-scale encoder-decoder for medical image segmentation
    Yang, Zhenzhen
    Sun, Xue
    Yang, Yongpeng
    Wu, Xinyi
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2024, 18 (07): : 1706 - 1725
  • [3] Encoder-Decoder with Multi-scale Information Fusion for Semantic Image Segmentation
    Ma, Xinxin
    Liu, Kai
    Ding, Chongyang
    Yan, Lin
    Duan, Meiyu
    [J]. ELEVENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2019), 2020, 11373
  • [4] HYPERSPECTRAL IMAGE CLASSIFICATION VIA MULTI-SCALE ENCODER-DECODER NETWORK
    Ma, Jingjing
    Wu, Linlin
    Tang, Xu
    Zhang, Xiangrong
    Zhu, Cheng
    Ma, Junyong
    Jiao, Licheng
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1283 - 1286
  • [5] DHA-Net: An encoder-decoder network fusing multi-scale features for optic disc segmentation
    Zheng, Xuan
    He, Yi
    Yuan, Huaqing
    Jiang, Yanglin
    Xu, Yanbin
    [J]. 2023 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, I2MTC, 2023,
  • [6] Roadway Crack Segmentation Based on an Encoder-decoder Deep Network with Multi-scale Convolutional Blocks
    Sun, Mengyuan
    Guo, Runhua
    Zhu, Jinhui
    Fan, Wenhui
    [J]. 2020 10TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2020, : 869 - 874
  • [7] Iterative Convolutional Encoder-Decoder Network with Multi-Scale Context Learning for Liver Segmentation
    Zhang, Feiyan
    Yan, Shuhao
    Zhao, Yizhong
    Gao, Yuan
    Li, Zhi
    Lu, Xuesong
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [8] MGU-Net: a multiscale gate attention encoder-decoder network for medical image segmentation
    Liu, Le
    Chen, Qi
    Su, Jian
    Du, Xiao Gang
    Lei, Tao
    Wan, Yong
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2023, 71 (04) : 275 - 285
  • [9] Iterative Deep Convolutional Encoder-Decoder Network for Medical Image Segmentation
    Kim, Jung Uk
    Kim, Hak Gu
    Ro, Yong Man
    [J]. 2017 39TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2017, : 685 - 688
  • [10] Attention Guided Encoder-Decoder Network With Multi-Scale Context Aggregation for Land Cover Segmentation
    Wang, Shuyang
    Mu, Xiaodong
    Yang, Dongfang
    He, Hao
    Zhao, Peng
    [J]. IEEE ACCESS, 2020, 8 : 215299 - 215309