Mask Embedding for Realistic High-Resolution Medical Image Synthesis

被引:7
|
作者
Ren, Yinhao [1 ]
Zhu, Zhe [2 ]
Li, Yingzhou [3 ]
Kong, Dehan [4 ]
Hou, Rui [5 ]
Grimm, Lars J. [2 ]
Marks, Jeffery R. [6 ]
Lo, Joseph Y. [1 ,2 ,5 ]
机构
[1] Duke Univ, Dept Biomed Engn, Durham, NC 27706 USA
[2] Duke Univ, Sch Med, Dept Radiol, Durham, NC 27706 USA
[3] Duke Univ, Dept Math, Durham, NC 27706 USA
[4] Beijing Inst Technol, Dept Automat, Beijing, Peoples R China
[5] Duke Univ, Dept Elect Engn, Durham, NC 27706 USA
[6] Duke Univ, Sch Med, Dept Surg, Durham, NC 27706 USA
关键词
Generative Adversarial Networks; Image synthesis; Mask embedding; Mammogram;
D O I
10.1007/978-3-030-32226-7_47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generative Adversarial Networks (GANs) have found applications in natural image synthesis and begin to show promises generating synthetic medical images. In many cases, the ability to perform controlled image synthesis using masked priors such as shape and size of organs is desired. However, mask-guided image synthesis is challenging due to the pixel level mask constraint. While the few existing mask-guided image generation approaches suffer from the lack of fine-grained texture details, we tackle the issue of mask-guided stochastic image synthesis via mask embedding. Our novel architecture first encodes the input mask as an embedding vector and then inject these embedding into the random latent vector input. The intuition is to classify semantic masks into partitions before feature up-sampling for improved sample space mapping stability. We validate our approach on a large dataset containing 39,778 patients with 443,556 negative screening Full Field Digital Mammography (FFDM) images. Experimental results show that our approach can generate realistic high-resolution (256x512) images with pixel-level mask constraints, and outperform other state-of-the-art approaches.
引用
收藏
页码:422 / 430
页数:9
相关论文
共 50 条
  • [1] High-Resolution Realistic Image Synthesis from Text Using Iterative Generative Adversarial Network
    Ullah, Anwar
    Yu, Xinguo
    Majid, Abdul
    Rahman, Hafiz Ur
    Mughal, M. Farhan
    [J]. IMAGE AND VIDEO TECHNOLOGY (PSIVT 2019), 2019, 11854 : 211 - 224
  • [2] Latent space manipulation for high-resolution medical image synthesis via the StyleGAN
    Fetty, Lukas
    Bylund, Mikael
    Kuess, Peter
    Heilemann, Gerd
    Nyholm, Tufve
    Georg, Dietmar
    Lofstedt, Tommy
    [J]. ZEITSCHRIFT FUR MEDIZINISCHE PHYSIK, 2020, 30 (04): : 305 - 314
  • [3] GP-GAN: Towards Realistic High-Resolution Image Blending
    Wu, Huikai
    Zheng, Shuai
    Zhang, Junge
    Huang, Kaiqi
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2487 - 2495
  • [4] Taming Transformers for High-Resolution Image Synthesis
    Esser, Patrick
    Rombach, Robin
    Ommer, Bjoern
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12868 - 12878
  • [5] Any-Resolution Training for High-Resolution Image Synthesis
    Chai, Lucy
    Gharbi, Michael
    Shechtman, Eli
    Isola, Phillip
    Zhang, Richard
    [J]. COMPUTER VISION - ECCV 2022, PT XVI, 2022, 13676 : 170 - 188
  • [6] CNN-based synthesis of realistic high-resolution LiDAR data
    Triess, Larissa T.
    Peter, David
    Rist, Christoph B.
    Enzweiler, Markus
    Zoellner, J. Marius
    [J]. 2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 1512 - 1519
  • [7] THEOREM FOR HIGH-RESOLUTION HIGH-CONTRAST IMAGE SYNTHESIS
    BUCKLEW, JA
    SALEH, BEA
    [J]. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1985, 2 (08): : 1233 - 1236
  • [8] High-Resolution Swin Transformer for Automatic Medical Image Segmentation
    Wei, Chen
    Ren, Shenghan
    Guo, Kaitai
    Hu, Haihong
    Liang, Jimin
    [J]. SENSORS, 2023, 23 (07)
  • [9] Cloud Deployment of High-Resolution Medical Image Analysis With TOMAAT
    Milletari, Fausto
    Frei, Johann
    Aboulatta, Moustafa
    Vivar, Gerome
    Ahmadi, Seyed-Ahmad
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (03) : 969 - 977
  • [10] High-Resolution Image Synthesis with Latent Diffusion Models
    Rombach, Robin
    Blattmann, Andreas
    Lorenz, Dominik
    Esser, Patrick
    Ommer, Bjoern
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10674 - 10685