High-Quality Text-to-Image Generation Using High-Detail Feature-Preserving Network

被引:0
|
作者
Hsu, Wei-Yen [1 ,2 ,3 ]
Lin, Jing-Wen [1 ]
机构
[1] Natl Chung Cheng Univ, Dept Informat Management, Chiayi 62102, Taiwan
[2] Natl Chung Cheng Univ, Adv Inst Mfg High Tech Innovat, Chiayi 62102, Taiwan
[3] Natl Chung Cheng Univ, Ctr Innovat Res Aging Society CIRAS, Chiayi 62102, Taiwan
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 02期
关键词
generative adversarial network; text-to-image generation; high detail; feature preservation;
D O I
10.3390/app15020706
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Multistage text-to-image generation algorithms have shown remarkable success. However, the images produced often lack detail and suffer from feature loss. This is because these methods mainly focus on extracting features from images and text, using only conventional residual blocks for post-extraction feature processing. This results in the loss of features, greatly reducing the quality of the generated images and necessitating more resources for feature calculation, which will severely limit the use and application of optical devices such as cameras and smartphones. To address these issues, the novel High-Detail Feature-Preserving Network (HDFpNet) is proposed to effectively generate high-quality, near-realistic images from text descriptions. The initial text-to-image generation (iT2IG) module is used to generate initial feature maps to avoid feature loss. Next, the fast excitation-and-squeeze feature extraction (FESFE) module is proposed to recursively generate high-detail and feature-preserving images with lower computational costs through three steps: channel excitation (CE), fast feature extraction (FFE), and channel squeeze (CS). Finally, the channel attention (CA) mechanism further enriches the feature details. Compared with the state of the art, experimental results obtained on the CUB-Bird and MS-COCO datasets demonstrate that the proposed HDFpNet achieves better performance and visual presentation, especially regarding high-detail images and feature preservation.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] High-quality holographic stereogram generation using four RGBD images
    Fachada, Sarah
    Bonatto, Daniele
    Lafruit, Gauthier
    APPLIED OPTICS, 2021, 60 (04) : A250 - A259
  • [42] A High-Quality Generation Approach for Educational Programming Projects Using LLM
    Song, Tian
    Zhang, Hang
    Xiao, Yijia
    IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2024, 17 : 2296 - 2309
  • [43] Generation of high-quality lines and arrays using nanoparticle controlling processes
    Huh, Seung H.
    Riu, Doh H.
    Naono, Y.
    Taguchi, Y.
    Kawabata, S.
    Nakajima, A.
    APPLIED PHYSICS LETTERS, 2007, 91 (09)
  • [44] Rapid high-quality PET Patlak parametric image generation based on direct reconstruction and temporal nonlocal neural network
    Xie, Nuobei
    Gong, Kuang
    Guo, Ning
    Qin, Zhixing
    Wu, Zhifang
    Liu, Huafeng
    Li, Quanzheng
    NEUROIMAGE, 2021, 240
  • [45] Using High-Quality Feature for Weakly-Supervised Camouflaged Object Detection
    Wu, Weijie
    Tong, Yiqiu
    Jiang, Qijun
    Chen, Lina
    Gao, Hong
    WEB AND BIG DATA, APWEB-WAIM 2024, PT V, 2024, 14965 : 165 - 178
  • [46] HIGH QUALITY MONOCULAR DEPTH ESTIMATION VIA A MULTI-SCALE NETWORK AND A DETAIL-PRESERVING OBJECTIVE
    Jiang, Hualie
    Huang, Rui
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1920 - 1924
  • [47] High-quality face image generation using particle swarm optimization-based generative adversarial networks
    Zhang, Long
    Zhao, Lin
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 122 : 98 - 104
  • [48] Graphics processing unit-accelerated high-quality watercolor painting image generation
    Huang, Jiamian
    Ito, Yasuaki
    Nakano, Koji
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (19):
  • [49] High-Quality and Diverse Few-Shot Image Generation via Masked Discrimination
    Zhu, Jingyuan
    Ma, Huimin
    Chen, Jiansheng
    Yuan, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2950 - 2965
  • [50] High-quality image matching and automated generation of 3D tree models
    Baltsavias, E.
    Gruen, A.
    Eisenbeiss, H.
    Zhang, L.
    Waser, L. T.
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2008, 29 (05) : 1243 - 1259