High-Quality Text-to-Image Generation Using High-Detail Feature-Preserving Network

被引:0
|
作者
Hsu, Wei-Yen [1 ,2 ,3 ]
Lin, Jing-Wen [1 ]
机构
[1] Natl Chung Cheng Univ, Dept Informat Management, Chiayi 62102, Taiwan
[2] Natl Chung Cheng Univ, Adv Inst Mfg High Tech Innovat, Chiayi 62102, Taiwan
[3] Natl Chung Cheng Univ, Ctr Innovat Res Aging Society CIRAS, Chiayi 62102, Taiwan
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 02期
关键词
generative adversarial network; text-to-image generation; high detail; feature preservation;
D O I
10.3390/app15020706
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Multistage text-to-image generation algorithms have shown remarkable success. However, the images produced often lack detail and suffer from feature loss. This is because these methods mainly focus on extracting features from images and text, using only conventional residual blocks for post-extraction feature processing. This results in the loss of features, greatly reducing the quality of the generated images and necessitating more resources for feature calculation, which will severely limit the use and application of optical devices such as cameras and smartphones. To address these issues, the novel High-Detail Feature-Preserving Network (HDFpNet) is proposed to effectively generate high-quality, near-realistic images from text descriptions. The initial text-to-image generation (iT2IG) module is used to generate initial feature maps to avoid feature loss. Next, the fast excitation-and-squeeze feature extraction (FESFE) module is proposed to recursively generate high-detail and feature-preserving images with lower computational costs through three steps: channel excitation (CE), fast feature extraction (FFE), and channel squeeze (CS). Finally, the channel attention (CA) mechanism further enriches the feature details. Compared with the state of the art, experimental results obtained on the CUB-Bird and MS-COCO datasets demonstrate that the proposed HDFpNet achieves better performance and visual presentation, especially regarding high-detail images and feature preservation.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] MULTI-BAND MELGAN: FASTERWAVEFORM GENERATION FOR HIGH-QUALITY TEXT-TO-SPEECH
    Yang, Geng
    Yang, Shan
    Liu, Kai
    Fang, Peng
    Chen, Wei
    Xie, Lei
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 492 - 498
  • [32] Advanced Deep Learning Techniques for High-Quality Synthetic Thermal Image Generation
    Pavez, Vicente
    Hermosilla, Gabriel
    Silva, Manuel
    Farias, Gonzalo
    MATHEMATICS, 2023, 11 (21)
  • [33] High-quality facial-expression image generation for UAV pedestrian detection
    Tang, Yumin
    Fan, Jing
    Qu, Jinshuai
    FRONTIERS IN SPACE TECHNOLOGIES, 2022, 3
  • [34] High-Quality Sonar Image Generation Algorithm Based on Generative Adversarial Networks
    Wang, Zhengyang
    Guo, Qingchang
    Lei, Min
    Guo, Shuxiang
    Ye, Xiufen
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 3099 - 3104
  • [35] High-quality multiservice transmission on a single network using DWDM
    Schoenau, Paul
    Smith, Brendan
    Lightwave, 1999, 16 (07):
  • [36] SpikeVoice: High-Quality Text-to-Speech Via Efficient Spiking Neural Network
    Wang, Kexin
    Zhang, Jiahong
    Ren, Yong
    Yao, Man
    Di Shang
    Xu, Bo
    Li, Guoqi
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 7927 - 7940
  • [37] Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting
    Zeng, Yanhong
    Fu, Jianlong
    Chao, Hongyang
    Guo, Baining
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1486 - 1494
  • [38] The fast generation method based on lattice segmentation for high-quality confusion network
    Wang H.
    Han J.
    Gaojishu Tongxin/Chinese High Technology Letters, 2010, 20 (05): : 473 - 480
  • [39] A Novel Method to Generate a High-Quality Image by Using a Stereo Camera
    Ji, Seo-Won
    Yeo, Yoon-Jae
    Kang, Seok-Jae
    Im, Joon-Hyuk
    Ko, Sung-Jea
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
  • [40] A High-Quality and Convenient Camera Calibration Method Using a Single Image
    Qin, Xufang
    Xia, Xiaohua
    Xiang, Huatao
    ELECTRONICS, 2024, 13 (22)