High-Quality Text-to-Image Generation Using High-Detail Feature-Preserving Network

Citations: 0

Authors:

Hsu, Wei-Yen ^{[1,2,3]}

Lin, Jing-Wen ^{[1]}

Affiliations:

[1] Natl Chung Cheng Univ, Dept Informat Management, Chiayi 62102, Taiwan

[2] Natl Chung Cheng Univ, Adv Inst Mfg High Tech Innovat, Chiayi 62102, Taiwan

[3] Natl Chung Cheng Univ, Ctr Innovat Res Aging Society CIRAS, Chiayi 62102, Taiwan

Source:

APPLIED SCIENCES-BASEL | 2025 / Vol. 15 / Issue 02

Keywords:

generative adversarial network; text-to-image generation; high detail; feature preservation

DOI:

10.3390/app15020706

Chinese Library Classification (CLC) number:

O6 [Chemistry]

Discipline classification code:

0703

Abstract:

Multistage text-to-image generation algorithms have shown remarkable success. However, the images produced often lack detail and suffer from feature loss. This is because these methods mainly focus on extracting features from images and text, using only conventional residual blocks for post-extraction feature processing. This results in the loss of features, greatly reducing the quality of the generated images and necessitating more resources for feature calculation, which will severely limit the use and application of optical devices such as cameras and smartphones. To address these issues, the novel High-Detail Feature-Preserving Network (HDFpNet) is proposed to effectively generate high-quality, near-realistic images from text descriptions. The initial text-to-image generation (iT2IG) module is used to generate initial feature maps to avoid feature loss. Next, the fast excitation-and-squeeze feature extraction (FESFE) module is proposed to recursively generate high-detail and feature-preserving images with lower computational costs through three steps: channel excitation (CE), fast feature extraction (FFE), and channel squeeze (CS). Finally, the channel attention (CA) mechanism further enriches the feature details. Compared with the state of the art, experimental results obtained on the CUB-Bird and MS-COCO datasets demonstrate that the proposed HDFpNet achieves better performance and visual presentation, especially regarding high-detail images and feature preservation.
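The abstract names the three FESFE steps (channel excitation, fast feature extraction, channel squeeze) and a channel attention mechanism, but gives no layer-level detail. The NumPy sketch below is only one plausible reading of that pipeline, not the authors' implementation. It assumes that excitation and squeeze are 1x1 convolutions that widen and then narrow the channel dimension, that the "fast" extraction is a per-channel (depthwise) 3x3 convolution, that the block has a residual connection, and that channel attention is a gate computed from global average pooling. All function names, shapes, and the expansion ratio `r` are hypothetical.

```python
import numpy as np

def pointwise(x, w):
    """1x1 convolution: mix channels at every pixel.
    x: (C_in, H, W), w: (C_out, C_in) -> (C_out, H, W)."""
    return np.einsum("oc,chw->ohw", w, x)

def depthwise3x3(x, k):
    """Per-channel 3x3 convolution with zero padding.
    x: (C, H, W), k: (C, 3, 3) -> (C, H, W)."""
    _, h, w = x.shape
    xp = np.pad(x, ((0, 0), (1, 1), (1, 1)))
    out = np.zeros_like(x)
    for dy in range(3):
        for dx in range(3):
            out += k[:, dy, dx, None, None] * xp[:, dy:dy + h, dx:dx + w]
    return out

def channel_attention(x, w1, w2):
    """Scale each channel by a gate in (0, 1) derived from its global mean."""
    s = x.mean(axis=(1, 2))                       # global average pooling, (C,)
    hidden = np.maximum(w1 @ s, 0.0)              # small bottleneck with ReLU
    gate = 1.0 / (1.0 + np.exp(-(w2 @ hidden)))   # sigmoid gate, (C,)
    return x * gate[:, None, None]

def fesfe_like_block(x, w_up, k, w_down):
    """Assumed excitation -> extraction -> squeeze block with a residual path."""
    y = pointwise(x, w_up)                    # channel excitation: C -> r*C
    y = np.maximum(depthwise3x3(y, k), 0.0)   # cheap spatial feature extraction
    y = pointwise(y, w_down)                  # channel squeeze: r*C -> C
    return x + y                              # residual connection (assumed)

rng = np.random.default_rng(0)
C, H, W, r = 8, 16, 16, 4                     # hypothetical sizes
x = rng.standard_normal((C, H, W))
w_up = rng.standard_normal((r * C, C)) * 0.1
k = rng.standard_normal((r * C, 3, 3)) * 0.1
w_down = rng.standard_normal((C, r * C)) * 0.1
w1 = rng.standard_normal((C // 2, C)) * 0.1
w2 = rng.standard_normal((C, C // 2)) * 0.1

features = fesfe_like_block(x, w_up, k, w_down)
out = channel_attention(features, w1, w2)
print(out.shape)
```

Under these assumptions the spatial filtering is done per channel, so its cost grows linearly with the widened channel count rather than quadratically, which is one common way to keep such a block inexpensive.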

Pages: 16
