Fine-grained image emotion captioning based on Generative Adversarial Networks

被引:0
|
作者
Yang, Chunmiao [1 ]
Wang, Yang [1 ]
Han, Liying [1 ]
Jia, Xiran [1 ]
Sun, Hebin [1 ]
机构
[1] Hebei Univ Technol, Sch Elect Informat Engn, Tianjin 300130, Peoples R China
基金
中国国家自然科学基金;
关键词
Generative Adversarial Network (GAN); Gate Recurrent Unit (GRU); Capsule neural network; Image emotion captioning;
D O I
10.1007/s11042-024-18680-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image captioning, which combines natural language processing and computer vision, has developed rapidly in recent years. It tends to be applied in data retrieval, blind navigation, intelligent transportation, smart home, medical assistance, news media and other domains. In order to elevate the consistency and abundance of image captioning languages and express people's subjective emotions effectively, a Generative Adversarial Network (GAN) is applied in this paper to obtain multi-stylized image emotion captions and generate two captions containing positive and negative emotions, respectively. Among them, Residual Network (ResNet) and Gate Recurrent Unit (GRU) are integrated into the generator, while the capsule neural network is applied to the discriminator. We conduct experiments on the popular MSCOCO and Senticap datasets to validate the model and demonstrate its satisfied performance in comparison to current advanced image captioning approaches.
引用
收藏
页码:81857 / 81875
页数:19
相关论文
共 50 条
  • [1] AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks
    Xu, Tao
    Zhang, Pengchuan
    Huang, Qiuyuan
    Zhang, Han
    Gan, Zhe
    Huang, Xiaolei
    He, Xiaodong
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1316 - 1324
  • [2] Model-Guided Generative Adversarial Networks for Unsupervised Fine-Grained Image Generation
    Xiao, Jian
    Bi, Xiaojun
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1188 - 1199
  • [3] Fine-Grained Features for Image Captioning
    Shao, Mengyue
    Feng, Jie
    Wu, Jie
    Zhang, Haixiang
    Zheng, Yayu
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (03): : 4697 - 4712
  • [4] Conditional generative adversarial network for EEG-based emotion fine-grained estimation and visualization
    Fu, Boxun
    Li, Fu
    Niu, Yi
    Wu, Hao
    Li, Yang
    Shi, Guangming
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
  • [5] A Novel Image Captioning Method Based on Generative Adversarial Networks
    Fan, Yang
    Xu, Jungang
    Sun, Yingfei
    Wang, Yiyu
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: TEXT AND TIME SERIES, PT IV, 2019, 11730 : 281 - 292
  • [6] Fine-grained facial image-to-image translation with an attention based pipeline generative adversarial framework
    Yan Zhao
    Ziqiang Zheng
    Chao Wang
    Zhaorui Gu
    Min Fu
    Zhibin Yu
    Haiyong Zheng
    Nan Wang
    Bing Zheng
    [J]. Multimedia Tools and Applications, 2020, 79 : 14981 - 15000
  • [7] Fine-grained facial image-to-image translation with an attention based pipeline generative adversarial framework
    Zhao, Yan
    Zheng, Ziqiang
    Wang, Chao
    Gu, Zhaorui
    Fu, Min
    Yu, Zhibin
    Zheng, Haiyong
    Wang, Nan
    Zheng, Bing
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 14981 - 15000
  • [8] Fine-grained image inpainting with scale-enhanced generative adversarial network
    Liu, Weirong
    Cao, Chengrui
    Liu, Jie
    Ren, Chenwen
    Wei, Yulin
    Guo, Honglin
    [J]. PATTERN RECOGNITION LETTERS, 2021, 143 : 81 - 87
  • [9] Fine-Grained Urban Flow Inferring via Conditional Generative Adversarial Networks
    Zhang, Xv
    Xu, Yuanbo
    Li, Ying
    Yang, Yongjian
    [J]. WEB AND BIG DATA, PT III, APWEB-WAIM 2022, 2023, 13423 : 420 - 434
  • [10] Generative Adversarial Networks to Improve Fetal Brain Fine-Grained Plane Classification
    Montero, Alberto
    Bonet-Carne, Elisenda
    Burgos-Artizzu, Xavier Paolo
    [J]. SENSORS, 2021, 21 (23)