Grounded Vocabulary for Image Retrieval Using a Modified Multi-Generator Generative Adversarial Network

Cited by: 0
Authors
Kim, Kuekyeng [1]
Park, Chanjun [1]
Seo, Jaehyung [1]
Lim, Heuiseok [1]
Affiliations
[1] Korea Univ, Dept Comp Sci & Engn, Seoul 02841, South Korea
Keywords
Vocabulary; Generators; Image retrieval; Visualization; Bit error rate; Task analysis; Training; Artificial intelligence; artificial neural network; computer vision; image processing; search methods
DOI
10.1109/ACCESS.2021.3122547
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
With the growing need for both natural-language and visual information, the demand for research on seamless multi-modal processing for effective retrieval of these types of information has increased. However, because of the unstructured nature of images, it is difficult to retrieve images that accurately represent the input text. In this study, we utilized an augmented version of a multi-generator generative adversarial network that uses BERT embeddings and attention maps as input to enable grounded vocabulary for visual representations. We compared the performance of our proposed model with that of other state-of-the-art text-based image retrieval methods on the MSCOCO and Flickr30K datasets, and the results showed the potential of our proposed method. Even with a limited vocabulary, our proposed model was comparable to other state-of-the-art methods on R@10 and even exceeded them on R@1. Moreover, we revealed the unique properties of our method by demonstrating how it could perform successfully even when using more descriptive text or short sentences as input.
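The R@1 and R@10 figures in the abstract are recall-at-K scores. The sketch below shows one common way such a score is computed for text-to-image retrieval, assuming each query has exactly one ground-truth image. The function names, image IDs, and toy rankings are hypothetical illustrations, not taken from the paper, and the authors' exact evaluation protocol may differ.

```python
def recall_at_k(ranked_ids, relevant_id, k):
    """Return 1.0 if the ground-truth image is among the top-k results, else 0.0."""
    return 1.0 if relevant_id in ranked_ids[:k] else 0.0


def mean_recall_at_k(rankings, ground_truth, k):
    """Average recall@k over all queries (one ground-truth image per query assumed)."""
    hits = [recall_at_k(r, g, k) for r, g in zip(rankings, ground_truth)]
    return sum(hits) / len(hits)


# Hypothetical toy data: one ranked result list per text query, best match first.
rankings = [
    ["img3", "img1", "img7"],
    ["img2", "img9", "img4"],
    ["img5", "img6", "img8"],
]
# Hypothetical ground-truth image for each query; "img0" is never retrieved.
ground_truth = ["img1", "img2", "img0"]

r1 = mean_recall_at_k(rankings, ground_truth, 1)  # only the second query hits at rank 1
r2 = mean_recall_at_k(rankings, ground_truth, 2)  # first and second queries hit within top 2
print(f"R@1 = {r1:.3f}, R@2 = {r2:.3f}")
```

Under this definition R@K can only grow as K increases, so R@1 is the stricter of the two scores the abstract reports.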
Pages: 144614-144623
Number of pages: 10