Text-to-Image GAN-Based Scene Retrieval and Re-Ranking Considering Word Importance

Cited by: 1
Authors
Yanagi, Rintaro [1]
Togo, Ren [2]
Ogawa, Takahiro [2]
Haseyama, Miki [2]
Affiliations
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo, Hokkaido 0600814, Japan
[2] Hokkaido Univ, Fac Informat Sci & Technol, Div Media & Network Technol, Sapporo, Hokkaido 0600814, Japan
Keywords
Text-to-image generative adversarial network; multimedia information retrieval; scene retrieval; re-ranking
DOI
10.1109/ACCESS.2019.2952676
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In this paper, we propose a novel scene retrieval and re-ranking method based on a text-to-image Generative Adversarial Network (GAN). The proposed method generates an image from an input query sentence based on the text-to-image GAN and then retrieves a scene that is the most similar to the generated image. By utilizing the image generated from the input query sentence as a query, we can control semantic information of the query image at the text level. Furthermore, we introduce a novel interactive re-ranking scheme to our retrieval method. Specifically, users can consider the importance of each word within the first input query sentence. Then the proposed method re-generates the query image that reflects the word importance provided by users. By updating the generated query image based on the word importance, it becomes feasible for users to revise retrieval results through this re-ranking process. In experiments, we showed that our retrieval method including the re-ranking scheme outperforms recently proposed retrieval methods.
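The retrieve-then-re-rank loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the text-to-image GAN and the image feature extractor are replaced by a hypothetical stand-in (`generate_query_features`) that maps a sentence to a weighted bag-of-words vector, and the vocabulary, scene vectors, and importance weights are all made-up assumptions. It only shows the control flow: build a query representation from text, rank scenes by similarity, then rebuild the query with user-supplied word importance and rank again.

```python
import math

# Toy vocabulary and scene features. These are illustrative assumptions,
# not data or features from the paper.
VOCAB = ["man", "dog", "beach", "car", "street"]
SCENES = {
    "scene_a": [1.0, 0.2, 0.0, 0.0, 1.0],  # man on a street
    "scene_b": [0.0, 0.3, 1.0, 0.0, 0.0],  # beach, dog barely visible
    "scene_c": [1.0, 1.0, 0.0, 0.0, 0.0],  # man and dog, no beach
}


def generate_query_features(sentence, importance=None):
    """Hypothetical stand-in for 'generate an image with a text-to-image GAN,
    then extract its features'. Each known word adds its importance weight
    (default 1.0) to the matching vocabulary slot."""
    importance = importance or {}
    feats = [0.0] * len(VOCAB)
    for word in sentence.lower().split():
        if word in VOCAB:
            feats[VOCAB.index(word)] += importance.get(word, 1.0)
    return feats


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_scenes(query_feats, scenes):
    """Return scene names ordered from most to least similar to the query."""
    return sorted(scenes, key=lambda name: cosine(query_feats, scenes[name]),
                  reverse=True)


query = "a man and a dog on the beach"

# Initial retrieval: every query word counts equally.
initial = rank_scenes(generate_query_features(query), SCENES)

# Re-ranking: the user marks "beach" as the most important word, so the
# query representation is regenerated with that word weighted up.
reranked = rank_scenes(generate_query_features(query, {"beach": 3.0}), SCENES)

print(initial)
print(reranked)
```

In this toy setup, raising the weight of "beach" moves the beach scene ahead of the scene that matches more words overall, which mirrors the idea of revising results through word importance. A real system would swap the stand-in for an actual generator and image feature extractor.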
Pages: 169920 - 169930
Page count: 11
Related papers
50 items in total
  • [41] Writer identification and writer retrieval based on NetVLAD with Re-ranking
    Rasoulzadeh, Shervin
    BabaAli, Bagher
    IET BIOMETRICS, 2022, 11 (01) : 10 - 22
  • [42] Image re-ranking based on extraction of semantic regions
    Chen Z.
    Hou J.
    Zhang D.-S.
    Zhang H.-Z.
    Zidonghua Xuebao/Acta Automatica Sinica, 2011, 37 (11) : 1356 - 1359
  • [43] R-DiP: Re-ranking Based Diffusion Pre-computation for Image Retrieval
    Kato, Tatsuya
    Komamizu, Takahiro
    Ide, Ichiro
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT II, DEXA 2024, 2024, 14911 : 233 - 247
  • [44] MF-Re-Rank: A Modality Feature-Based Re-Ranking Model for Medical Image Retrieval
    Ayadi, Hajer
    Torjmen-Khemakhem, Mouna
    Daoud, Mariam
    Huang, Jimmy Xiangji
    Ben Jemaa, Maher
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2018, 69 (09) : 1095 - 1108
  • [45] SWF-GAN: A Text-to-Image model based on sentence-word fusion perception
    Liu, Chun
    Hu, Jingsong
    Lin, Hong
    COMPUTERS & GRAPHICS-UK, 2023, 115 : 500 - 510
  • [46] X-Vision: Explainable Image Retrieval by Re-Ranking in Semantic Space
    Polley, Sayantan
    Mondal, Subhajit
    Mannam, Venkata Srinath
    Kumar, Kushagra
    Patra, Subhankar
    Nurnberger, Andreas
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4955 - 4959
  • [47] Complementary Incremental Hashing With Query-Adaptive Re-Ranking for Image Retrieval
    Tian, Xing
    Ng, Wing W. Y.
    Wang, Hui
    Kwong, Sam
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1210 - 1224
  • [48] Text-video retrieval re-ranking via multi-grained cross attention and frozen image encoders
    Dai, Zuozhuo
    Cheng, Kaihui
    Shao, Fangtao
    Dong, Zilong
    Zhu, Siyu
    PATTERN RECOGNITION, 2025, 159
  • [49] Scalable Face Image Retrieval with Identity-Based Quantization and Multi-Reference Re-ranking
    Wu, Zhong
    Ke, Qifa
    Sun, Jian
    Shum, Heung-Yeung
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3469 - 3476
  • [50] Re-ranking of Stereo Video Retrieval Results Based on Clustering and Density
    Duan, Fengfeng
    PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 1612 - 1615