Text-to-Image GAN-Based Scene Retrieval and Re-Ranking Considering Word Importance

被引:1
|
作者
Yanagi, Rintaro [1 ]
Togo, Ren [2 ]
Ogawa, Takahiro [2 ]
Haseyama, Miki [2 ]
机构
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo, Hokkaido 0600814, Japan
[2] Hokkaido Univ, Fac Informat Sci & Technol, Div Media & Network Technol, Sapporo, Hokkaido 0600814, Japan
关键词
Text-to-image generative adversarial network; multimedia information retrieval; scene retrieval; re-ranking;
D O I
10.1109/ACCESS.2019.2952676
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a novel scene retrieval and re-ranking method based on a text-to-image Generative Adversarial Network (GAN). The proposed method generates an image from an input query sentence based on the text-to-image GAN and then retrieves a scene that is the most similar to the generated image. By utilizing the image generated from the input query sentence as a query, we can control semantic information of the query image at the text level. Furthermore, we introduce a novel interactive re-ranking scheme to our retrieval method. Specifically, users can consider the importance of each word within the first input query sentence. Then the proposed method re-generates the query image that reflects the word importance provided by users. By updating the generated query image based on the word importance, it becomes feasible for users to revise retrieval results through this re-ranking process. In experiments, we showed that our retrieval method including the re-ranking scheme outperforms recently proposed retrieval methods.
引用
收藏
页码:169920 / 169930
页数:11
相关论文
共 50 条
  • [1] SCENE RETRIEVAL FOR VIDEO SUMMARIZATION BASED ON TEXT-TO-IMAGE GAN
    Yanagi, Rintaro
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1825 - 1829
  • [2] Learnable Pillar-based Re-ranking for Image-Text Retrieval
    Qu, Leigang
    Liu, Meng
    Wang, Wenjie
    Zheng, Zhedong
    Nie, Liqiang
    Chua, Tat-Seng
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1252 - 1261
  • [3] Scene Retrieval from Multiple Resolution Generated Images Based on Text-to-image GAN
    Yanagi, Rintaro
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [4] User Log Based Image Re-ranking and Retrieval
    Sangeetha, S.
    Varma, S.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING 2016 (ICCASP 2016), 2017, 137 : 653 - 660
  • [5] Query is GAN: Scene Retrieval With Attentional Text-to-Image Generative Adversarial Network
    Yanagi, Rintaro
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    IEEE ACCESS, 2019, 7 : 153183 - 153193
  • [6] FAST GEOMETRIC RE-RANKING FOR IMAGE-BASED RETRIEVAL
    Tsai, Sam S.
    Chen, David
    Takacs, Gabriel
    Chandrasekhar, Vijay
    Vedantham, Ramakrishna
    Grzeszczuk, Radek
    Girod, Bernd
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 1029 - 1032
  • [7] Efficient Re-ranking in Vocabulary Tree based Image Retrieval
    Wang, Xiaoyu
    Yang, Ming
    Yu, Kai
    2011 CONFERENCE RECORD OF THE FORTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS (ASILOMAR), 2011, : 855 - 859
  • [8] Adaptive Query Re-ranking Based on ImageGraph for Image Retrieval
    Fan, Haonan
    Hu, Hai-Miao
    Wang, Rong
    Zhang, Yugui
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 4593 - 4599
  • [9] TIAR: Text-Image-Audio Retrieval with weighted multimodal re-ranking
    Peide Chi
    Yong Feng
    Mingliang Zhou
    Xian-cai Xiong
    Yong-heng Wang
    Bao-hua Qiang
    Applied Intelligence, 2023, 53 : 22898 - 22916
  • [10] TIAR: Text-Image-Audio Retrieval with weighted multimodal re-ranking
    Chi, Peide
    Feng, Yong
    Zhou, Mingliang
    Xiong, Xian-cai
    Wang, Yong-heng
    Qiang, Bao-hua
    APPLIED INTELLIGENCE, 2023, 53 (19) : 22898 - 22916