Multimodal-Based Supervised Learning for Image Search Reranking

被引:0
|
作者
Zhao, Shengnan [1 ]
Ma, Jun [1 ]
Cui, Chaoran [1 ]
机构
[1] Shandong Univ, Sch Comp Sci & Technol, Jinan 250101, Peoples R China
关键词
Image search reranking; Supervised reranking; Multimodal learning;
D O I
10.1007/978-3-319-21042-1_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The aim of image search reranking is to rerank the images obtained by a conventional text-based image search engine to improve the search precision, diversity and so on. Current image reranking methods are often based on a single modality. However, it is hard to find a general modality which can work well for all kinds of queries. This paper proposes a multimodal-based supervised learning for image search reranking. First, for different modalities, different similarity graphs are constructed and different approaches are utilized to calculate the similarity between images on the graph. Exploiting the similarity graphs and the initial list, we integrate the multiple modality into query-independent reranking features, namely PageRank Pseudo Relevance Feedback, Density Feature, Initial Ranking Score Feature, and then fuse them into a 19-dimensional feature vector for each image. After that, the supervised method is employed to learn the weight of each reranking feature. The experiments constructed on the MSRA-MM Dataset demonstrate the improvement in robust and effectiveness of the proposed method.
引用
收藏
页码:135 / 147
页数:13
相关论文
共 50 条
  • [1] Multimodal Graph-Based Reranking for Web Image Search
    Wang, Meng
    Li, Hao
    Tao, Dacheng
    Lu, Ke
    Wu, Xindong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (11) : 4649 - 4661
  • [2] Optimizing Multimodal Reranking for Web Image Search
    Li, Hao
    Wang, Meng
    Li, Zhisheng
    Zha, Zheng-Jun
    Shen, Jialie
    [J]. PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1119 - 1120
  • [3] DEEP LEARNING FOR MULTIMODAL-BASED VIDEO INTERESTINGNESS PREDICTION
    Shen, Yuesong
    Demarty, Claire-Helene
    Duong, Ngoc Q. K.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1003 - 1008
  • [4] Image Search Reranking with Transductive Learning to Rank Framework
    Zhang, Jing
    Jing, Peiguang
    Ji, Zhong
    Su, Yuting
    [J]. INFORMATION COMPUTING AND APPLICATIONS, PT II, 2011, 244 : 529 - 536
  • [5] Prototype-Based Image Search Reranking
    Yang, Linjun
    Hanjalic, Alan
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (03) : 871 - 882
  • [6] Hypergraph learning with collaborative representation for image search reranking
    Bouhlel, Noura
    Feki, Ghada
    Ben Ammar, Anis
    Ben Amar, Chokri
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2020, 9 (03) : 205 - 214
  • [7] Hypergraph learning with collaborative representation for image search reranking
    Noura Bouhlel
    Ghada Feki
    Anis Ben Ammar
    Chokri Ben Amar
    [J]. International Journal of Multimedia Information Retrieval, 2020, 9 : 205 - 214
  • [8] Multimodal Fusion for Video Search Reranking
    Wei, Shikui
    Zhao, Yao
    Zhu, Zhenfeng
    Liu, Nan
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (08) : 1191 - 1199
  • [9] Semi-supervised LPP algorithms for learning-to-rank-based visual search reranking
    Ji, Zhong
    Pang, Yanwei
    He, Yuqing
    Zhang, Huanfen
    [J]. INFORMATION SCIENCES, 2015, 302 : 83 - 93
  • [10] Glimpse: A Multimodal-based Transforming Image Collection with Vector Database
    Ho, Huu-Tuong
    Pham, Minh-Tien
    Tran, Quang-Duong
    Pham, Quang-Huy
    Luong Vuong Nguyen
    [J]. 38TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN 2024, 2024, : 415 - 420