Bridging the gap between visual and auditory feature spaces for cross-media retrieval

被引:0
|
作者
Hong Zhang [1 ]
Fei Wu [1 ]
机构
[1] Zhejiang Univ, Inst Artificial Intelligence, Hangzhou 310027, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
cross-media retrieval; canonical correlation; relevance feedback; dynamic cross-media ranking;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-media retrieval is an interesting research problem, which seeks to breakthrough the limitation of modality so that users can query multimedia objects by examples of different modalities. In this paper we present a novel approach to learn the underlying correlation between visual and auditory feature spaces for cross-media retrieval. A semi-supervised Correlation Preserving Mapping (SSCPM) is described to learn the isomorphic SSCPM subspace where canonical correlations between original visual and auditory features are furthest preserved. Based on user interactions of relevance feedback, local semantic clusters are formed for images and audios respectively. With the dynamic spread of ranking scores of positive and negative examples, crossmedia semantic correlations are refined, and cross-media distance is accurately estimated. Experiment results are encouraging and show that the performance of our approach is effective.
引用
收藏
页码:596 / 605
页数:10
相关论文
共 50 条
  • [31] Cross-media retrieval based on linear discriminant analysis
    Qi, Yudan
    Zhang, Huaxiang
    Zhang, Bin
    Wang, Li
    Zheng, Shunxin
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (17) : 24249 - 24268
  • [32] Bagging-based cross-media retrieval algorithm
    Xu, Gongwen
    Zhang, Yu
    Yin, Mingshan
    Hong, Wenzhong
    Zou, Ran
    Wang, Shanshan
    [J]. SOFT COMPUTING, 2023, 27 (05) : 2615 - 2623
  • [33] Cross-media retrieval with collective deep semantic learning
    Bin Zhang
    Lei Zhu
    Jiande Sun
    Huaxiang Zhang
    [J]. Multimedia Tools and Applications, 2018, 77 : 22247 - 22266
  • [34] Cross-media retrieval based on synthesis reasoning model
    College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao, 2009, 9 (1307-1314):
  • [35] Finding the best picture: Cross-media retrieval of content
    Deschacht, Koen
    Moens, Marie-Francine
    [J]. ADVANCES IN INFORMATION RETRIEVAL, 2008, 4956 : 539 - 546
  • [36] ENHANCED ISOMORPHIC SEMANTIC REPRESENTATION FOR CROSS-MEDIA RETRIEVAL
    Liu, Ting
    Zhao, Yao
    Wei, Shikui
    Wei, Yunchao
    Liao, Lixin
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 967 - 972
  • [37] Cross-media database retrieval system based on TOTEM
    Zeng, Cheng
    Zhou, Haiyang
    Yan, Bing
    [J]. WEB INFORMATION SYSTEMS - WISE 2006 WORKSHOPS, PROCEEDINGS, 2006, 4256 : 182 - 193
  • [38] Semantic Topic Multimodal Hashing for Cross-Media Retrieval
    Wang, Di
    Gao, Xinbo
    Wang, Xiumei
    He, Lihuo
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3890 - 3896
  • [39] Combining Global and Local Similarity for Cross-Media Retrieval
    Li, Zhixin
    Ling, Feng
    Zhang, Canlong
    Ma, Huifang
    [J]. IEEE ACCESS, 2020, 8 (08): : 21847 - 21856
  • [40] Temporal Cross-Media Retrieval with Soft-Smoothing
    Semedo, David
    Magalhaes, Joao
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1038 - 1046