Bridging the gap between visual and auditory feature spaces for cross-media retrieval

被引:0
|
作者
Hong Zhang [1 ]
Fei Wu [1 ]
机构
[1] Zhejiang Univ, Inst Artificial Intelligence, Hangzhou 310027, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
cross-media retrieval; canonical correlation; relevance feedback; dynamic cross-media ranking;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-media retrieval is an interesting research problem, which seeks to breakthrough the limitation of modality so that users can query multimedia objects by examples of different modalities. In this paper we present a novel approach to learn the underlying correlation between visual and auditory feature spaces for cross-media retrieval. A semi-supervised Correlation Preserving Mapping (SSCPM) is described to learn the isomorphic SSCPM subspace where canonical correlations between original visual and auditory features are furthest preserved. Based on user interactions of relevance feedback, local semantic clusters are formed for images and audios respectively. With the dynamic spread of ranking scores of positive and negative examples, crossmedia semantic correlations are refined, and cross-media distance is accurately estimated. Experiment results are encouraging and show that the performance of our approach is effective.
引用
收藏
页码:596 / 605
页数:10
相关论文
共 50 条
  • [21] Image Retrieval by Cross-Media Relevance Fusion
    Dong, Jianfeng
    Li, Xirong
    Liao, Shuai
    Xu, Jieping
    Xu, Duanqing
    Du, Xiaoyong
    [J]. MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 173 - 176
  • [22] Present Development and Prospect of Cross-Media Retrieval
    Yin Zhenzhen
    Wang Feng
    Li Bin
    Zhang Lianjie
    [J]. 2012 THIRD INTERNATIONAL CONFERENCE ON THEORETICAL AND MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE (ICTMF 2012), 2013, 38 : 889 - 894
  • [23] Cross-media retrieval: Concepts, advances and challenges
    Zhuang, Yueting
    Wu, Fei
    Zhang, Hong
    Yang, Yi
    [J]. PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: 50 YEARS' ACHIEVEMENTS, FUTURE DIRECTIONS AND SOCIAL IMPACTS, 2006, : 847 - 850
  • [24] Cross-media retrieval: Concepts, advances and challenges
    Zhuang, Yueting
    Wu, Fei
    Zhang, Hong
    Yang, Yi
    [J]. PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: 50 YEARS' ACHIEVEMENTS, FUTURE DIRECTIONS AND SOCIAL IMPACTS, 2006, : 377 - 380
  • [25] Towards Cross-Media Information Spaces and Architectures
    Signer, Beat
    [J]. 2019 13TH INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN INFORMATION SCIENCE (RCIS), 2019, : 23 - 29
  • [26] Toward cross-language and cross-media image retrieval
    Alvarez, C
    Oumohmed, AI
    Mignotte, M
    Nie, JY
    [J]. MULTILINGUAL INFORMATION ACCESS FOR TEXT, SPEECH AND IMAGES, 2005, 3491 : 676 - 687
  • [27] CROSS-MODALITY CORRELATION PROPAGATION FOR CROSS-MEDIA RETRIEVAL
    Zhai, Xiaohua
    Peng, Yuxin
    Xiao, Jianguo
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2337 - 2340
  • [28] Internet cross-media retrieval based on deep learning
    Jiang, Bin
    Yang, Jiachen
    Lv, Zhihan
    Tian, Kun
    Meng, Qinggang
    Yan, Yan
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 48 : 356 - 366
  • [29] LEARNING OPTIMAL DATA REPRESENTATION FOR CROSS-MEDIA RETRIEVAL
    Zhang, Hong
    Chen, Li
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1925 - 1928
  • [30] Complementary information retrieval for cross-media news content
    Ma, Qiang
    Nadamoto, Akiyo
    Tanaka, Katsumi
    [J]. INFORMATION SYSTEMS, 2006, 31 (07) : 659 - 678