Understanding visual-auditory correlation from heterogeneous features for cross-media retrieval

Cited by: 0
Authors
Hong Zhang
Yan-yun Wang
Hong Pan
Fei Wu
Affiliations
[1] Wuhan University of Science and Technology, College of Computer Science and Technology
[2] Zhejiang University, School of Computer Science and Technology
[3] Hangzhou Normal University, School of Elementary Education
[4] Hangzhou Normal University, School of Information Engineering
Source
Journal of Zhejiang University SCIENCE A | 2008 / Vol. 9
Keywords
Heterogeneity; Cross-media retrieval; Subspace optimization; Dynamic correlation update; A; TP37; TP391
DOI
Not available
CLC number
Subject classification code
Abstract
Cross-media retrieval is an interesting research topic that seeks to remove the barriers among different modalities. To enable cross-media retrieval, it is necessary to find correlation measures between heterogeneous low-level features and to judge semantic similarity. This paper presents a novel approach to learning the cross-media correlation between visual and auditory features for image-audio retrieval. A semi-supervised correlation preserving mapping (SSCPM) method is described to construct an isomorphic SSCPM subspace in which the canonical correlations between the original visual and auditory features are further preserved. A subspace optimization algorithm is proposed to improve the quality of local image clusters and audio clusters in an interactive way. A unique relevance feedback strategy is developed to update the knowledge of cross-media correlation by learning from user behavior, so that retrieval performance is enhanced progressively. Experimental results show that our approach is effective.
Pages: 241-249
Page count: 8
Related papers
50 in total
  • [21] Multiple kernel visual-auditory representation learning for retrieval
    Hong Zhang
    Wenping Zhang
    Wenhe Liu
    Xin Xu
    Hehe Fan
    Multimedia Tools and Applications, 2016, 75 : 9169 - 9184
  • [22] A Novel Cross-Modal Topic Correlation Model for Cross-Media Retrieval
    Cheng, Yong
    Huang, Fei
    Jin, Cheng
    Zhang, Yuejie
    Zhang, Tao
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 399 - 407
  • [23] Multiple kernel visual-auditory representation learning for retrieval
    Zhang, Hong
    Zhang, Wenping
    Liu, Wenhe
    Xu, Xin
    Fan, Hehe
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (15) : 9169 - 9184
  • [24] Harmonizing hierarchical manifolds for multimedia document semantics understanding and cross-media retrieval
    Yang, Yi
    Zhuang, Yue-Ting
    Wu, Fei
    Pan, Yun-He
    IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (03) : 437 - 446
  • [25] Cross-media retrieval based on semi-supervised regularization and correlation learning
    Hong Zhang
    Gang Dai
    Du Tang
    Xin Xu
    Multimedia Tools and Applications, 2018, 77 : 22455 - 22473
  • [26] Cross-media retrieval based on semi-supervised regularization and correlation learning
    Zhang, Hong
    Dai, Gang
    Tang, Du
    Xu, Xin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (17) : 22455 - 22473
  • [27] Cross-media retrieval by exploiting fine-grained correlation at entity level
    Huang, Lei
    Peng, Yuxin
    NEUROCOMPUTING, 2017, 236 : 123 - 133
  • [28] Efficient Manifold Ranking for Cross-media retrieval
    Ma, ShaoQin
    Zhang, Hong
    PROCEEDINGS OF THE 2018 13TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2018), 2018, : 335 - 340
  • [29] Cross-media Relevance Computation for Multimedia Retrieval
    Dong, Jianfeng
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 831 - 835
  • [30] Learning semantic correlations for cross-media retrieval
    Wu, Fei
    Zhang, Hong
    Zhuang, Yueting
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 1465 - +