Understanding visual-auditory correlation from heterogeneous features for cross-media retrieval

Authors
Hong Zhang
Yan-yun Wang
Hong Pan
Fei Wu
Affiliations
[1] Wuhan University of Science and Technology, College of Computer Science and Technology
[2] Zhejiang University, School of Computer Science and Technology
[3] Hangzhou Normal University, School of Elementary Education
[4] Hangzhou Normal University, School of Information Engineering
Source
Journal of Zhejiang University SCIENCE A | 2008 / Vol. 9
Keywords
Heterogeneity; Cross-media retrieval; Subspace optimization; Dynamic correlation update
Document code
A
DOI
Not available
CLC number
TP37; TP391
Abstract
Cross-media retrieval is a research topic that seeks to remove the barriers between different modalities. Enabling it requires correlation measures between heterogeneous low-level features, together with a way to judge semantic similarity. This paper presents a novel approach to learning the cross-media correlation between visual and auditory features for image-audio retrieval. A semi-supervised correlation preserving mapping (SSCPM) method is described that constructs an isomorphic SSCPM subspace in which the canonical correlations between the original visual and auditory features are preserved. A subspace optimization algorithm is proposed to improve the quality of local image clusters and audio clusters interactively. A relevance feedback strategy is developed to update the knowledge of cross-media correlation by learning from user behavior, so that retrieval performance improves progressively. Experimental results show that the approach is effective.
Pages: 241-249
Page count: 8