Understanding visual-auditory correlation from heterogeneous features for cross-media retrieval

被引:0
|
作者
Hong ZHANG1
机构
基金
中国国家自然科学基金;
关键词
Heterogeneity; Cross-media retrieval; Subspace optimization; Dynamic correlation update;
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
Cross-media retrieval is an interesting research topic,which seeks to remove the barriers among different modalities.To enable cross-media retrieval,it is needed to find the correlation measures between heterogeneous low-level features and to judge the semantic similarity.This paper presents a novel approach to learn cross-media correlation between visual features and auditory features for image-audio retrieval.A semi-supervised correlation preserving mapping(SSCPM)method is described to construct the isomorphic SSCPM subspace where canonical correlations between the original visual and auditory features are further preserved.Subspace optimization algorithm is proposed to improve the local image cluster and audio cluster quality in an interactive way.A unique relevance feedback strategy is developed to update the knowledge of cross-media correlation by learning from user behaviors,so retrieval performance is enhanced in a progressive manner.Experimental results show that the performance of our approach is effective.
引用
收藏
页码:241 / 249
页数:9
相关论文
共 3 条
  • [1] Color image retrieval technique based on color features and image bitmap[J] . Tzu-Chuen Lu,Chin-Chen Chang.Information Processing and Management . 2006 (2)
  • [2] Robust and Rapid Generation of Animated Faces from Video Images: A Model-Based Modeling Approach[J] . Zhengyou Zhang,Zicheng Liu,Dennis Adler,Michael F. Cohen,Erik Hanson,Ying Shan.International Journal of Computer Vision . 2004 (2)
  • [3] Matching Words and Pictures. Kobus Barnard,Pinar Duygulu,Nando de Freitas,David Forsyth,David Blei,and Michael I. Jordan. Journal of Machine Learning Research .