Understanding visual-auditory correlation from heterogeneous features for cross-media retrieval

Cited by: 0
Authors
Hong Zhang
Yan-yun Wang
Hong Pan
Fei Wu
Affiliations
[1] Wuhan University of Science and Technology, College of Computer Science and Technology
[2] Zhejiang University, School of Computer Science and Technology
[3] Hangzhou Normal University, School of Elementary Education
[4] Hangzhou Normal University, School of Information Engineering
Keywords
Heterogeneity; Cross-media retrieval; Subspace optimization; Dynamic correlation update
Document code
A
DOI
Not available
CLC number
TP37; TP391
Subject classification code
Abstract
Cross-media retrieval is an interesting research topic that seeks to remove the barriers among different modalities. Enabling it requires finding correlation measures between heterogeneous low-level features and judging semantic similarity. This paper presents a novel approach to learning the cross-media correlation between visual and auditory features for image-audio retrieval. A semi-supervised correlation preserving mapping (SSCPM) method is described for constructing an isomorphic SSCPM subspace in which the canonical correlations between the original visual and auditory features are further preserved. A subspace optimization algorithm is proposed to improve the quality of local image clusters and audio clusters interactively. A unique relevance feedback strategy is developed to update the knowledge of cross-media correlation by learning from user behavior, so that retrieval performance improves progressively. Experimental results show that the approach is effective.
Pages: 241 - 249
Page count: 8