Nonnegative cross-media recoding of visual-auditory content for social media analysis

被引:0
|
作者
Hong Zhang
Xin Xu
机构
[1] Wuhan University of Science & Technology,College of Computer Science & Technology
[2] Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System,State Key Laboratory of Software Engineering
[3] Wuhan University,undefined
来源
关键词
Cross-media; Subspace learning; Distance metric; Data clustering;
D O I
暂无
中图分类号
学科分类号
摘要
Cross-media semantics understanding, which focuses on multimedia data of different modalities, is a rising hot topic in social media analysis. One of the most challenging issues for cross-media semantics understanding is how to represent multimedia data of different modalities. Most traditional multimedia semantics analysis works are based on single modality data sources, such as Flickr images or YouTube videos, leaving efficient cross-media data representation wide open. In this paper, we propose a novel nonnegative cross-media recoding approach, which learns co-occurrences of cross-media feature spaces by explicitly learning a common subset of basis vectors. Moreover, we impose the nonnegativity constraint on the decomposed matrices so that the basis vectors represent important and locally meaningful features of the cross-media data. We take two kinds of typical multimedia data, that is, image and audio, as experimental data. Our approach can be applied to a wide range of multimedia applications. The experiments are conducted on image-audio dataset for applications of cross-media retrieval and data clustering. Experiment results are encouraging and show that the performance of our approach is effective.
引用
收藏
页码:577 / 593
页数:16
相关论文
共 50 条
  • [1] Nonnegative cross-media recoding of visual-auditory content for social media analysis
    Zhang, Hong
    Xu, Xin
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (02) : 577 - 593
  • [2] Boosting Cross-media Retrieval via Visual-Auditory Feature Analysis and Relevance Feedback
    Zhang, Hong
    Yuan, Junsong
    Gao, Xingyu
    Chen, Zhenyu
    [J]. PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 953 - 956
  • [4] Understanding visual-auditory correlation from heterogeneous features for cross-media retrieval
    Zhang, Hong
    Wang, Yan-yun
    Pan, Hong
    Wu, Fei
    [J]. JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A, 2008, 9 (02): : 241 - 249
  • [5] Understanding visual-auditory correlation from heterogeneous features for cross-media retrieval
    Hong Zhang
    Yan-yun Wang
    Hong Pan
    Fei Wu
    [J]. Journal of Zhejiang University SCIENCE A, 2008, 9 : 241 - 249
  • [6] Bridging the gap between visual and auditory feature spaces for cross-media retrieval
    Hong Zhang
    Fei Wu
    [J]. ADVANCES IN MULTIMEDIA MODELING, PT 1, 2007, 4351 : 596 - 605
  • [7] An analysis of cross-media transfers
    Phillips, JB
    Hindawi, MA
    Phillips, A
    Bailey, RV
    [J]. POLLUTION ENGINEERING, 1997, 29 (06) : 41 - +
  • [8] Knowledge-assisted cross-media analysis of audio-visual content in the news domain
    Mezaris, Vasileios
    Gidaros, Spyros
    Papadopoulos, Georgios Th.
    Kasper, Walter
    Ordelman, Roeland
    de Jong, Franciska
    Kompatsiaris, Ioannis
    [J]. 2008 INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING, 2008, : 264 - +
  • [9] Cross-media publishing: Obey the content master
    Rosenblatt, B
    [J]. ECONTENT, 2001, 24 (05) : 44 - 47
  • [10] Study of Cross-Media Topic Analysis Based on Visual Topic Model
    Zhou, Yipeng
    Liang, Meiyu
    Du, Junping
    [J]. PROCEEDINGS OF THE 2012 24TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2012, : 3467 - 3470