Semantic indexing of multimedia content using visual, audio, and text cues

Authors
[1] Adams, W.H.
[2] Iyengar, Giridharan
[3] Lin, Ching-Yung
[4] Naphade, Milind Ramesh
[5] Neti, Chalapathy
[6] Nock, Harriet J.
[7] Smith, John R.
Source
Adams, W.H. (whadams@us.ibm.com) | Hindawi Publishing Corporation, 2003
Keywords
Information analysis - Learning systems - Markov processes - Semantics - Statistical methods
DOI
Not available
Abstract
We present a learning-based approach to the semantic indexing of multimedia content using cues derived from audio, visual, and text features. We approach the problem by developing a set of statistical models for a predefined lexicon. Novel concepts are then mapped in terms of the concepts in the lexicon. To achieve robust detection of concepts, we exploit features from multiple modalities, namely, audio, video, and text. Concept representations are modeled using Gaussian mixture models (GMM), hidden Markov models (HMM), and support vector machines (SVM). Models such as Bayesian networks and SVMs are used in a late-fusion approach to model concepts that are not explicitly modeled in terms of features. Our experiments indicate promise in the proposed classification and fusion methodologies: our proposed fusion scheme achieves more than 10% relative improvement over the best unimodal concept detector.
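The late-fusion idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract names Bayesian networks and SVMs as fusion models, whereas this sketch swaps in a small logistic-regression combiner so that it runs with the standard library only. The scores, labels, and all function names are hypothetical toy values, and accuracy is used purely for illustration; the figures it produces say nothing about the paper's results.

```python
import math

# Hypothetical toy data (not from the paper): each row holds the confidence
# of an audio-based and a visual-based detector for one video shot.
SCORES = [
    (0.9, 0.4), (0.4, 0.9), (0.8, 0.8), (0.7, 0.6),  # shots containing the concept
    (0.6, 0.1), (0.1, 0.6), (0.2, 0.2), (0.3, 0.4),  # shots without it
]
LABELS = [1, 1, 1, 1, 0, 0, 0, 0]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_fusion(scores, labels, epochs=5000, lr=1.0):
    """Fit logistic-regression weights over unimodal scores by batch gradient descent."""
    dim = len(scores[0])
    w, b = [0.0] * dim, 0.0
    n = len(scores)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * dim, 0.0
        for x, y in zip(scores, labels):
            err = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b) - y
            grad_w = [g + err * xi for g, xi in zip(grad_w, x)]
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def accuracy(preds, labels):
    return sum(int(p == y) for p, y in zip(preds, labels)) / len(labels)


def best_threshold_accuracy(values, labels):
    """Best accuracy one detector reaches with any threshold on its own score."""
    return max(accuracy([int(v >= t) for v in values], labels) for t in values)


# Unimodal baselines: threshold each modality's score on its own.
audio_acc = best_threshold_accuracy([s[0] for s in SCORES], LABELS)
visual_acc = best_threshold_accuracy([s[1] for s in SCORES], LABELS)
best_unimodal_acc = max(audio_acc, visual_acc)

# Late fusion: learn how to combine the detector outputs, then classify.
weights, bias = train_fusion(SCORES, LABELS)
fused_preds = [
    int(sum(wi * xi for wi, xi in zip(weights, x)) + bias > 0) for x in SCORES
]
fused_acc = accuracy(fused_preds, LABELS)

# Relative improvement of the fused detector over the best unimodal one.
relative_gain = (fused_acc - best_unimodal_acc) / best_unimodal_acc
print(f"audio={audio_acc:.3f} visual={visual_acc:.3f} fused={fused_acc:.3f}")
print(f"relative improvement: {relative_gain:.1%}")
```

The toy scores are chosen so that neither modality separates the two classes on its own while their combination does, which is the situation in which fusing detector outputs helps. The combiner is trained and evaluated on the same eight shots, so the numbers only demonstrate the mechanics.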
Related papers
50 in total
  • [21] Semantic indexing of multimedia documents
    Leonardi, R
    Migliorati, P
    IEEE MULTIMEDIA, 2002, 9 (02) : 44 - 51
  • [22] A semantic indexing approach of multimedia documents content based partial transcription
    Bendib, Issam
    Laouar, Mohammed Ridda
    2018 2ND INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE AND SPEECH PROCESSING (ICNLSP), 2018, : 136 - 141
  • [23] Audio-visual content analysis for content-based video indexing
    Tsekeridou, S
    Pitas, I
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 1, 1999, : 667 - 672
  • [24] Audio-visual content analysis for content-based video indexing
    Tsekeridou, Sofia
    Pitas, Ioannis
    International Conference on Multimedia Computing and Systems -Proceedings, 1999, 1 : 667 - 672
  • [25] Toward semantic indexing and retrieval using hierarchical audio models
    Wei-Ta Chu
    Wen-Huang Cheng
    Jane Yung-Jen Hsu
    Ja-Ling Wu
    Multimedia Systems, 2005, 10 : 570 - 583
  • [26] Toward semantic indexing and retrieval using hierarchical audio models
    Chu, WT
    Cheng, WH
    Hsu, JYJ
    Wu, JL
    MULTIMEDIA SYSTEMS, 2005, 10 (06) : 570 - 583
  • [27] Indexing audio documents by using latent semantic analysis and SOM
    Kurimo, M
    KOHONEN MAPS, 1999, : 363 - 374
  • [28] The semantic pathfinder: Using an authoring metaphor for generic multimedia indexing
    Snoek, Cees G. M.
    Worring, Marcel
    Geusebroek, Jan-Mark
    Koelma, Dennis C.
    Seinstra, Frank J.
    Smeulders, Arnold W. M.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (10) : 1678 - 1689
  • [29] Conceptual feedback for semantic multimedia indexing
    Hamadi, Abdelkader
    Mulhem, Philippe
    Quenot, Georges
    2013 11TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI 2013), 2013, : 53 - 58
  • [30] Detection and classification of vehicles using audio visual cues
    Prasad, S. Anuja
    Mary, Leena
    Koshy, Bino I.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (28) : 44087 - 44106