Semantic indexing of multimedia content using visual, audio, and text cues

被引：0

作者：

机构：

[1] Adams, W.H.

[2] Iyengar, Giridharan

[3] Lin, Ching-Yung

[4] Naphade, Milind Ramesh

[5] Neti, Chalapathy

[6] Nock, Harriet J.

[7] Smith, John R.

来源：

Adams, W.H. (whadams@us.ibm.com) | 1600年 / Hindawi Publishing Corporation卷 / 2003期

关键词：

Information analysis - Learning systems - Markov processes - Semantics - Statistical methods;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

We present a learning-based approach to the semantic indexing of multimedia content using cues derived from audio, visual, and text features. We approach the problem by developing a set of statistical models for a predefined lexicon. Novel concepts are then mapped in terms of the concepts in the lexicon. To achieve robust detection of concepts, we exploit features from multiple modalities, namely, audio, video, and text, Concept representations are modeled using Gaussian mixture models (GMM), hidden Markov models (HMM), and support vector machines (SVM), Models such as Bayesian networks and SVMs are used in a late-fusion approach to model concepts that are not explicitly modeled in terms of features. Our experiments indicate promise in the proposed classification and fusion methodologies: our proposed fusion scheme achieves more than 10% relative improvement over the best unimodal concept detector.

引用

下载

共 50 条

[21] Semantic indexing of multimedia documents
Leonardi, R
Migliorati, P
IEEE MULTIMEDIA, 2002, 9 (02) : 44 - 51
[22] A semantic indexing approach of multimedia documents content based partial transcription
Bendib, Issam
Laouar, Mohammed Ridda
2018 2ND INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE AND SPEECH PROCESSING (ICNLSP), 2018, : 136 - 141
[23] Audio-visual content analysis for content-based video indexing
Tsekeridou, S
Pitas, I
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 1, 1999, : 667 - 672
[24] Audio-visual content analysis for content-based video indexing
Tsekeridou, Sofia
Pitas, Ioannis
International Conference on Multimedia Computing and Systems -Proceedings, 1999, 1 : 667 - 672
[25] Toward semantic indexing and retrieval using hierarchical audio models
Wei-Ta Chu
Wen-Huang Cheng
Jane Yung-Jen Hsu
Ja-Ling Wu
Multimedia Systems, 2005, 10 : 570 - 583
[26] Toward semantic indexing and retrieval using hierarchical audio models
Chu, WT
Cheng, WH
Hsu, JYJ
Wu, JL
MULTIMEDIA SYSTEMS, 2005, 10 (06) : 570 - 583
[27] Indexing audio documents by using latent semantic analysis and SOM
Kurimo, M
KOHONEN MAPS, 1999, : 363 - 374
[28] The semantic pathfinder: Using an authoring metaphor for generic multimedia indexing
Snoek, Cees G. M.
Worring, Marcel
Geusebroek, Jan-Mark
Koelma, Dennis C.
Seinstra, Frank J.
Smeulders, Arnold W. M.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (10) : 1678 - 1689
[29] Conceptual feedback for semantic multimedia indexing
Hamadi, Abdelkader
Mulhem, Philippe
Quenot, Georges
2013 11TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI 2013), 2013, : 53 - 58
[30] Detection and classification of vehicles using audio visual cues
Prasad, S. Anuja
Mary, Leena
Koshy, Bino I.
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (28) : 44087 - 44106

← 1 2 3 4 5 →