Audio visual cues for video indexing and retrieval

被引：0

作者：

Muneesawang, Paisarn ^{[1
]}

Amin, Tahir ^{[2
]}

Guan, Ling ^{[2
]}

机构：

[1] Dept. of Electrical and Computer Engineering, Naresuan University, Thailand

[2] Dept. of Electrical and Computer Engineering, Ryerson University, Toronto, Ont., Canada

来源：

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | 2004年 / 3331卷

关键词：

Image retrieval - Video recording;

D O I：

10.1007/978-3-540-30541-5_79

中图分类号：

学科分类号：

摘要：

This paper studies content-based video retrieval using the combination of audio and visual features. The visual feature is extracted by an adaptive video indexing technique that places a strong emphasis on accurate characterization of spatio-temporal information within video clips. Audio feature is extracted by a statistical time-frequency analysis method that applies Laplacian mixture models to wavelet coefficients. The proposed joint audio-visual retrieval framework is highly flexible and scalable, and can be effectively applied to various types of video databases. © Springer-Verlag Berlin Heidelberg 2004.

引用

页码：642 / 649

共 50 条

[1] Audio visual cues for video indexing and retrieval
Muneesawang, P
Amin, T
Guan, L
[J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 1, PROCEEDINGS, 2004, 3331 : 642 - 649
[2] MPEG-7 audio-visual indexing test-bed for video retrieval
Gagnon, L
Foucher, S
Gouaillier, V
Brun, C
Brousseau, J
Boulianne, G
Osterrath, F
Chapdelaine, C
Dutrisac, J
St-Onge, F
Champagne, B
Lu, X
[J]. INTERNET IMAGING V, 2004, 5304 : 319 - 329
[3] Semantic indexing of multimedia using audio, text and visual cues
Iyengar, G
Nock, H
Neti, C
Franz, M
[J]. IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : A369 - A372
[4] Indexing audio-visual sequences by joint audio and video processing
Saraceno, C
Leonardi, R
[J]. VSMM98: FUTUREFUSION - APPLICATION REALITIES FOR THE VIRTUAL AGE, VOLS 1 AND 2, 1998, : 686 - 691
[5] An Audio Indexing and Retrieval Approach using a Video Surveillance Ontology
Kazi Tani, Mohammed Yassine
Ghomari, Abdelghani
Dali Youcef, Lamia
Lablack, Adel
Bilasco, Ioan Marius
[J]. 2017 COMPUTING CONFERENCE, 2017, : 258 - 261
[6] Video Description Generation using Audio and Visual Cues
Jin, Qin
Liang, Junwei
[J]. ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 239 - 242
[7] Tennis video abstraction from audio and visual cues
Coldefy, F
Bouthemy, P
Betser, M
Gravier, G
[J]. 2004 IEEE 6TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2004, : 163 - 166
[8] Semantic Indexing of Multimedia Content Using Visual, Audio, and Text Cues
W. H. Adams
Giridharan Iyengar
Ching-Yung Lin
Milind Ramesh Naphade
Chalapathy Neti
Harriet J. Nock
John R. Smith
[J]. EURASIP Journal on Advances in Signal Processing, 2003
[9] Semantic indexing of multimedia content using visual, audio, and text cues
Adams, WH
Iyengar, G
Lin, CY
Naphade, MR
Neti, C
Nock, HJ
Smith, JR
[J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (02) : 170 - 185
[10] Semantic indexing of multimedia content using visual, audio, and text cues
[J]. Adams, W.H. (whadams@us.ibm.com), 1600, Hindawi Publishing Corporation (2003):

← 1 2 3 4 5 →