Audio visual cues for video indexing and retrieval

被引:0
|
作者
Muneesawang, Paisarn [1 ]
Amin, Tahir [2 ]
Guan, Ling [2 ]
机构
[1] Dept. of Electrical and Computer Engineering, Naresuan University, Thailand
[2] Dept. of Electrical and Computer Engineering, Ryerson University, Toronto, Ont., Canada
关键词
Image retrieval - Video recording;
D O I
10.1007/978-3-540-30541-5_79
中图分类号
学科分类号
摘要
This paper studies content-based video retrieval using the combination of audio and visual features. The visual feature is extracted by an adaptive video indexing technique that places a strong emphasis on accurate characterization of spatio-temporal information within video clips. Audio feature is extracted by a statistical time-frequency analysis method that applies Laplacian mixture models to wavelet coefficients. The proposed joint audio-visual retrieval framework is highly flexible and scalable, and can be effectively applied to various types of video databases. © Springer-Verlag Berlin Heidelberg 2004.
引用
收藏
页码:642 / 649
相关论文
共 50 条
  • [1] Audio visual cues for video indexing and retrieval
    Muneesawang, P
    Amin, T
    Guan, L
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 1, PROCEEDINGS, 2004, 3331 : 642 - 649
  • [2] MPEG-7 audio-visual indexing test-bed for video retrieval
    Gagnon, L
    Foucher, S
    Gouaillier, V
    Brun, C
    Brousseau, J
    Boulianne, G
    Osterrath, F
    Chapdelaine, C
    Dutrisac, J
    St-Onge, F
    Champagne, B
    Lu, X
    [J]. INTERNET IMAGING V, 2004, 5304 : 319 - 329
  • [3] Semantic indexing of multimedia using audio, text and visual cues
    Iyengar, G
    Nock, H
    Neti, C
    Franz, M
    [J]. IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : A369 - A372
  • [4] Indexing audio-visual sequences by joint audio and video processing
    Saraceno, C
    Leonardi, R
    [J]. VSMM98: FUTUREFUSION - APPLICATION REALITIES FOR THE VIRTUAL AGE, VOLS 1 AND 2, 1998, : 686 - 691
  • [5] An Audio Indexing and Retrieval Approach using a Video Surveillance Ontology
    Kazi Tani, Mohammed Yassine
    Ghomari, Abdelghani
    Dali Youcef, Lamia
    Lablack, Adel
    Bilasco, Ioan Marius
    [J]. 2017 COMPUTING CONFERENCE, 2017, : 258 - 261
  • [6] Video Description Generation using Audio and Visual Cues
    Jin, Qin
    Liang, Junwei
    [J]. ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 239 - 242
  • [7] Tennis video abstraction from audio and visual cues
    Coldefy, F
    Bouthemy, P
    Betser, M
    Gravier, G
    [J]. 2004 IEEE 6TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2004, : 163 - 166
  • [8] Semantic Indexing of Multimedia Content Using Visual, Audio, and Text Cues
    W. H. Adams
    Giridharan Iyengar
    Ching-Yung Lin
    Milind Ramesh Naphade
    Chalapathy Neti
    Harriet J. Nock
    John R. Smith
    [J]. EURASIP Journal on Advances in Signal Processing, 2003
  • [9] Semantic indexing of multimedia content using visual, audio, and text cues
    Adams, WH
    Iyengar, G
    Lin, CY
    Naphade, MR
    Neti, C
    Nock, HJ
    Smith, JR
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (02) : 170 - 185
  • [10] Semantic indexing of multimedia content using visual, audio, and text cues
    [J]. Adams, W.H. (whadams@us.ibm.com), 1600, Hindawi Publishing Corporation (2003):