Audio, video and audio-visual signatures for short video clip detection: Experiments on Trecvid2003

Cited by: 0
Authors
Senechal, B [1]
Pellerin, D [1]
Besacier, L [1]
Simand, I [1]
Brès, S [1]
Affiliation
[1] LIS, F-38031 Grenoble, France
DOI
10.1109/ICME.2005.1521400
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we present the association of audio and video signatures for short video clip detection. First, we present an audio signature based on the spectral flatness measure. Then we describe a spatio-temporal video signature, based on the evolution of gray level centroids over time. The major contribution of this work is the association of these two signatures in a so-called audiovisual signature by late integration of similarity measures obtained on both modalities. Our experiments conducted on a large video database (28Gb / 34h extracted from TRECVID2003) show that our audio-visual signature is more robust than the audio-only or video-only signatures, and also permits better detection of video clips of shorter duration (about 2 seconds).
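The three ingredients named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the framing, the similarity measures, or the fusion rule, so the function names, the weighted-average fusion, and the weight `w_audio` are all assumptions made for illustration. Spectral flatness is computed with its standard definition (geometric mean over arithmetic mean of a power spectrum), and the centroid is the intensity-weighted mean pixel position of one gray-level frame.

```python
import math


def spectral_flatness(power):
    """Geometric mean divided by arithmetic mean of a power spectrum.

    `power` is a sequence of strictly positive power values for one
    audio frame. The result is 1.0 for a flat spectrum and approaches 0
    for a strongly peaked (tonal) one.
    """
    n = len(power)
    log_mean = sum(math.log(p) for p in power) / n
    return math.exp(log_mean) / (sum(power) / n)


def gray_centroid(frame):
    """Intensity-weighted (row, col) centroid of a 2D gray-level frame.

    `frame` is a list of equal-length rows of non-negative gray levels.
    Tracking this point from frame to frame gives a simple
    spatio-temporal trajectory.
    """
    total = sum(sum(row) for row in frame)
    if total == 0:
        # All-black frame: fall back to the geometric centre (assumption).
        return ((len(frame) - 1) / 2, (len(frame[0]) - 1) / 2)
    r = sum(i * v for i, row in enumerate(frame) for v in row) / total
    c = sum(j * v for row in frame for j, v in enumerate(row)) / total
    return (r, c)


def late_fusion(audio_sim, video_sim, w_audio=0.5):
    """Combine per-modality similarity scores after they are computed.

    A weighted average is one common late-integration rule; it is a
    hypothetical choice here, not the rule reported in the paper.
    """
    return w_audio * audio_sim + (1.0 - w_audio) * video_sim
```

In this sketch each modality is scored independently and only the scores are merged, which is what distinguishes late integration from concatenating audio and video features before matching.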
Pages: 221-224
Page count: 4
Related papers
50 in total
  • [41] Audio and Video Channel Impact on Perceived Audio-visual Quality in Different Interactive Contexts
    Belmudez, Benjamin
    Moeller, Sebastian
    Lewcio, Blazej
    Raake, Alexander
    Mehmood, Amir
    2009 IEEE INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2009), 2009, : 256 - 260
  • [42] Robust Audio-Visual Speech Recognition Under Noisy Audio-Video Conditions
    Stewart, Darryl
    Seymour, Rowan
    Pass, Adrian
    Ming, Ji
    IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (02) : 175 - 184
  • [43] Audio-Visual Art Performance System Using Computer Video Output Based on Converting Component Video Signal to Audio
    Ito, Yuichi
    Stone, Carl
    Yamada, Masashi
    Miyazaki, Shinya
    2013 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2013, : 356 - 363
  • [44] Creating motion video summaries with partial audio-visual alignment
    Gong, YH
    Liu, X
    Hua, W
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : 285 - 288
  • [45] Toward a perceptive pretraining framework for Audio-Visual Video Parsing
    Wu, Jianning
    Jiang, Zhuqing
    Chen, Qingchao
    Wen, Shiping
    Men, Aidong
    Wang, Haiying
    INFORMATION SCIENCES, 2022, 609 : 897 - 912
  • [46] Audio-Visual Autoencoding for Privacy-Preserving Video Streaming
    Xu, Honghui
    Cai, Zhipeng
    Takabi, Daniel
    Li, Wei
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (03) : 1749 - 1761
  • [47] Full-reference audio-visual video quality metric
    Martinez, Helard Becerra
    Farias, Mylene C. Q.
    JOURNAL OF ELECTRONIC IMAGING, 2014, 23 (06)
  • [48] Video diaries: audio-visual research methods and the elusive body
    Bates, Charlotte
    VISUAL STUDIES, 2013, 28 (01) : 29 - 37
  • [49] Efficient video coding based on audio-visual focus of attention
    Lee, Jong-Seok
    De Simone, Francesca
    Ebrahimi, Touradj
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2011, 22 (08) : 704 - 711
  • [50] Video genre categorization and representation using audio-visual information
    Ionescu, Bogdan
    Seyerlehner, Klaus
    Rasche, Christoph
    Vertan, Constantin
    Lambert, Patrick
    JOURNAL OF ELECTRONIC IMAGING, 2012, 21 (02)