Audio, video and audio-visual signatures for short video clip detection: Experiments on TRECVID2003

Authors
Senechal, B [1]
Pellerin, D [1]
Besacier, L [1]
Simand, I [1]
Brès, S [1]
Affiliations
[1] LIS, F-38031 Grenoble, France
DOI
10.1109/ICME.2005.1521400
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we present the association of audio and video signatures for short video clip detection. First, we present an audio signature based on the spectral flatness measure. Then we describe a spatio-temporal video signature, based on the evolution of gray level centroids over time. The major contribution of this work is the association of these two signatures in a so-called audiovisual signature by late integration of similarity measures obtained on both modalities. Our experiments conducted on a large video database (28Gb / 34h extracted from TRECVID2003) show that our audio-visual signature is more robust than the audio-only or video-only signatures, and also permits better detection of video clips of shorter duration (about 2 seconds).
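The three ingredients named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the record gives no formulas, so the code uses the textbook definition of spectral flatness (geometric mean over arithmetic mean of a power spectrum), an intensity-weighted centroid for a single gray-level frame, and a weighted sum as one possible form of late integration. The function names, the `audio_weight` parameter and its default of 0.5 are assumptions made for illustration only.

```python
import math


def spectral_flatness(power_spectrum):
    """Textbook spectral flatness: geometric mean / arithmetic mean.

    Assumes every value in power_spectrum is strictly positive.
    """
    n = len(power_spectrum)
    log_mean = sum(math.log(p) for p in power_spectrum) / n
    arithmetic_mean = sum(power_spectrum) / n
    return math.exp(log_mean) / arithmetic_mean


def gray_centroid(frame):
    """Intensity-weighted centroid (row, col) of a 2-D gray-level frame.

    frame is a list of equally long rows of non-negative gray levels.
    An all-black frame falls back to the geometric center (an assumption).
    """
    total = sum(sum(row) for row in frame)
    if total == 0:
        return ((len(frame) - 1) / 2, (len(frame[0]) - 1) / 2)
    row_pos = sum(i * sum(row) for i, row in enumerate(frame)) / total
    col_pos = sum(j * v for row in frame for j, v in enumerate(row)) / total
    return (row_pos, col_pos)


def late_fusion(audio_sim, video_sim, audio_weight=0.5):
    """Hypothetical late integration: weighted sum of two similarity scores."""
    return audio_weight * audio_sim + (1 - audio_weight) * video_sim
```

A flat spectrum yields a flatness of 1, while a spectrum dominated by a single peak yields a value close to 0. Tracking `gray_centroid` frame by frame gives a trajectory over time, which is the kind of spatio-temporal evolution the abstract refers to.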
Pages: 221-224
Page count: 4