Audio, video and audio-visual signatures for short video clip detection:: Experiments on Trecvid2003

被引:0
|
作者
Senechal, B [1 ]
Pellerin, D [1 ]
Besacier, L [1 ]
Simand, I [1 ]
Brès, S [1 ]
机构
[1] LIS, F-38031 Grenoble, France
关键词
D O I
10.1109/ICME.2005.1521400
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present the association of audio and video signatures for short video clip detection. First, we present an audio signature based on the spectral flatness measure. Then we describe a spatio-temporal video signature, based on the evolution of gray level centroids over time. The major contribution of this work is the association of these two signatures in a so-called audiovisual signature by late integration of similarity measures obtained on both modalities. Our experiments conducted on a large video database (28Gb / 34h extracted from TRECVID2003) show that our audio-visual signature is more robust than the audio-only or video-only signatures, and also permits better detection of video clips of shorter duration (about 2 seconds).
引用
收藏
页码:221 / 224
页数:4
相关论文
共 50 条
  • [31] VIDEO CAMERA IDENTIFICATION USING AUDIO-VISUAL FEATURES
    Milani, S.
    Cuccovillo, L.
    Tagliasacchi, M.
    Tubaro, S.
    Aichroth, P.
    2014 5TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP 2014), 2014,
  • [32] Audio-Visual Atoms for Generic Video Concept Classification
    Jiang, Wei
    Cotton, Courtenay
    Chang, Shih-Fu
    Ellis, Dan
    Loui, Alexander C.
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2010, 6 (03)
  • [33] Audio-visual speaker recognition for video broadcast news
    Maison, B
    Neti, C
    Senior, A
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2001, 29 (1-2): : 71 - 79
  • [34] Audio-Visual Glance Network for Efficient Video Recognition
    Nugroho, Muhammad Adi
    Woo, Sangmin
    Lee, Sumin
    Kim, Changick
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10116 - 10125
  • [35] AVscript: Accessible Video Editing with Audio-Visual Scripts
    Huh, Mina
    Yang, Saelyne
    Peng, Yi-Hao
    Chen, Xiang 'Anthony'
    Kim, Young-Ho
    Pavel, Amy
    PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2023), 2023,
  • [36] Spotting Audio-Visual Inconsistencies (SAVI) in Manipulated Video
    Bolles, Robert
    Burns, J. Brian
    Graciarena, Martin
    Kathol, Andreas
    Lawson, Aaron
    McLaren, Mitchell
    Mensink, Thomas
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1907 - 1914
  • [37] Audio-Visual Speaker Recognition for Video Broadcast News
    Benoît Maison
    Chalapathy Neti
    Andrew Senior
    Journal of VLSI signal processing systems for signal, image and video technology, 2001, 29 : 71 - 79
  • [38] Audio-visual event recognition in surveillance video sequences
    Cristani, Marco
    Bicego, Manuele
    Murino, Vittorio
    IEEE TRANSACTIONS ON MULTIMEDIA, 2007, 9 (02) : 257 - 267
  • [39] Detection of Violent Video with Audio-Visual Features based on MPEG-7
    Peng, Yuqing
    Liu, Wei
    Zhao, Cuicui
    Li, Tiejun
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 1002 - +
  • [40] Audio and Video Signatures for Synchronization
    Radhakrishnan, Regunathan
    Terry, Kent
    Bauer, Claus
    2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 1549 - 1552