Audio, video and audio-visual signatures for short video clip detection:: Experiments on Trecvid2003

被引：0

作者：

Senechal, B ^{[1
]}

Pellerin, D ^{[1
]}

Besacier, L ^{[1
]}

Simand, I ^{[1
]}

Brès, S ^{[1
]}

机构：

[1] LIS, F-38031 Grenoble, France

来源：

2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2 | 2005年

关键词：

D O I：

10.1109/ICME.2005.1521400

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present the association of audio and video signatures for short video clip detection. First, we present an audio signature based on the spectral flatness measure. Then we describe a spatio-temporal video signature, based on the evolution of gray level centroids over time. The major contribution of this work is the association of these two signatures in a so-called audiovisual signature by late integration of similarity measures obtained on both modalities. Our experiments conducted on a large video database (28Gb / 34h extracted from TRECVID2003) show that our audio-visual signature is more robust than the audio-only or video-only signatures, and also permits better detection of video clips of shorter duration (about 2 seconds).

引用

页码：221 / 224

页数：4

共 50 条

[21] Audio-Visual Emotion Recognition in Video Clips
Noroozi, Fatemeh
Marjanovic, Marina
Njegus, Angelina
Escalera, Sergio
Anbarjafari, Gholamreza
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2019, 10 (01) : 60 - 75
[22] An audio-visual approach to web video categorization
Bogdan Emanuel Ionescu
Klaus Seyerlehner
Ionuţ Mironică
Constantin Vertan
Patrick Lambert
Multimedia Tools and Applications, 2014, 70 : 1007 - 1032
[23] Towards Audio-Visual Saliency Prediction for Omnidirectional Video with Spatial Audio
Chao, Fang-Yi
Ozcinar, Cagri
Zhang, Lu
Hamidouche, Wassim
Deforges, Olivier
Smolic, Aljosa
2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 355 - 358
[24] Perceptual Quality of Audio-Visual Content with Common Video and Audio Degradations
Becerra Martinez, Helard
Hines, Andrew
Farias, Mylene C. Q.
APPLIED SCIENCES-BASEL, 2021, 11 (13):
[25] Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
Feng, Chao
Chen, Ziyang
Owens, Andrew
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10491 - 10503
[26] Identification of story units in audio-visual sequences by joint audio and video processing
Saraceno, C
Leonardi, R
1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, 1998, : 363 - 367
[27] Combining text and audio-visual features in video indexing
Chang, SF
Manmatha, R
Chua, TS
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1005 - 1008
[28] Audio-visual interactive services and video on demand (VOD)
CSELT
CSELT Tech Rep, 2 (195-209):
[29] Toward Long Form Audio-Visual Video Understanding
Hou, Wenxuan
Li, Guangyao
Tian, Yapeng
Hu, Di
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (09)
[30] A NO-REFERENCE AUDIO-VISUAL VIDEO QUALITY METRIC
Martinez, Helard Becerra
Farias, Mylene C. Q.
2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 2125 - 2129

← 1 2 3 4 5 →