Audio, video and audio-visual signatures for short video clip detection:: Experiments on Trecvid2003

被引：0

作者：

Senechal, B ^{[1
]}

Pellerin, D ^{[1
]}

Besacier, L ^{[1
]}

Simand, I ^{[1
]}

Brès, S ^{[1
]}

机构：

[1] LIS, F-38031 Grenoble, France

来源：

2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2 | 2005年

关键词：

D O I：

10.1109/ICME.2005.1521400

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present the association of audio and video signatures for short video clip detection. First, we present an audio signature based on the spectral flatness measure. Then we describe a spatio-temporal video signature, based on the evolution of gray level centroids over time. The major contribution of this work is the association of these two signatures in a so-called audiovisual signature by late integration of similarity measures obtained on both modalities. Our experiments conducted on a large video database (28Gb / 34h extracted from TRECVID2003) show that our audio-visual signature is more robust than the audio-only or video-only signatures, and also permits better detection of video clips of shorter duration (about 2 seconds).

引用

页码：221 / 224

页数：4

共 50 条

[41] Audio and Video Channel Impact on Perceived Audio-visual Quality in Different Interactive Contexts
Belmudez, Benjamin
Moeller, Sebastian
Lewcio, Blazej
Raake, Alexander
Mehmood, Amir
2009 IEEE INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2009), 2009, : 256 - 260
[42] Robust Audio-Visual Speech Recognition Under Noisy Audio-Video Conditions
Stewart, Darryl
Seymour, Rowan
Pass, Adrian
Ming, Ji
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (02) : 175 - 184
[43] Audio-Visual Art Performance System Using Computer Video Output Based on Converting Component Video Signal to Audio
Ito, Yuichi
Stone, Carl
Yamada, Masashi
Miyazaki, Shinya
2013 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2013, : 356 - 363
[44] Creating motion video summaries with partial audio-visual alignment
Gong, YH
Liu, X
Hua, W
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : 285 - 288
[45] Toward a perceptive pretraining framework for Audio-Visual Video Parsing
Wu, Jianning
Jiang, Zhuqing
Chen, Qingchao
Wen, Shiping
Men, Aidong
Wang, Haiying
INFORMATION SCIENCES, 2022, 609 : 897 - 912
[46] Audio-Visual Autoencoding for Privacy-Preserving Video Streaming
Xu, Honghui
Cai, Zhipeng
Takabi, Daniel
Li, Wei
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (03): : 1749 - 1761
[47] Full-reference audio-visual video quality metric
Martinez, Helard Becerra
Fariasa, Mylene C. Q.
JOURNAL OF ELECTRONIC IMAGING, 2014, 23 (06)
[48] Video diaries: audio-visual research methods and the elusive body
Bates, Charlotte
VISUAL STUDIES, 2013, 28 (01) : 29 - 37
[49] Efficient video coding based on audio-visual focus of attention
Lee, Jong-Seok
De Simone, Francesca
Ebrahimi, Touradj
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2011, 22 (08) : 704 - 711
[50] Video genre categorization and representation using audio-visual information
Ionescu, Bogdan
Seyerlehner, Klaus
Rasche, Christoph
Vertan, Constantin
Lambert, Patrick
JOURNAL OF ELECTRONIC IMAGING, 2012, 21 (02)

← 1 2 3 4 5 →