Identification of story units in audio-visual sequences by joint audio and video processing

被引：0

作者：

Saraceno, C ^{[1
]}

Leonardi, R ^{[1
]}

机构：

[1] Univ Brescia, SCL Dept Elect Automat, I-25123 Brescia, Italy

来源：

1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1 | 1998年

关键词：

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, a novel technique, which uses a joint audio-visual analysis for scene identification and characterization, is proposed. The paper defines four different scene types: dialogues, stories, actions, and generic scenes. It then explains how any audio-visual material can be decomposed into a series of scenes obeying to the preview classification, by properly analyzing and then combining the underlying audio and visual information. A rule-based procedure is defined for such purpose. Before such rule-based decision can take place, a series of low-level pre-processing tasks care suggested to adequately measure audio and visual correlations. As far as visual information is concerned, it is proposed to measure similarities between non consecutive shots using a Learning Vector Quantization approach. An outlook on a possible implementation strategy for the overall scene identification task is suggested, and validated through a series of experimental simulations on real audio-visual data.

引用

页码：363 / 367

页数：5

共 50 条

[21] Integrating Audio-Visual Features and Text Information for Story Segmentation of News Video
Liu Hua-yong
[J]. Wuhan University Journal of Natural Sciences, 2003, (04) : 1070 - 1074
[22] Joint Audio-Visual Processing, Representation and Indexing of TV News Programmes
Zdansky, Jindrich
Chaloupka, Josef
Nouza, Jan
[J]. 2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 964 - 969
[23] Audio-visual speaker identification based on the use of dynamic audio and visual features
Fox, N
Reilly, RB
[J]. AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 743 - 751
[24] Advertising video as a kind of audio-visual production
Zarya, Svitlana
[J]. NATIONAL ACADEMY OF MANAGERIAL STAFF OF CULTURE AND ARTS HERALD, 2016, (02): : 94 - 98
[25] An audio-visual approach to web video categorization
Ionescu, Bogdan Emanuel
Seyerlehner, Klaus
Mironica, Ionut
Vertan, Constantin
Lambert, Patrick
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 70 (02) : 1007 - 1032
[26] Audio-visual Privacy Protection for Video Conference
Venkatesh, M. Vijay
Zhao, Jian
Profitt, Larry
Cheung, Sen-ching S.
[J]. ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 1574 - 1575
[27] Video concept detection by audio-visual grouplets
Wei Jiang
Alexander C. Loui
[J]. International Journal of Multimedia Information Retrieval, 2012, 1 (4) : 223 - 238
[28] VIDEO CODING BASED ON AUDIO-VISUAL ATTENTION
Lee, Jong-Seok
De Simone, Francesca
Ebrahimi, Touradj
[J]. ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 57 - 60
[29] Audio-Visual Emotion Recognition in Video Clips
Noroozi, Fatemeh
Marjanovic, Marina
Njegus, Angelina
Escalera, Sergio
Anbarjafari, Gholamreza
[J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2019, 10 (01) : 60 - 75
[30] A audio-visual model for efficient video summarization
El-Nagar, Gamal
El-Sawy, Ahmed
Rashad, Metwally
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 100

← 1 2 3 4 5 →