Automatic summarization of soccer highlights using audio-visual descriptors

Cited by: 23

Authors:

Raventos, A. [1]

Quijada, R. [1]

Torres, Luis [2]

Tarres, Francesc [1]

Affiliations:

[1] UPC BARCELONATECH, Signal Theory & Commun Dept, Castelldefels 08860, Spain

[2] UPC BARCELONATECH, Signal Theory & Commun Dept, Barcelona 08034, Spain

Source:

SpringerPlus | 2015 / Volume 4

Keywords:

Video summarization; Content analysis; Audiovisual descriptors; Multimedia feature extraction; Semantic detection; Multimodal processing and fusion; SHOT-BOUNDARY DETECTION; OF-THE-ART; RETRIEVAL;

DOI:

10.1186/s40064-015-1065-9

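Chinese Library Classification (CLC):

O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];

Subject classification codes: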

07 ; 0710 ; 09 ;

Abstract:

Automatic summarization of sports video content has been an object of great interest for many years. Although semantic description techniques have been proposed, many approaches still rely on low-level video descriptors that yield quite limited results, owing to the complexity of the problem and the limited ability of such descriptors to represent semantic content. In this paper, a new approach to the automatic generation of highlight summaries of soccer videos using audio-visual descriptors is presented. The approach is based on segmenting the video sequence into shots, which are then analyzed to determine their relevance and interest. Of special interest is the use of audio information, which provides additional robustness to the overall performance of the summarization system. For every video shot, a set of low- and mid-level audio-visual descriptors is computed and later combined to obtain different relevance measures based on empirical knowledge rules. The final summary is generated by selecting the shots with the highest interest according to the user's specifications and the results of the relevance measures. A variety of results on real soccer video sequences are presented that demonstrate the validity of the approach.
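The pipeline outlined in the abstract (score each shot, then keep the most relevant shots within the user's constraints) can be illustrated with a minimal sketch. This is not the authors' method: the descriptor names `audio_energy` and `motion`, the weights, and the greedy duration budget are all hypothetical stand-ins for the paper's descriptors and empirical knowledge rules.

```python
from dataclasses import dataclass


@dataclass
class Shot:
    start: float         # shot start time in seconds
    end: float           # shot end time in seconds
    audio_energy: float  # hypothetical audio descriptor, assumed in [0, 1]
    motion: float        # hypothetical visual descriptor, assumed in [0, 1]

    @property
    def duration(self) -> float:
        return self.end - self.start


def relevance(shot: Shot, w_audio: float = 0.6, w_motion: float = 0.4) -> float:
    # Assumed rule: a weighted sum of descriptors. The paper combines
    # descriptors with empirical rules that are not given in the abstract.
    return w_audio * shot.audio_energy + w_motion * shot.motion


def summarize(shots: list[Shot], max_duration: float) -> list[Shot]:
    # Greedily take the highest-scoring shots that still fit the budget,
    # then restore chronological order for playback.
    chosen: list[Shot] = []
    total = 0.0
    for shot in sorted(shots, key=relevance, reverse=True):
        if total + shot.duration <= max_duration:
            chosen.append(shot)
            total += shot.duration
    return sorted(chosen, key=lambda s: s.start)


shots = [
    Shot(0, 10, 0.1, 0.2),
    Shot(10, 20, 0.9, 0.8),
    Shot(20, 35, 0.5, 0.5),
    Shot(35, 40, 0.8, 0.9),
]
summary = summarize(shots, max_duration=15)
```

Here the user's specification is reduced to a single maximum summary length in seconds; a real system would likely expose more controls, such as per-event preferences.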

Pages: 19

