The Importance of Audio Descriptors in Automatic Soccer Highlights Generation

被引：0

作者：

Raventos, Arnau ^{[1
]}

Quijada, Raul ^{[1
]}

Torres, Luis ^{[1
]}

Tarres, Francesc ^{[1
]}

Carasusan, Eusebio ^{[2
]}

Giribet, Daniel ^{[2
]}

机构：

[1] UPC Barcelona Tech, Signal Theory & Commun Dept, Castelldefels, Barcelona, Spain

[2] Televisio Catalunya, Esplugas de Llobregat, Barcelona, Spain

来源：

2014 11TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD) | 2014年

关键词：

video highlights; content analysis; audio descriptors; whistle detector; semantic detection; multi modal processing and fusion; FEATURES;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Automatic generation of sports highlights from recorded audiovisual content has been object of great interest in recent years. The problem is indeed important in the production of second and third division leagues highlights videos where the quantity of raw material is significant and does not contain manual annotations. Many approaches are mostly based on the analysis of the video and disregard the important information provided by the audio track. In this paper, a new approach that combines audio and video descriptors for automatic soccer highlights generation is proposed. The approach is based on the segmentation of the video contents into shots that are further analyzed in order to determine its relevance and interest. These video-shots are scored taking into account the fusion between different audio and video features. The paper is mainly focused to emphasize the importance of audio detectors that play a key role in the analysis and scoring of the video-shots. Specifically, a new algorithm for referee's whistle detection is proposed. The algorithm has been proven to be very robust and efficiently discriminates professional whistles against other types of noises such as public cheering-up, music instruments, etc. Several results have been produced using real soccer video sequences that prove the validity of the proposed audio and video fusion scheme.

引用

页数：6

共 50 条

[31] Automatic Audio Description Generation System for Olympics" b
Yamada I.
Kumano T.
Sato S.
Miyazaki T.
Imai A.
Seiyama N.
1600, Inst. of Image Information and Television Engineers (71): : 55 - 56
[32] Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework
Xiong, ZY
Radhakrishnan, R
Divakaran, A
Huang, TS
2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 401 - 404
[33] Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework
Xiong, ZY
Radhakrishnan, R
Divakaran, A
Huang, TS
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 632 - 635
[34] Towards Automatic Code Generation for Robotic Soccer Behavior Simulation
Sales, Raoni
Mascarenhas, Ana Patricia Fontes Magalhaes
Simoes, Marco A. C.
Rodrigues de Souza, Josemar
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (01)
[35] Towards Automatic Code Generation for Robotic Soccer Behavior Simulation
Raoni Sales
Ana Patrícia Fontes Magalhães Mascarenhas
Marco A. C. Simões
Josemar Rodrigues de Souza
Journal of Intelligent & Robotic Systems, 2024, 110
[36] Automatic Excitement-Level Detection for Sports Highlights Generation
Boril, Hynek
Sangwan, Abhijeet
Hasan, Taufiq
Hansen, John H. L.
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2202 - 2205
[37] An Efficient Framework for Automatic Highlights Generation from Sports Videos
Javed, Ali
Bajwa, Khalid Bashir
Malik, Hafiz
Irtaza, Aun
IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (07) : 954 - 958
[38] Generation of sports highlights using a combination of supervised & unsupervised learning in audio domain
Radhakrishan, R
Xiong, ZY
Divakaran, A
Ishikawa, Y
ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 935 - 939
[39] Audible Panorama: Automatic Spatial Audio Generation for Panorama Imagery
Huang, Haikun
Solah, Michael
Li, Dingzeyu
Yu, Lap-Fai
CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
[40] Automatic Ontology Generation for Musical Instruments Based on Audio Analysis
Kolozali, Sefki
Barthet, Mathieu
Fazekas, Gyoergy
Sandler, Mark
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (10): : 1 - 14

← 1 2 3 4 5 →