The Importance of Audio Descriptors in Automatic Soccer Highlights Generation

被引:0
|
作者
Raventos, Arnau [1 ]
Quijada, Raul [1 ]
Torres, Luis [1 ]
Tarres, Francesc [1 ]
Carasusan, Eusebio [2 ]
Giribet, Daniel [2 ]
机构
[1] UPC Barcelona Tech, Signal Theory & Commun Dept, Castelldefels, Barcelona, Spain
[2] Televisio Catalunya, Esplugas de Llobregat, Barcelona, Spain
关键词
video highlights; content analysis; audio descriptors; whistle detector; semantic detection; multi modal processing and fusion; FEATURES;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Automatic generation of sports highlights from recorded audiovisual content has been object of great interest in recent years. The problem is indeed important in the production of second and third division leagues highlights videos where the quantity of raw material is significant and does not contain manual annotations. Many approaches are mostly based on the analysis of the video and disregard the important information provided by the audio track. In this paper, a new approach that combines audio and video descriptors for automatic soccer highlights generation is proposed. The approach is based on the segmentation of the video contents into shots that are further analyzed in order to determine its relevance and interest. These video-shots are scored taking into account the fusion between different audio and video features. The paper is mainly focused to emphasize the importance of audio detectors that play a key role in the analysis and scoring of the video-shots. Specifically, a new algorithm for referee's whistle detection is proposed. The algorithm has been proven to be very robust and efficiently discriminates professional whistles against other types of noises such as public cheering-up, music instruments, etc. Several results have been produced using real soccer video sequences that prove the validity of the proposed audio and video fusion scheme.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Automatic Audio Description Generation System for Olympics" b
    Yamada I.
    Kumano T.
    Sato S.
    Miyazaki T.
    Imai A.
    Seiyama N.
    1600, Inst. of Image Information and Television Engineers (71): : 55 - 56
  • [32] Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework
    Xiong, ZY
    Radhakrishnan, R
    Divakaran, A
    Huang, TS
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 401 - 404
  • [33] Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework
    Xiong, ZY
    Radhakrishnan, R
    Divakaran, A
    Huang, TS
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 632 - 635
  • [34] Towards Automatic Code Generation for Robotic Soccer Behavior Simulation
    Sales, Raoni
    Mascarenhas, Ana Patricia Fontes Magalhaes
    Simoes, Marco A. C.
    Rodrigues de Souza, Josemar
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (01)
  • [35] Towards Automatic Code Generation for Robotic Soccer Behavior Simulation
    Raoni Sales
    Ana Patrícia Fontes Magalhães Mascarenhas
    Marco A. C. Simões
    Josemar Rodrigues de Souza
    Journal of Intelligent & Robotic Systems, 2024, 110
  • [36] Automatic Excitement-Level Detection for Sports Highlights Generation
    Boril, Hynek
    Sangwan, Abhijeet
    Hasan, Taufiq
    Hansen, John H. L.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2202 - 2205
  • [37] An Efficient Framework for Automatic Highlights Generation from Sports Videos
    Javed, Ali
    Bajwa, Khalid Bashir
    Malik, Hafiz
    Irtaza, Aun
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (07) : 954 - 958
  • [38] Generation of sports highlights using a combination of supervised & unsupervised learning in audio domain
    Radhakrishan, R
    Xiong, ZY
    Divakaran, A
    Ishikawa, Y
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 935 - 939
  • [39] Audible Panorama: Automatic Spatial Audio Generation for Panorama Imagery
    Huang, Haikun
    Solah, Michael
    Li, Dingzeyu
    Yu, Lap-Fai
    CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
  • [40] Automatic Ontology Generation for Musical Instruments Based on Audio Analysis
    Kolozali, Sefki
    Barthet, Mathieu
    Fazekas, Gyoergy
    Sandler, Mark
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (10): : 1 - 14