The Importance of Audio Descriptors in Automatic Soccer Highlights Generation

Authors
Raventos, Arnau [1]
Quijada, Raul [1]
Torres, Luis [1]
Tarres, Francesc [1]
Carasusan, Eusebio [2]
Giribet, Daniel [2]
Affiliations
[1] UPC Barcelona Tech, Signal Theory & Commun Dept, Castelldefels, Barcelona, Spain
[2] Televisio Catalunya, Esplugas de Llobregat, Barcelona, Spain
Keywords
video highlights; content analysis; audio descriptors; whistle detector; semantic detection; multimodal processing and fusion; FEATURES
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
Automatic generation of sports highlights from recorded audiovisual content has been the object of great interest in recent years. The problem is especially important in the production of highlight videos for second- and third-division leagues, where the quantity of raw material is significant and carries no manual annotations. Most approaches rely mainly on analysis of the video and disregard the important information provided by the audio track. In this paper, a new approach that combines audio and video descriptors for automatic soccer highlights generation is proposed. The approach is based on segmenting the video content into shots, which are then analyzed to determine their relevance and interest. These video shots are scored by fusing different audio and video features. The paper focuses on the importance of audio detectors, which play a key role in the analysis and scoring of the video shots. Specifically, a new algorithm for referee's whistle detection is proposed. The algorithm has proven to be very robust and efficiently discriminates professional whistles from other types of noise such as crowd cheering, musical instruments, etc. Several results have been produced using real soccer video sequences that demonstrate the validity of the proposed audio and video fusion scheme.
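The abstract does not describe how the whistle detector works, so the following is only an illustrative sketch of one common way to flag tonal, whistle-like audio frames, not the authors' algorithm. It measures the fraction of each frame's energy that falls inside a narrow frequency band, using the Goertzel recurrence to evaluate individual DFT bins. The function names, the 3500-4500 Hz band, the 512-sample frame length, and the 0.5 threshold are all assumptions chosen for illustration.

```python
import math


def goertzel_power(frame, k):
    """Squared DFT magnitude |X[k]|^2 of `frame` at integer bin k (Goertzel recurrence)."""
    coeff = 2.0 * math.cos(2.0 * math.pi * k / len(frame))
    s1 = s2 = 0.0
    for x in frame:
        s0 = x + coeff * s1 - s2
        s2, s1 = s1, s0
    return s1 * s1 + s2 * s2 - coeff * s1 * s2


def band_energy_ratio(frame, sample_rate, f_lo, f_hi):
    """Fraction of the frame's energy lying in [f_lo, f_hi] Hz.

    By Parseval's theorem the squared bin magnitudes sum to n * energy;
    the factor 2 accounts for the mirrored negative-frequency bins, so the
    band must lie strictly between 0 Hz and the Nyquist frequency.
    """
    n = len(frame)
    total = sum(x * x for x in frame)
    if total == 0.0:
        return 0.0  # silent frame: nothing to classify
    k_lo = math.ceil(f_lo * n / sample_rate)
    k_hi = math.floor(f_hi * n / sample_rate)
    band = sum(goertzel_power(frame, k) for k in range(k_lo, k_hi + 1))
    return 2.0 * band / (n * total)


def detect_whistle_frames(samples, sample_rate, frame_len=512,
                          f_lo=3500.0, f_hi=4500.0, threshold=0.5):
    """Return one boolean per non-overlapping frame: True if whistle-like.

    The band limits and threshold are hypothetical defaults, not values
    taken from the paper.
    """
    flags = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        ratio = band_energy_ratio(frame, sample_rate, f_lo, f_hi)
        flags.append(ratio >= threshold)
    return flags
```

A practical detector would likely also require the tone to persist over several consecutive frames, and would need additional cues to reject musical instruments that play in the same band. The per-frame flags could then feed a shot-level score alongside the video features, in the spirit of the fusion scheme the abstract describes.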
Pages: 6
Related papers
50 in total
  • [41] Audio watermarking technologies for automatic cue sheet generation systems
    Caccia, G
    Lancini, R
    Pascarella, A
    Tubaro, S
    Vicario, E
    SECURITY AND WATERMARKING OF MULTIMEDIA CONTENTS III, 2001, 4314 : 96 - 103
  • [42] New Automatic Taxonomy Generation Algorithm for the Audio Genre Classification
    Choi, Tacksung
    Moon, Sunkook
    Park, Youngcheol
    Youn, Daehee
    Lee, Seokpil
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2008, 27 (03): : 111 - 118
  • [43] AUTOMATIC-GENERATION OF MICROCODE FOR A DIGITAL AUDIO SIGNAL PROCESSOR
    MCCULLOCH, CM
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1986, 34 (12): : 1031 - 1032
  • [44] Highlights detection and recognition in soccer videos
    Assfalg, E
    Bertini, M
    Colombo, C
    Del Bimbo, A
    Nunziati, W
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IX, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING II, 2002, : 18 - 23
  • [45] Composer4Everyone: Automatic Music Generation with Audio Motif
    Liu, Aozhi
    Wang, Jianzong
    Peng, Junqing
    Wang, Yiwen
    Mei, Yaqi
    Liang, Xiaojing
    Xia, Zimin
    Xiao, Jing
    2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 502 - 503
  • [46] Efficient Audio Segmentation in Soccer Videos
    Raghuram, M. A.
    Chavan, Nikhil R.
    Koolagudi, Shashidhar G.
    Ramteke, Pravin B.
    2016 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2016,
  • [47] Audio based Soccer Game Summarization
    Duxans, Helenca
    Anguera, Xavier
    Conejero, David
    BMSB: 2009 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING, VOLS 1 AND 2, 2009, : 283 - 288
  • [48] psDirector: An Automatic Director for Watching View Generation from Panoramic Soccer Video
    Li, Chunyang
    Jia, Caiyan
    Chen, Zhineng
    Gu, Xiaoyan
    Bao, Hongyun
    MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 218 - 230
  • [49] Watch and Act: Dual Interacting Agents for Automatic Generation of Possession Statistics in Soccer
    Sarkar, Saikat
    Mukherjee, Dipti Prasad
    Chakrabarti, Amlan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 3559 - 3567
  • [50] A semi-automatic system for ground truth generation of soccer video sequences
    D'Orazio, T.
    Leo, M.
    Mosca, N.
    Spagnolo, P.
    Mazzeo, P. L.
    AVSS: 2009 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, 2009, : 559 - +