Automatic summarization of soccer highlights using audio-visual descriptors

被引:23
|
作者
Raventos, A. [1 ]
Quijada, R. [1 ]
Torres, Luis [2 ]
Tarres, Francesc [1 ]
机构
[1] UPC BARCELONATECH, Signal Theory & Commun Dept, Castelldefels 08860, Spain
[2] UPC BARCELONATECH, Signal Theory & Commun Dept, Barcelona 08034, Spain
来源
SPRINGERPLUS | 2015年 / 4卷
关键词
Video summarization; Content analysis; Audiovisual descriptors; Multimedia feature extraction; Semantic detection; Multimodal processing and fusion; SHOT-BOUNDARY DETECTION; OF-THE-ART; RETRIEVAL;
D O I
10.1186/s40064-015-1065-9
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Automatic summarization generation of sports video content has been object of great interest for many years. Although semantic descriptions techniques have been proposed, many of the approaches still rely on low-level video descriptors that render quite limited results due to the complexity of the problem and to the low capability of the descriptors to represent semantic content. In this paper, a new approach for automatic highlights summarization generation of soccer videos using audio-visual descriptors is presented. The approach is based on the segmentation of the video sequence into shots that will be further analyzed to determine its relevance and interest. Of special interest in the approach is the use of the audio information that provides additional robustness to the overall performance of the summarization system. For every video shot a set of low and mid level audio-visual descriptors are computed and lately adequately combined in order to obtain different relevance measures based on empirical knowledge rules. The final summary is generated by selecting those shots with highest interest according to the specifications of the user and the results of relevance measures. A variety of results are presented with real soccer video sequences that prove the validity of the approach.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] AUTOMATIC SUMMARIZATION OF AUDIO-VISUAL SOCCER FEEDS
    Chen, Fan
    De Vleeschouwer, C.
    Duxans Barrobes, H.
    Gregorio Escalada, J.
    Conejero, D.
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 837 - 842
  • [2] The Importance of Audio Descriptors in Automatic Soccer Highlights Generation
    Raventos, Arnau
    Quijada, Raul
    Torres, Luis
    Tarres, Francesc
    Carasusan, Eusebio
    Giribet, Daniel
    [J]. 2014 11TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2014,
  • [3] A audio-visual model for efficient video summarization
    El-Nagar, Gamal
    El-Sawy, Ahmed
    Rashad, Metwally
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 100
  • [4] An audio-visual saliency model for movie summarization
    Rapantzikos, Konstantinos
    Evangelopoulos, Georgios
    Maragos, Petros
    Avrithis, Yannis
    [J]. 2007 IEEE NINTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2007, : 320 - 323
  • [5] Automatic Piano Music Transcription Using Audio-Visual Features
    WAN Yulong
    WANG Xianliang
    ZHOU Ruohua
    YAN Yonghong
    [J]. Chinese Journal of Electronics, 2015, 24 (03) : 596 - 603
  • [6] Structure in soccer videos: Detecting and classifying highlights for automatic summarization
    Sgarbi, E
    Borges, DL
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2005, 3773 : 691 - 700
  • [7] Automatic Piano Music Transcription Using Audio-Visual Features
    Wan Yulong
    Wang Xianliang
    Zhou Ruohua
    Yan Yonghong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (03) : 596 - 603
  • [8] Automatic extraction of soccer video highlights using a combination of motion and audio features
    Cabasson, R
    Divakaran, A
    [J]. STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2003, 2003, 5021 : 272 - 276
  • [9] PREDICTING AUDIO-VISUAL SALIENT EVENTS BASED ON VISUAL, AUDIO AND TEXT MODALITIES FOR MOVIE SUMMARIZATION
    Koutras, P.
    Zlatintsi, A.
    Iosif, E.
    Katsamanis, A.
    Maragos, P.
    Potamianos, A.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 4361 - 4365
  • [10] Attention-Based Audio-Visual Fusion for Video Summarization
    Fang, Yinghong
    Zhang, Junpeng
    Lu, Cewu
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 328 - 340