Automatic summarization of soccer highlights using audio-visual descriptors

被引:23
|
作者
Raventos, A. [1 ]
Quijada, R. [1 ]
Torres, Luis [2 ]
Tarres, Francesc [1 ]
机构
[1] UPC BARCELONATECH, Signal Theory & Commun Dept, Castelldefels 08860, Spain
[2] UPC BARCELONATECH, Signal Theory & Commun Dept, Barcelona 08034, Spain
来源
SPRINGERPLUS | 2015年 / 4卷
关键词
Video summarization; Content analysis; Audiovisual descriptors; Multimedia feature extraction; Semantic detection; Multimodal processing and fusion; SHOT-BOUNDARY DETECTION; OF-THE-ART; RETRIEVAL;
D O I
10.1186/s40064-015-1065-9
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Automatic summarization generation of sports video content has been object of great interest for many years. Although semantic descriptions techniques have been proposed, many of the approaches still rely on low-level video descriptors that render quite limited results due to the complexity of the problem and to the low capability of the descriptors to represent semantic content. In this paper, a new approach for automatic highlights summarization generation of soccer videos using audio-visual descriptors is presented. The approach is based on the segmentation of the video sequence into shots that will be further analyzed to determine its relevance and interest. Of special interest in the approach is the use of the audio information that provides additional robustness to the overall performance of the summarization system. For every video shot a set of low and mid level audio-visual descriptors are computed and lately adequately combined in order to obtain different relevance measures based on empirical knowledge rules. The final summary is generated by selecting those shots with highest interest according to the specifications of the user and the results of relevance measures. A variety of results are presented with real soccer video sequences that prove the validity of the approach.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Audio-Visual Objects
    Kubovy M.
    Schutz M.
    [J]. Review of Philosophy and Psychology, 2010, 1 (1) : 41 - 61
  • [42] Audio-Visual Segmentation
    Zhou, Jinxing
    Wang, Jianyuan
    Zhang, Jiayi
    Sun, Weixuan
    Zhang, Jing
    Birchfield, Stan
    Guo, Dan
    Kong, Lingpeng
    Wang, Meng
    Zhong, Yiran
    [J]. COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 386 - 403
  • [43] AUDIO-VISUAL CLINICS
    GRABER, TM
    HANNETT, HA
    [J]. AMERICAN JOURNAL OF ORTHODONTICS AND DENTOFACIAL ORTHOPEDICS, 1963, 49 (07) : 538 - &
  • [44] AUDIO-VISUAL DEVELOPMENTS
    Schwartz, Mortimer
    [J]. JOURNAL OF LEGAL EDUCATION, 1952, 5 (01) : 88 - 95
  • [45] Audio-Visual Techniques
    Sears, William P., Jr.
    [J]. EDUCATION, 1948, 69 (02): : 132 - 132
  • [46] AUDIO-VISUAL POTPOURRI
    不详
    [J]. INDUSTRIAL PHOTOGRAPHY, 1968, 17 (07): : 30 - &
  • [47] AUDIO-VISUAL TECHNOLOGIES
    TAKESHITA, M
    FURUKAWA, M
    HAYATSU, R
    MURAKAMI, R
    SUZUKI, K
    HASHIZUME, K
    [J]. NEC RESEARCH & DEVELOPMENT, 1990, (96): : 265 - 277
  • [48] Audio-visual imposture
    Karam, Walid
    Mokbel, Chafic
    Greige, Hanna
    Chollet, Gerard
    [J]. MOBILE MULTIMEDIA/IMAGE PROCESSING FOR MILITARY AND SECURITY APPLICATIONS, 2006, 6250
  • [49] AUDIO-VISUAL UNIT
    WHARTON, BA
    [J]. PEDIATRICS, 1971, 47 (05) : 957 - &
  • [50] Audio-visual biometrics
    Aleksic, Petar S.
    Katsaggelos, Aggelos K.
    [J]. PROCEEDINGS OF THE IEEE, 2006, 94 (11) : 2025 - 2044