Sports video summarization using acoustic symmetric ternary codes and SVM

被引:2
|
作者
Banjar, Ameen [1 ]
Dawood, Hussain [2 ]
Javed, Ali [3 ]
Zeb, Bushra [3 ]
机构
[1] Univ Jeddah, Dept Informat Syst & Technol, Jeddah, Saudi Arabia
[2] Natl Skills Univ Islamabad, Dept Informat Engn Technol, Islamabad, Pakistan
[3] Univ Engn & Technol, Dept Software Engn, Taxila, Pakistan
关键词
Event detection; Excitement detection; SVM; Symmetric ternary codes; Video summarization; FRAMEWORK; NETWORK; REPLAY;
D O I
10.1016/j.apacoust.2023.109795
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Broadcasters produce and transmit a vast number of sports videos in cyberspace due to immense viewership and potential commercial benefits. The analysis and processing of such a huge amount of video content are very challenging. This situation demands the development of effective and efficient summarization methods to manage the massive sports video repository while keeping the viewer's interest along with potential storage and transmission benefits. This paper presents an automated summarization framework based on excitement detection for sports videos i.e., cricket, soccer, etc. The audio stream of the sports video is analyzed to capture the significant events that are then used to produce the concise video. For effective representation of audio signals, we proposed an acoustic feature descriptor symmetric ternary codes and used them to train a binary Support Vector Machine classifier for excitement detection. Each audio frame is labeled as either an excited audio frame or a non-excited audio frame. The video frames corresponding to the excited audio frames represent the key-events in the sports videos and are marked as the key-frames. Each key-frame is appended with the neigh-boring frames to produce video skims for each key-event based on the user's required summary length. Finally, these video skims are sequentially arranged to produce the user-driven video summary. We evaluated our highlights generation method on our own diverse YouTube dataset of cricket and soccer videos, and a largescale SoccerNet corpus of soccer videos. The average accuracy of 97.7% and 91.23% on both datasets confirms the reliability of our method in terms of key-event detection for sports highlight generation.
引用
收藏
页数:10
相关论文
共 50 条
  • [11] Shot classification and replay detection for sports video summarization
    Javed, Ali
    Ali Khan, Amen
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, 23 (05) : 790 - 800
  • [12] Bridging the semantic gap in sports video retrieval and summarization
    Li, BX
    Errico, JH
    Pan, H
    Sezan, I
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2004, 15 (03) : 393 - 424
  • [13] Integrating highlights for more complete sports video summarization
    Tjondronegoro, D
    Chen, YPP
    Pham, B
    IEEE MULTIMEDIA, 2004, 11 (04) : 22 - 37
  • [14] Highlight summarization in sports video based on replay detection
    Zhao, Zhao
    Jiang, Shuqiang
    Huang, Qingming
    Zhu, Guangyu
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1613 - +
  • [15] Sports video summarization and adaptation for application in mobile communication
    Gao W.
    Huang Q.-M.
    Jiang S.-Q.
    Zhang P.
    Journal of Zhejiang University: Science, 2006, 7 (05): : 819 - 829
  • [17] Summarization of User-Generated Sports Video by Using Deep Action Recognition Features
    Tejero-de-Pablos, Antonio
    Nakashima, Yuta
    Sato, Tomokazu
    Yokoya, Naokazu
    Linna, Marko
    Rahtu, Esa
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (08) : 2000 - 2011
  • [18] A Logic Framework for Sports Video Summarization using Text-Based Semantic Annotation
    Refaey, Mohammed A.
    Abd-Almageed, Wael
    Davis, Larry S.
    THIRD INTERNATIONAL WORKSHOP ON SEMANTIC MEDIA ADAPTATION AND PERSONALIZATION, PROCEEDINGS, 2008, : 69 - 75
  • [19] General framework for sports video summarization with its application to soccer
    Li, BX
    Pan, H
    Sezan, I
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING SIGNAL, PROCESSING EDUCATION, 2003, : 169 - 172
  • [20] Unsupervised video summarization using deep Non-Local video summarization networks
    Zang, Sha-Sha
    Yu, Hui
    Song, Yan
    Zeng, Ru
    NEUROCOMPUTING, 2023, 519 : 26 - 35