Sports video summarization using acoustic symmetric ternary codes and SVM

Cited by: 2

Authors:

Banjar, Ameen ^{[1]}

Dawood, Hussain ^{[2]}

Javed, Ali ^{[3]}

Zeb, Bushra ^{[3]}

Affiliations:

[1] Univ Jeddah, Dept Informat Syst & Technol, Jeddah, Saudi Arabia

[2] Natl Skills Univ Islamabad, Dept Informat Engn Technol, Islamabad, Pakistan

[3] Univ Engn & Technol, Dept Software Engn, Taxila, Pakistan

Source:

APPLIED ACOUSTICS | 2024 / Vol. 216

Keywords:

Event detection; Excitement detection; SVM; Symmetric ternary codes; Video summarization; FRAMEWORK; NETWORK; REPLAY;

DOI:

10.1016/j.apacoust.2023.109795

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

Broadcasters produce and transmit a vast number of sports videos in cyberspace due to immense viewership and potential commercial benefits. Analyzing and processing such a huge amount of video content is very challenging. This situation demands effective and efficient summarization methods that can manage massive sports video repositories while keeping the viewer's interest and offering storage and transmission benefits. This paper presents an automated summarization framework based on excitement detection for sports videos, e.g., cricket and soccer. The audio stream of the sports video is analyzed to capture the significant events, which are then used to produce a concise video. For effective representation of audio signals, we propose an acoustic feature descriptor, symmetric ternary codes, and use it to train a binary Support Vector Machine classifier for excitement detection. Each audio frame is labeled as either an excited or a non-excited audio frame. The video frames corresponding to the excited audio frames represent the key-events in the sports video and are marked as key-frames. Each key-frame is extended with its neighboring frames to produce a video skim for each key-event, based on the user's required summary length. Finally, these video skims are arranged sequentially to produce the user-driven video summary. We evaluated our highlights generation method on our own diverse YouTube dataset of cricket and soccer videos and on the large-scale SoccerNet corpus of soccer videos. Average accuracies of 97.7% and 91.23% on the two datasets confirm the reliability of our method in terms of key-event detection for sports highlight generation.
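
The skim-assembly step described in the abstract (excited frames become key-frames, each key-event is extended with neighboring frames, and the skims are arranged in order) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the function names `excited_runs` and `build_skims`, the parameter `summary_len`, and the rule of splitting the spare frame budget evenly across events are all hypothetical, and the per-frame labels are assumed to come from an already trained classifier.

```python
from typing import List, Tuple


def excited_runs(labels: List[int]) -> List[Tuple[int, int]]:
    """Group consecutive excited frames (label 1) into half-open [start, end) runs."""
    runs = []
    start = None
    for i, lab in enumerate(labels):
        if lab == 1 and start is None:
            start = i                      # a new key-event begins
        elif lab != 1 and start is not None:
            runs.append((start, i))        # the key-event ended at frame i - 1
            start = None
    if start is not None:                  # event still open at the end of the video
        runs.append((start, len(labels)))
    return runs


def build_skims(labels: List[int], summary_len: int) -> List[Tuple[int, int]]:
    """Pad each key-event with neighboring frames and return skims in time order.

    Assumption (not from the paper): the frames left over after counting the
    key-frames are split evenly across events and across both sides of each event.
    """
    runs = excited_runs(labels)
    if not runs:
        return []
    n = len(labels)
    key_total = sum(end - start for start, end in runs)
    spare = max(0, summary_len - key_total)
    pad = spare // (2 * len(runs))
    skims: List[Tuple[int, int]] = []
    for start, end in runs:
        start, end = max(0, start - pad), min(n, end + pad)
        if skims and start <= skims[-1][1]:
            # Padded skims touch or overlap: merge them into one skim.
            skims[-1] = (skims[-1][0], max(skims[-1][1], end))
        else:
            skims.append((start, end))
    return skims


# Hypothetical per-frame labels: 1 = excited, 0 = non-excited.
frame_labels = [0, 0, 0, 1, 1, 0, 0, 0, 0, 0, 1, 0, 0, 0]
skims = build_skims(frame_labels, summary_len=7)
```

Because the runs are collected in frame order, the resulting skims are already chronological; concatenating the corresponding video segments would give the summary. Clipping at the video boundaries or merging neighbors can make the summary shorter than `summary_len`.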


Pages: 10

