Extracting semantic information from basketball video based on audio-visual features

被引：0

作者：

Kim, K ^{[1
]}

Choi, J ^{[1
]}

Kim, N ^{[1
]}

Kim, P ^{[1
]}

机构：

[1] Agcy Def Dev, Key Technol & Res Ctr, Kwangju 501759, South Korea

来源：

IMAGE AND VIDEO RETRIEVAL | 2002年 / 2383卷

关键词：

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In this paper, we propose a mechanism for extracting semantic information from basketball video sequence using audio and video features. After we divide the input video into shots by a simple cut detection algorithm using visual information, we analyze audio signal data to predict the location of an important event from which a cheering sound happens to start using the combination of MFCC features and the LPC entropy. Finally, we extract semantics about class of shot by computer vision techniques such as basketball tracking and related objects detection. Experimental results show that the proposed scheme can concretely extract semantics from basketball video data as compared to the existing methods.

引用

页码：278 / 288

页数：11

共 50 条

[41] An audio-visual approach to web video categorization
Bogdan Emanuel Ionescu
Klaus Seyerlehner
Ionuţ Mironică
Constantin Vertan
Patrick Lambert
Multimedia Tools and Applications, 2014, 70 : 1007 - 1032
[42] Video concept detection by audio-visual grouplets
Jiang, Wei
Loui, Alexander C.
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2012, 1 (04) : 223 - 238
[43] Dynamic visual features for audio-visual speaker verification
Dean, David
Sridharan, Sridha
COMPUTER SPEECH AND LANGUAGE, 2010, 24 (02): : 136 - 149
[44] Extracting Semantic Information from Visual Data: A Survey
Liu, Qiang
Li, Ruihao
Hu, Huosheng
Gu, Dongbing
ROBOTICS, 2016, 5 (01)
[45] Audio-visual content analysis for content-based video indexing
Tsekeridou, S
Pitas, I
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 1, 1999, : 667 - 672
[46] Audio-visual content analysis for content-based video indexing
Tsekeridou, Sofia
Pitas, Ioannis
International Conference on Multimedia Computing and Systems -Proceedings, 1999, 1 : 667 - 672
[47] Content-based video parsing and indexing based on audio-visual interaction
Tsekeridou, S
Pitas, I
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2001, 11 (04) : 522 - 535
[48] Detection of music segment boundaries using audio-visual features for a personal video recorder
Otsuka, Isao
Suginohara, Hidetsugu
Kusunoki, Yoshiaki
Divakaran, Ajay
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2007, 53 (01) : 150 - 154
[49] Semantic and Relation Modulation for Audio-Visual Event Localization
Wang, Hao
Zha, Zheng-Jun
Li, Liang
Chen, Xuejin
Luo, Jiebo
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7711 - 7725
[50] Hierarchical discriminant features for audio-visual LVCSR
Potamianos, G
Luettin, J
Neti, C
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 165 - 168

← 1 2 3 4 5 →