Extracting semantic information from basketball video based on audio-visual features

被引:0
|
作者
Kim, K [1 ]
Choi, J [1 ]
Kim, N [1 ]
Kim, P [1 ]
机构
[1] Agcy Def Dev, Key Technol & Res Ctr, Kwangju 501759, South Korea
来源
IMAGE AND VIDEO RETRIEVAL | 2002年 / 2383卷
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we propose a mechanism for extracting semantic information from basketball video sequence using audio and video features. After we divide the input video into shots by a simple cut detection algorithm using visual information, we analyze audio signal data to predict the location of an important event from which a cheering sound happens to start using the combination of MFCC features and the LPC entropy. Finally, we extract semantics about class of shot by computer vision techniques such as basketball tracking and related objects detection. Experimental results show that the proposed scheme can concretely extract semantics from basketball video data as compared to the existing methods.
引用
收藏
页码:278 / 288
页数:11
相关论文
共 50 条
  • [41] An audio-visual approach to web video categorization
    Bogdan Emanuel Ionescu
    Klaus Seyerlehner
    Ionuţ Mironică
    Constantin Vertan
    Patrick Lambert
    Multimedia Tools and Applications, 2014, 70 : 1007 - 1032
  • [42] Video concept detection by audio-visual grouplets
    Jiang, Wei
    Loui, Alexander C.
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2012, 1 (04) : 223 - 238
  • [43] Dynamic visual features for audio-visual speaker verification
    Dean, David
    Sridharan, Sridha
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (02): : 136 - 149
  • [44] Extracting Semantic Information from Visual Data: A Survey
    Liu, Qiang
    Li, Ruihao
    Hu, Huosheng
    Gu, Dongbing
    ROBOTICS, 2016, 5 (01)
  • [45] Audio-visual content analysis for content-based video indexing
    Tsekeridou, S
    Pitas, I
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 1, 1999, : 667 - 672
  • [46] Audio-visual content analysis for content-based video indexing
    Tsekeridou, Sofia
    Pitas, Ioannis
    International Conference on Multimedia Computing and Systems -Proceedings, 1999, 1 : 667 - 672
  • [47] Content-based video parsing and indexing based on audio-visual interaction
    Tsekeridou, S
    Pitas, I
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2001, 11 (04) : 522 - 535
  • [48] Detection of music segment boundaries using audio-visual features for a personal video recorder
    Otsuka, Isao
    Suginohara, Hidetsugu
    Kusunoki, Yoshiaki
    Divakaran, Ajay
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2007, 53 (01) : 150 - 154
  • [49] Semantic and Relation Modulation for Audio-Visual Event Localization
    Wang, Hao
    Zha, Zheng-Jun
    Li, Liang
    Chen, Xuejin
    Luo, Jiebo
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7711 - 7725
  • [50] Hierarchical discriminant features for audio-visual LVCSR
    Potamianos, G
    Luettin, J
    Neti, C
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 165 - 168