Audio-visual sports highlights extraction using Coupled Hidden Markov Models

被引:6
|
作者
Xiong, ZY [1 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
关键词
D O I
10.1007/s10044-005-0244-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present our studies on the application of Coupled Hidden Markov Models(CHMMs) to sports highlights extraction from broadcast video using both audio and video information. First, we generate audio labels using audio classification via Gaussian mixture models, and video labels using quantization of the average motion vector magnitudes. Then, we model sports highlights using discrete-observations CHMMs on audio and video labels classified from a large training set of broadcast sports highlights. Our experimental results on unseen golf and soccer content show that CHMMs outperform Hidden Markov Models(HMMs) trained on audio-only or video-only observations. Next, we study how the coupling between the two single-modality HMMs offers improvement on modelling capability by making refinements on the states of the models. We also show that the number of states optimized in this fashion also gives better classification results than other number of states. We conclude that CHMMs provide a promising tool for information fusion techniques in the sports domain for audio-visual event detection and analysis.
引用
收藏
页码:62 / 71
页数:10
相关论文
共 50 条
  • [21] Automatic summarization of soccer highlights using audio-visual descriptors
    Raventos, A.
    Quijada, R.
    Torres, Luis
    Tarres, Francesc
    [J]. SPRINGERPLUS, 2015, 4
  • [22] Audio-visual event detection using duration dependent input output Markov models
    Naphade, MR
    Garg, A
    Huang, TS
    [J]. IEEE WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO LIBRARIES, PROCEEDINGS, 2001, : 39 - 43
  • [23] HEALTHCARE AUDIO EVENT CLASSIFICATION USING HIDDEN MARKOV MODELS AND HIERARCHICAL HIDDEN MARKOV MODELS
    Peng, Ya-Ti
    Lin, Ching-Yung
    Sun, Ming-Ting
    Tsai, Kun-Cheng
    [J]. ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 1218 - +
  • [24] Semi-Coupled Hidden Markov Model with State-Based Alignment Strategy for Audio-Visual Emotion Recognition
    Lin, Jen-Chun
    Wu, Chung-Hsien
    Wei, Wen-Li
    [J]. AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PT I, 2011, 6974 : 185 - 194
  • [25] Audio-visual speaker localization using graphical models
    Kushal, Akash
    Rahurkar, Mandar
    Li Fei-Fei
    Ponce, Jean
    Huang, Thomas
    [J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2006, : 291 - +
  • [26] Audio/visual mapping with cross-modal hidden Markov models
    Fu, SL
    Gutierrez-Osuna, R
    Esposito, A
    Kakumanu, PK
    Garcia, ON
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2005, 7 (02) : 243 - 252
  • [27] Extraction of Information of Audio-Visual Contents
    Aguilar, Carlos
    Sanchez, Lydia
    Campos, Manuel
    [J]. TRIPLEC-COMMUNICATION CAPITALISM & CRITIQUE, 2011, 9 (02): : 543 - 550
  • [28] UNSUPERVISED EXTRACTION OF AUDIO-VISUAL OBJECTS
    Casanovas, Anna Llagostera
    Vandergheynst, Pierre
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2284 - 2287
  • [29] Address extraction using hidden Markov models
    Taghva, K
    Coombs, J
    Pereda, R
    Nartker, T
    [J]. Document Recognition and Retrieval XII, 2005, 5676 : 119 - 126
  • [30] A PROBABILISTIC PRINCIPAL COMPONENT ANALYSIS BASED HIDDEN MARKOV MODEL FOR AUDIO-VISUAL SPEECH RECOGNITION
    Ma, Zhanyu
    Leijon, Arne
    [J]. 2008 42ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-4, 2008, : 2170 - 2173