Audio-visual sports highlights extraction using Coupled Hidden Markov Models

被引:6
|
作者
Xiong, ZY [1 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
关键词
D O I
10.1007/s10044-005-0244-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present our studies on the application of Coupled Hidden Markov Models(CHMMs) to sports highlights extraction from broadcast video using both audio and video information. First, we generate audio labels using audio classification via Gaussian mixture models, and video labels using quantization of the average motion vector magnitudes. Then, we model sports highlights using discrete-observations CHMMs on audio and video labels classified from a large training set of broadcast sports highlights. Our experimental results on unseen golf and soccer content show that CHMMs outperform Hidden Markov Models(HMMs) trained on audio-only or video-only observations. Next, we study how the coupling between the two single-modality HMMs offers improvement on modelling capability by making refinements on the states of the models. We also show that the number of states optimized in this fashion also gives better classification results than other number of states. We conclude that CHMMs provide a promising tool for information fusion techniques in the sports domain for audio-visual event detection and analysis.
引用
收藏
页码:62 / 71
页数:10
相关论文
共 50 条
  • [1] Audio-visual sports highlights extraction using Coupled Hidden Markov Models
    Ziyou Xiong
    [J]. Pattern Analysis and Applications, 2005, 8 : 62 - 71
  • [2] Audio–visual sports highlights extraction using Coupled Hidden Markov Models
    Ziyou Xiong
    [J]. Pattern Analysis and Applications, 2006, 8 : 392 - 392
  • [3] Audio-visual sports highlights extraction using Coupled Hidden Markov Models (vol 8, pg 62, 2005)
    Xiong, ZY
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2006, 8 (04) : 392 - 392
  • [4] Audio-visual speech fusion using coupled hidden Markov models
    Chu, Stephen M.
    Huang, Thomas S.
    [J]. 2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 3911 - +
  • [5] Audio-visual speaker identification using coupled hidden markov models
    Fu, T
    Liu, XX
    Liang, LH
    Pi, XB
    Nefian, AV
    [J]. 2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 29 - 32
  • [6] Audio-visual speech modeling using coupled hidden Markov models
    Chu, SM
    Huang, TS
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 2009 - 2012
  • [7] Characteristics of the use of coupled hidden Markov models for audio-visual Polish speech recognition
    Kubanek, M.
    Bobulski, J.
    Adrjanowicz, L.
    [J]. BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES, 2012, 60 (02) : 307 - 316
  • [8] Audio-visual sound separation via hidden Markov models
    Hershey, J
    Casey, M
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 1173 - 1180
  • [9] Audio-visual speech asynchrony detection using co-inertia analysis and coupled hidden markov models
    Enrique Argones Rúa
    Hervé Bredin
    Carmen García Mateo
    Gérard Chollet
    Daniel González Jiménez
    [J]. Pattern Analysis and Applications, 2009, 12 : 271 - 284
  • [10] Audio-visual speech asynchrony detection using co-inertia analysis and coupled hidden markov models
    Argones Rua, Enrique
    Bredin, Herve
    Garcia Mateo, Carmen
    Chollet, Gerard
    Gonzalez Jimenez, Daniel
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2009, 12 (03) : 271 - 284