Audio-visual sports highlights extraction using Coupled Hidden Markov Models

被引：6

作者：

Xiong, ZY ^{[1
]}

机构：

[1] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA

来源：

PATTERN ANALYSIS AND APPLICATIONS | 2005年 / 8卷 / 1-2期

关键词：

D O I：

10.1007/s10044-005-0244-7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present our studies on the application of Coupled Hidden Markov Models(CHMMs) to sports highlights extraction from broadcast video using both audio and video information. First, we generate audio labels using audio classification via Gaussian mixture models, and video labels using quantization of the average motion vector magnitudes. Then, we model sports highlights using discrete-observations CHMMs on audio and video labels classified from a large training set of broadcast sports highlights. Our experimental results on unseen golf and soccer content show that CHMMs outperform Hidden Markov Models(HMMs) trained on audio-only or video-only observations. Next, we study how the coupling between the two single-modality HMMs offers improvement on modelling capability by making refinements on the states of the models. We also show that the number of states optimized in this fashion also gives better classification results than other number of states. We conclude that CHMMs provide a promising tool for information fusion techniques in the sports domain for audio-visual event detection and analysis.

引用

页码：62 / 71

页数：10

共 50 条

[1] Audio-visual sports highlights extraction using Coupled Hidden Markov Models
Ziyou Xiong
[J]. Pattern Analysis and Applications, 2005, 8 : 62 - 71
[2] Audio–visual sports highlights extraction using Coupled Hidden Markov Models
Ziyou Xiong
[J]. Pattern Analysis and Applications, 2006, 8 : 392 - 392
[3] Audio-visual sports highlights extraction using Coupled Hidden Markov Models (vol 8, pg 62, 2005)
Xiong, ZY
[J]. PATTERN ANALYSIS AND APPLICATIONS, 2006, 8 (04) : 392 - 392
[4] Audio-visual speech fusion using coupled hidden Markov models
Chu, Stephen M.
Huang, Thomas S.
[J]. 2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 3911 - +
[5] Audio-visual speaker identification using coupled hidden markov models
Fu, T
Liu, XX
Liang, LH
Pi, XB
Nefian, AV
[J]. 2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 29 - 32
[6] Audio-visual speech modeling using coupled hidden Markov models
Chu, SM
Huang, TS
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 2009 - 2012
[7] Characteristics of the use of coupled hidden Markov models for audio-visual Polish speech recognition
Kubanek, M.
Bobulski, J.
Adrjanowicz, L.
[J]. BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES, 2012, 60 (02) : 307 - 316
[8] Audio-visual sound separation via hidden Markov models
Hershey, J
Casey, M
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 1173 - 1180
[9] Audio-visual speech asynchrony detection using co-inertia analysis and coupled hidden markov models
Enrique Argones Rúa
Hervé Bredin
Carmen García Mateo
Gérard Chollet
Daniel González Jiménez
[J]. Pattern Analysis and Applications, 2009, 12 : 271 - 284
[10] Audio-visual speech asynchrony detection using co-inertia analysis and coupled hidden markov models
Argones Rua, Enrique
Bredin, Herve
Garcia Mateo, Carmen
Chollet, Gerard
Gonzalez Jimenez, Daniel
[J]. PATTERN ANALYSIS AND APPLICATIONS, 2009, 12 (03) : 271 - 284

← 1 2 3 4 5 →