A stream-weight optimization method for multi-stream HMMS based on likelihood value normalization

被引:0
|
作者
Tamura, S [1 ]
Iwano, K [1 ]
Furui, S [1 ]
机构
[1] Tokyo Inst Technol, Dept Comp Sci, Meguro Ku, Tokyo 1528552, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of audio-visual speech recognition, multi-stream HMMs are widely used, thus bow to automatically and properly determine stream weight factors using a small data set becomes an important research issue. This paper proposes a new stream-weight optimization method based on an output likelihood normalization criterion. In this method, the stream weights are adjusted to equalize the mean values of log likelihood for all HMMs. based on likelihood-ratio maximization which achieved significant improvement by using, a large optimization data set. The new method is evaluated using Japanese connected digit speech recorded in real-world environments. Using 10 seconds speech data for stream-weight optimization, a 10% absolute accuracy improvement is achieved compared to the result before optimization. By additionally applying the MLLR (maximum likelihood linear regression) adaptation, a 23% improvement is obtained over the audio-only scheme.
引用
收藏
页码:469 / 472
页数:4
相关论文
共 50 条
  • [1] A stream-weight optimization method for audio-visual speech recognition using multi-stream HMMS
    Tamura, S
    Iwano, K
    Furui, S
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 857 - 860
  • [2] A stream-weight and threshold estimation method using adaboost for multi-stream speaker verification
    Asami, Taichi
    Iwano, Koji
    Furui, Sadaoki
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 5939 - 5942
  • [3] Stream weight computation for multi-stream classifiers
    Potamianos, Alexandros
    Sanchez-Soto, Eduardo
    Daoudi, Khalid
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 353 - 356
  • [4] Robust scene extraction using multi-stream HMMs for baseball broadcast
    Bach, Nguyen Hun
    Shinoda, Koichi
    Furui, Sadaoki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (09) : 2553 - 2561
  • [5] Derivated-network optimization method of multi-stream heat exchanger networks
    Cui, Guo-Min
    Gao, Xiao-Zhong
    Guo, Jia
    Lu, Yan-Yan
    Guan, Xin
    Kung Cheng Je Wu Li Hsueh Pao/Journal of Engineering Thermophysics, 2008, 29 (08): : 1403 - 1406
  • [6] An automated method for synthesizing a multi-stream heat exchanger network based on stream pseudo-temperature
    Yuan, Dongwen
    Wang, Yao
    Xiao, Wu
    Yao, Pingjing
    Luo, Xing
    Roetzel, Wilfried
    16TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING AND 9TH INTERNATIONAL SYMPOSIUM ON PROCESS SYSTEMS ENGINEERING, 2006, 21 : 919 - 924
  • [7] DBN based multi-stream models for speech
    Zhang, YM
    Diao, Q
    Huang, S
    Hu, W
    Bartels, C
    Bilmes, J
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 836 - 839
  • [8] Secret Key Generation Method Based on Multi-stream Random Signal
    Jin Liang
    Cai Aolin
    Huang Kaizhi
    Zhong Zhou
    Lou Yangming
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (06) : 1405 - 1412
  • [9] Fused HMM-Adaptation of Multi-Stream HMMs for Audio-Visual Speech Recognition
    Dean, David
    Lucey, Patrick
    Sridharan, Sridha
    Wark, Tim
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2272 - 2275
  • [10] Multi-stream (Q , r) model and optimization for data prefetching
    Zhu, Xiaoyan
    Wang, Jun
    Yuan, Qi
    Zhang, Zhe
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 302 (01) : 130 - 143