Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation

被引:20
|
作者
Wang, SJ [1 ]
Zhao, YX [1 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, Beckham Inst, Urbana, IL 61801 USA
来源
基金
美国国家科学基金会;
关键词
affine transformation; Bayesian model selection; hidden Markov models (HMMs); linear regression (LR); model complexity; recursive Bayesian learning; robust priors; speaker adaptation; tree-structure;
D O I
10.1109/89.943344
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a new recursive Bayesian learning approach for transformation parameter estimation in speaker adaptation. Our goal is to incrementally transform or adapt a set of hidden Markov model (HMM) parameters for a new speaker and gain large performance improvement from a small amount of adaptation data. By constructing a clustering tree of HMM Gaussian mixture components, the linear regression (LR) or affine transformation parameters for HMM Gaussian mixture components are dynamically searched. An online Bayesian learning technique is proposed for recursive maximum a posteriori (MAP) estimation of LR and affine transformation parameters. This technique has the advantages of being able to accommodate flexible forms of transformation functions as well as a priori probability density functions (pdfs). To balance between model complexity and goodness of fit to adaptation data, a dynamic programming algorithm is developed for selecting models using a Bayesian variant of the "minimum description length" (MDL) principle. Speaker adaptation experiments with a 26-letter English alphabet vocabulary were conducted, and the results confirmed effectiveness of the online learning framework.
引用
收藏
页码:663 / 677
页数:15
相关论文
共 50 条
  • [41] Optimal Packet Scheduling in Tree-Structured LEO Satellite Clusters
    Maurizio A. Bonuccelli
    Francesca Martelli
    Susanna Pelagatti
    Mobile Networks and Applications, 2004, 9 : 289 - 295
  • [42] AN OPTIMAL BACKTRACK ALGORITHM FOR TREE-STRUCTURED CONSTRAINT SATISFACTION PROBLEMS
    BAYARDO, RJ
    MIRANKER, DP
    ARTIFICIAL INTELLIGENCE, 1994, 71 (01) : 159 - 181
  • [43] Optimal packet scheduling in tree-structured LEO satellite clusters
    Bonuccelli, MA
    Martelli, F
    Pelagatti, S
    MOBILE NETWORKS & APPLICATIONS, 2004, 9 (04): : 289 - 295
  • [44] IMAGE DECONVOLUTION USING TREE-STRUCTURED BAYESIAN GROUP SPARSE MODELING
    Zhang, Ganchi
    Roberts, Timothy D.
    Kingsbury, Nick
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 4537 - 4541
  • [45] A tree-structured clustering method integrating noise and SNR for piecewise linear-transformation-based noise adaptation
    Zhang, ZP
    Sugimura, T
    Furui, S
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 981 - 984
  • [46] Reducing computation in an i-vector speaker recognition system using a tree-structured universal background model
    McClanahan, Richard
    De Leon, Phillip L.
    SPEECH COMMUNICATION, 2015, 66 : 36 - 46
  • [47] A Tree-Structured Multitask Model Architectures Recommendation System
    Zhang, Lijun
    Liu, Xiao
    Guan, Hui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 15578 - 15588
  • [48] ToMExO: A probabilistic tree-structured model for cancer progression
    Neyshabouri, Mohammadreza Mohaghegh
    Lagergren, Jens
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (12)
  • [49] Reducing computation in an i-vector speaker recognition system using a tree-structured universal background model
    McClanahan, Richard
    De Leon, Phillip L.
    Speech Communication, 2015, 66 : 36 - 46
  • [50] A Tree-Structured Multi-Task Model Recommender
    Zhang, Lijun
    Liu, Xiao
    Guan, Hui
    INTERNATIONAL CONFERENCE ON AUTOMATED MACHINE LEARNING, VOL 188, 2022, 188