Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation

Cited by: 20
Authors
Wang, SJ [1 ]
Zhao, YX [1 ]
Affiliations
[1] Univ Illinois, Dept Elect & Comp Engn, Beckman Inst, Urbana, IL 61801 USA
Source
Funding
National Science Foundation (USA);
Keywords
affine transformation; Bayesian model selection; hidden Markov models (HMMs); linear regression (LR); model complexity; recursive Bayesian learning; robust priors; speaker adaptation; tree-structure;
DOI
10.1109/89.943344
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper presents a new recursive Bayesian learning approach for transformation parameter estimation in speaker adaptation. Our goal is to incrementally transform or adapt a set of hidden Markov model (HMM) parameters for a new speaker and gain a large performance improvement from a small amount of adaptation data. By constructing a clustering tree of HMM Gaussian mixture components, the linear regression (LR) or affine transformation parameters for HMM Gaussian mixture components are dynamically searched. An online Bayesian learning technique is proposed for recursive maximum a posteriori (MAP) estimation of LR and affine transformation parameters. This technique has the advantage of accommodating flexible forms of transformation functions as well as a priori probability density functions (pdfs). To balance model complexity against goodness of fit to the adaptation data, a dynamic programming algorithm is developed for selecting models using a Bayesian variant of the "minimum description length" (MDL) principle. Speaker adaptation experiments with a 26-letter English alphabet vocabulary were conducted, and the results confirmed the effectiveness of the online learning framework.
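The abstract mentions a dynamic programming search over a clustering tree that trades model complexity against fit. The sketch below is a generic illustration of that idea, not the algorithm from the paper: it assumes each tree node carries a hypothetical `fit_cost` (for example, a negative log-likelihood of the adaptation data when one transformation is shared by all Gaussians under that node) and charges a fixed hypothetical `penalty` per transformation as a stand-in for an MDL-style complexity term. The names `Node`, `select_cut`, and the toy costs are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Node:
    name: str
    # Hypothetical cost of fitting the data with ONE transformation tied at this node.
    fit_cost: float
    children: List["Node"] = field(default_factory=list)


def select_cut(node: Node, penalty: float) -> Tuple[float, List[str]]:
    """Bottom-up dynamic programming over the tree.

    Returns the smallest total cost (fit cost + penalty per transformation)
    for the subtree rooted at `node`, together with the names of the nodes
    that receive their own transformation.
    """
    own_cost = node.fit_cost + penalty
    if not node.children:
        return own_cost, [node.name]

    # Best cost if this node is split and each child subtree is handled separately.
    split_cost, split_nodes = 0.0, []
    for child in node.children:
        child_cost, child_nodes = select_cut(child, penalty)
        split_cost += child_cost
        split_nodes += child_nodes

    # Keep a single shared transformation unless splitting is strictly cheaper.
    if own_cost <= split_cost:
        return own_cost, [node.name]
    return split_cost, split_nodes


# Toy tree with made-up costs (illustrative only).
tree = Node("root", 100.0, [
    Node("vowels", 40.0, [Node("front", 19.0), Node("back", 19.5)]),
    Node("consonants", 45.0),
])

best_cost, best_cut = select_cut(tree, penalty=5.0)
print(best_cost, best_cut)
```

In this toy setup, raising `penalty` pushes the selected cut toward the root (fewer transformations), while lowering it pushes the cut toward the leaves, which mirrors the complexity versus fit trade-off described in the abstract.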
Pages: 663-677
Page count: 15
Related papers
50 in total
  • [1] On-line Bayesian speaker adaptation using tree-structured transformation and robust priors
    Wang, SJ
    Zhao, YX
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 977 - 980
  • [2] Tree-structured model selection and simulated-data adaptation for environmental and speaker robust speech recognition
    Thatphithakkul, Nattanun
    Kruatrachue, Boontee
    Wutiwiwatchai, Chai
    Marukatat, Sanparith
    Boonpiam, Vataya
    2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3, 2007, : 1570 - +
  • [3] Speaker adaptation using tree structured shared-state HMMs
    Ishii, J
    Tonomura, M
    Matsunaga, S
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1149 - 1152
  • [4] A decision tree-structured algorithm of speaker adaptation based on Gaussian Similarity Analysis
    Wu, J
    Wang, ZY
    CHINESE JOURNAL OF ELECTRONICS, 2001, 10 (02): : 166 - 169
  • [5] Bayesian Optimization with Tree-structured Dependencies
    Jenatton, Rodolphe
    Archambeau, Cedric
    Gonzalez, Javier
    Seeger, Matthias
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [6] Speaker Personality Classification Using Systems Based on Acoustic-Lexical Cues and an Optimal Tree-Structured Bayesian Network
    Audhkhasi, Kartik
    Metallinou, Angeliki
    Li, Ming
    Narayanan, Shrikanth S.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 262 - 265
  • [7] Visual tracking with tree-structured appearance model for online learning
    Lv, Yun-Qiu
    Liu, Kai
    Cheng, Fei
    Li, Wei
    IET IMAGE PROCESSING, 2019, 13 (12) : 2106 - 2115
  • [8] Compression via optimal basis selection in large tree-structured dictionaries
    Huang, Yan
    Pollak, Ilya
    COMPUTATIONAL IMAGING IV, 2006, 6065 : IX - XI
  • [9] A tree-structured Markov random field model for Bayesian image segmentation
    D'Elia, C
    Poggi, G
    Scarpa, G
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2003, 12 (10) : 1259 - 1273
  • [10] Tree-Structured Decomposition and Adaptation in MOEA/D
    Zhang, Hanwei
    Zhou, Aimin
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN XV, PT I, 2018, 11101 : 359 - 371