Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation

被引:20
|
作者
Wang, SJ [1 ]
Zhao, YX [1 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, Beckham Inst, Urbana, IL 61801 USA
来源
基金
美国国家科学基金会;
关键词
affine transformation; Bayesian model selection; hidden Markov models (HMMs); linear regression (LR); model complexity; recursive Bayesian learning; robust priors; speaker adaptation; tree-structure;
D O I
10.1109/89.943344
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a new recursive Bayesian learning approach for transformation parameter estimation in speaker adaptation. Our goal is to incrementally transform or adapt a set of hidden Markov model (HMM) parameters for a new speaker and gain large performance improvement from a small amount of adaptation data. By constructing a clustering tree of HMM Gaussian mixture components, the linear regression (LR) or affine transformation parameters for HMM Gaussian mixture components are dynamically searched. An online Bayesian learning technique is proposed for recursive maximum a posteriori (MAP) estimation of LR and affine transformation parameters. This technique has the advantages of being able to accommodate flexible forms of transformation functions as well as a priori probability density functions (pdfs). To balance between model complexity and goodness of fit to adaptation data, a dynamic programming algorithm is developed for selecting models using a Bayesian variant of the "minimum description length" (MDL) principle. Speaker adaptation experiments with a 26-letter English alphabet vocabulary were conducted, and the results confirmed effectiveness of the online learning framework.
引用
收藏
页码:663 / 677
页数:15
相关论文
共 50 条
  • [22] Tree-Structured Neural Topic Model
    Isonuma, Masaru
    Mori, Junichiro
    Bollegala, Danushka
    Sakata, Ichiro
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 800 - 806
  • [23] Tree-Structured Model with Unbiased Variable Selection and Interaction Detection for Ranking Data
    Shih, Yu-Shan
    Kung, Yi-Hung
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (02): : 448 - 459
  • [24] Design of optimal orthogonal tree-structured filter banks
    Gandhi, R
    Mitra, SK
    42ND MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, PROCEEDINGS, VOLS 1 AND 2, 1999, : 1057 - 1060
  • [25] Tree-Structured Bayesian Networks for Wrapped Cauchy Directional Distributions
    Leguey, Ignacio
    Bielza, Concha
    Larranaga, Pedro
    ADVANCES IN ARTIFICIAL INTELLIGENCE, CAEPIA 2016, 2016, 9868 : 207 - 216
  • [26] Tree-structured Bayesian network learning with application to scene classification
    Wang, Z. F.
    Wang, Z. H.
    Xie, W. J.
    ELECTRONICS LETTERS, 2011, 47 (09) : 540 - 541
  • [27] Bayesian Melody Harmonization Based on a Tree-Structured Generative Model of Chord Sequences and Melodies
    Tsushima, Hiroaki
    Nakamura, Eita
    Yoshii, Kazuyoshi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1644 - 1655
  • [28] Tree-Structured Bayesian Compressive Sensing based Image Watermarking
    Li, Xiumei
    Bai, Huang
    Sun, Junmei
    ELEVENTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING SYSTEMS, 2019, 11384
  • [29] NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks
    Kontkanen, Petri
    Wettig, Hannes
    Myllymaki, Petri
    EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2007, (01)
  • [30] A Nonparametric Regression Model With Tree-Structured Response
    Wang, Yuan
    Marron, J. S.
    Aydin, Burcu
    Ladha, Alim
    Bullitt, Elizabeth
    Wang, Haonan
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (500) : 1272 - 1285