Speaker adaptation using tree structured shared-state HMMs

被引:0
|
作者
Ishii, J
Tonomura, M
Matsunaga, S
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a novel speaker adaptation method that flexibly controls state-sharing of HMMs according to the amount of adaptation data. In our scheme, acoustic modeling is combined with adaptation to efficiently utilize the acoustic models sharing characteristics for adaptation. The shared-state set of HMMs is determined by using tree-structured shared-state HMMs created from the history recorded for acoustic model generation. The proposed method is applied to the parameter-tying and parameter-smoothing techniques. Experiments have been performed on a Japanese phoneme recognition test using continuous density mixture Gaussian HMMs. Using 50 adaptation phrases, a 42% reduction in the phoneme recognition error rate from the speaker-independent model was achieved.
引用
收藏
页码:1149 / 1152
页数:4
相关论文
共 50 条
  • [31] Using articulatory information for speaker adaptation
    Metze, F
    Waibel, A
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 405 - 410
  • [32] Speaker adaptation using an eigenphone basis
    Kenny, P
    Boulianne, G
    Ouellet, P
    Dumouchel, P
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (06): : 579 - 589
  • [33] Tree-Structured Decomposition and Adaptation in MOEA/D
    Zhang, Hanwei
    Zhou, Aimin
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN XV, PT I, 2018, 11101 : 359 - 371
  • [34] A hybrid algorithm for speaker adaptation using MAP transformation and adaptation
    Chien, JT
    Lee, CH
    Wang, HC
    IEEE SIGNAL PROCESSING LETTERS, 1997, 4 (06) : 167 - 169
  • [35] Detecting distant homologs using phylogenetic tree-based HMMs
    Qian, B
    Goldstein, RA
    PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2003, 52 (03): : 446 - 453
  • [36] Towards a Speaker Independent Speech-BCI Using Speaker Adaptation
    Dash, Debadatta
    Wisler, Alan
    Ferrari, Paul
    Wang, Jun
    INTERSPEECH 2019, 2019, : 864 - 868
  • [37] Ensemble Speaker Modeling using Speaker Adaptive Training Deep Neural Network for Speaker Adaptation
    Li, Sheng
    Lu, Xugang
    Akita, Yuya
    Kawahara, Tatsuya
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2892 - 2896
  • [38] SPEAKER ADAPTATION USING LEARNING VECTOR QUANTIZATION
    KOO, MW
    UN, CK
    ELECTRONICS LETTERS, 1990, 26 (20) : 1731 - 1732
  • [39] Improvements in speaker adaptation using weighted training
    Jang, G
    Woo, S
    Jin, M
    Yoo, CD
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 548 - 551
  • [40] Multi Attribute D-S Evidence Theory Based OCC for Shared-State Scheduling in Large Scale Cluster
    He, Libo
    Qiang, Zhenping
    Zhou, Wei
    Yao, Shaowen
    INTERNATIONAL JOURNAL OF ONLINE ENGINEERING, 2016, 12 (12) : 43 - 48