EVALUATION OF LINEAR REGRESSION FOR SPEAKER ADAPTATION IN HMM-BASED ARTICULATORY MOVEMENTS ESTIMATION

Times cited: 0
Authors
Li, Hao [1]
Tao, Jianhua [1]
Wang, Yang [1]
Affiliations
[1] Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, People's Republic of China
Funding
National Natural Science Foundation of China; National Social Science Fund of China
Keywords
speaker adaptation; acoustic-to-articulatory inversion; maximum likelihood linear regression
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
The acoustic-to-articulatory inversion problem is usually studied in a speaker-specific manner because both articulatory data and acoustic features contain speaker-specific components. This paper presents our work on speaker adaptation training for this problem. We implement speaker adaptation in HMM-based acoustic-to-articulatory inversion mapping and evaluate different structures for combining the articulatory data and acoustic features. The HMM-based inversion mapping models are built with single-stream and multi-stream structures, using either independent or shared clustering. Speaker adaptation is implemented with either a stream-independent structure or a shared adaptation structure. The constrained maximum likelihood linear regression (CMLLR) method is used for the speaker-adaptive transformation. The experimental results show that sharing the speaker-adaptive transformation between the articulatory feature stream and the acoustic feature stream can improve the estimation accuracy in inversion mapping. The multi-stream system with shared clustering and shared adaptive transformation gives the best result among all the tested structures.
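As a rough illustration of the constrained maximum likelihood linear regression (CMLLR) idea named in the abstract, the sketch below applies an affine transform to a feature frame and checks that scoring the transformed frame (plus a log-Jacobian term) matches scoring the original frame against a correspondingly transformed Gaussian. It is a simplified sketch, not the paper's implementation: the transform is restricted to a diagonal matrix, the transform is assumed given rather than estimated, and all function names and numeric values are hypothetical. The abstract does not specify how the transforms are structured across streams, so none of that is modeled here.

```python
import math


def gaussian_logpdf(x, mean, var):
    """Log density of a univariate Gaussian with the given mean and variance."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def cmllr_feature_transform(frame, scale, bias):
    """Apply a diagonal affine transform: o_hat[i] = scale[i] * o[i] + bias[i]."""
    return [a * o + b for o, a, b in zip(frame, scale, bias)]


def feature_space_loglik(frame, mean, var, scale, bias):
    """Score the transformed frame under the unadapted diagonal Gaussian.

    The log|det A| term (here a sum of log|scale[i]|) keeps the result a
    proper log density over the original, untransformed frame.
    """
    o_hat = cmllr_feature_transform(frame, scale, bias)
    loglik = sum(gaussian_logpdf(x, m, v) for x, m, v in zip(o_hat, mean, var))
    return loglik + sum(math.log(abs(a)) for a in scale)


def model_space_loglik(frame, mean, var, scale, bias):
    """Score the original frame under the Gaussian with transformed parameters.

    For a diagonal transform the equivalent model-space update is
    mean_hat = (mean - bias) / scale and var_hat = var / scale**2.
    """
    return sum(
        gaussian_logpdf(o, (m - b) / a, v / a ** 2)
        for o, m, v, a, b in zip(frame, mean, var, scale, bias)
    )


# Hypothetical 4-dimensional frame: two acoustic and two articulatory dimensions.
frame = [0.3, -1.2, 2.5, 0.7]
mean = [0.0, -1.0, 2.0, 1.0]
var = [1.0, 0.5, 2.0, 0.25]
scale = [1.1, 0.9, 1.3, 0.8]   # hypothetical diagonal of A
bias = [0.05, -0.1, 0.2, 0.0]  # hypothetical bias vector b

ll_feature = feature_space_loglik(frame, mean, var, scale, bias)
ll_model = model_space_loglik(frame, mean, var, scale, bias)
```

The two scores agree up to floating-point error, which is why a constrained transform can be applied on the feature side without touching the model parameters.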
Pages: 4944-4948
Number of pages: 5