EVALUATION OF LINEAR REGRESSION FOR SPEAKER ADAPTATION IN HMM-BASED ARTICULATORY MOVEMENTS ESTIMATION

被引：0

作者：

Li, Hao ^{[1
]}

Tao, Jianhua ^{[1
]}

Wang, Yang ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China

来源：

2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) | 2015年

基金：

中国国家自然科学基金; 中国国家社会科学基金;

关键词：

speaker adaptation; acoustic-to-articulatory inversion; maximum likelihood linear regression;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Acoustic-to-articulatory inversion problem is usually studied in speaker-specific manner because both articulatory data and acoustic features contain speaker-specific components. This paper presents our work on speaker-adaptation training for this problem. We implement speaker adaptation in HMM-based acoustic-to-articulatory inversion mapping, and evaluate different combinatorial structures of the articulatory data and acoustic features. The HMM-based inversion mapping models are built with single-stream and multistream, independent clustering and shared clustering structures. The speaker adaptation is implemented in stream-independent structure and shared adaptation structure. The constrained maximum likelihood linear regression method is used for the speaker-adaptive transformation. The experimental results show that the sharing of the speaker-adaptive transformation of the articulatory feature stream and acoustic feature stream can improve the estimation accuracy in inversion mapping. The multi-stream system with shared clustering and shared adaptive transformation has the best result among all the tested structures.

引用

页码：4944 / 4948

页数：5

共 50 条

[1] An Analysis of HMM-based prediction of articulatory movements
Ling, Zhen-Hua
Richmond, Korin
Yamagishi, Junichi
[J]. SPEECH COMMUNICATION, 2010, 52 (10) : 834 - 846
[2] Speaker Adaptation using Nonlinear Regression Techniques for HMM-based Speech Synthesis
Hong, Doo Hwa
Kang, Shin Jae
Lee, Joun Yeop
Kim, Nam Soo
[J]. 2014 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2014), 2014, : 586 - 589
[3] Speaker Adaptation using Relevance Vector Regression for HMM-based Expressive TTS
Hong, Doo Hwa
Lee, Joun Yeop
Jang, Se Young
Kim, Nam Soo
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1216 - 1220
[4] Speaker adaptation method for acoustic-to-articulatory inversion using an HMM-based speech production model
Hiroya, Sadao
Honda, Masaaki
[J]. IEICE Transactions on Information and Systems, 2004, E87-D (05) : 1071 - 1078
[5] Speaker adaptation method for acoustic-to-articulatory inversion using an HMM-based speech production model
Hiroya, S
Honda, M
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (05): : 1071 - 1078
[6] Estimation of articulatory movements from speech acoustics using an HMM-based speech production model
Hiroya, S
Honda, M
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (02): : 175 - 185
[7] Frequency Warping for Speaker Adaptation in HMM-based Speech Synthesis
Gao, Weixun
Cao, Qiying
[J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2014, 30 (04) : 1149 - 1166
[8] Speaker adaptation of pitch and spectrum for HMM-based speech synthesis
[J]. Tamura, M., 1600, John Wiley and Sons Inc. (35):
[9] SPEAKER SIMILARITY EVALUATION OF FOREIGN-ACCENTED SPEECH SYNTHESIS USING HMM-BASED SPEAKER ADAPTATION
Wester, Mirjam
Karhila, Reima
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5372 - 5375
[10] Minimum generation error linear regression based model adaptation for HMM-based speech synthesis
Qin, Long
Wu, Yi-Jian
Ling, Zhen-Hua
Wang, Ren-Hua
Da, Li-Rong
[J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 3953 - +

← 1 2 3 4 5 →