Using HMM-based Speech Synthesis to Reconstruct the Voice of Individuals with Degenerative Speech Disorders

Cited by: 0

Authors:

Veaux, Christophe [1]

Yamagishi, Junichi [1]

King, Simon [1]

Affiliation:

[1] Univ Edinburgh, CSTR, Edinburgh EH8 9YL, Midlothian, Scotland

Source:

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012

Keywords:

HTS; Voice Cloning; Voice Reconstruction; Assistive Technologies

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

When individuals lose the ability to produce their own speech due to degenerative diseases such as motor neuron disease (MND) or Parkinson's, they lose not only a functional means of communication but also a display of their individual and group identity. In order to build personalized synthetic voices, attempts have been made to capture the voice before it is lost, using a process known as voice banking. For some patients, however, speech deterioration frequently coincides with or quickly follows diagnosis. Using HMM-based speech synthesis, it is now possible to build personalized synthetic voices from minimal recordings and even from disordered speech. In this approach, the patient's recordings are used to adapt an average voice model pre-trained on many speakers. The structure of the voice model allows some reconstruction of the voice by substituting components from the average voice in order to compensate for the disorders found in the patient's speech. In this paper, we compare different substitution strategies and introduce a context-dependent model substitution to improve the intelligibility of the synthetic speech while retaining the vocal identity of the patient. A subjective evaluation of the reconstructed voice for a patient with MND shows promising results for this strategy.


Pages: 966-969

Number of pages: 4
