Building an English Speech Synthesis System from a Japanese ALS Patient's Voice

Cited by: 0

Authors:

Iida, Akemi [1]

Ito, Jun [1]

Kajima, Shimpei [2]

Sugawara, Tsutomu [2]

Affiliations:

[1] Tokyo Univ Technol, Tokyo, Japan

[2] Sophia Univ, Tokyo, Japan

Source:

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper reports on the development of an English speech synthesis system for a Japanese patient with amyotrophic lateral sclerosis (ALS), as part of a project to develop a bilingual communication aid for him. The patient had a tracheotomy three years ago and anticipates the possibility of losing his phonatory function. His English speech database for Festival, a free speech synthesis system, was generated from his reading of a US diphone list. There were two problems with the recording: the noise made by the artificial ventilator, and his difficulty in pronouncing English. Although the speaker's English database was successfully built with Festvox and the voice was recognized as his own, the synthesized utterances were unintelligible. We therefore proposed reconstructing the patient's database by partially combining it with a native English speaker's database. Results showed that the proposed approach can be promising for those facing this problem.

Pages: 1994 / +

Page count: 2
