Building an English Speech Synthesis System from a Japanese ALS Patient's Voice

Authors
Iida, Akemi [1]
Ito, Jun [1]
Kajima, Shimpei [2]
Sugawara, Tsutomu [2]
Affiliations
[1] Tokyo Univ Technol, Tokyo, Japan
[2] Sophia Univ, Tokyo, Japan
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper reports on the development of an English speech synthesis system for a Japanese patient with amyotrophic lateral sclerosis (ALS), as part of a project to develop a bilingual communication aid for him. The patient underwent a tracheotomy three years ago and anticipates that he may lose his phonatory function. His English speech database for Festival, a free speech synthesis system, was generated from recordings of him reading a US diphone list. The recordings posed two problems: noise from the artificial ventilator and the patient's difficulty in pronouncing English. Although the speaker's English database was successfully built with Festvox and the synthesized voice was recognized as his own, the utterances were unintelligible. We therefore proposed reconstructing the patient's database by partially combining it with a native English speaker's database. The results suggest that the proposed approach is promising for people facing the same problem.
Pages: 1994 / +
Page count: 2