Speech-to-Speech Conversion: An Approach to Enhance the Speech Intelligibility of Dysarthric Speaker

被引:1
|
作者
Janai, Siddhanna [1 ]
Shreekanth, T. [2 ]
Chandan, M. [3 ]
Abraham, Ajish K. [4 ]
机构
[1] Maharaja Inst Technol, Mysore, Karnataka, India
[2] L&T Technol Serv, Vadodara, India
[3] JSS Sci & Technol Univ, Mysuru, India
[4] All India Inst Speech & Hearing, Mysore, Karnataka, India
关键词
All India Institute of Speech and Hearing (AIISH); Artificial Neural Networks(ANN); Automatic Speech Recognition (ASR); Linear Predictive Coding (LPC); Speech-to-Speech Conversion (STSC); Text-to-Speech (TTS); RECOGNITION;
D O I
10.4018/IJACI.2021010108
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A novel approach to build a speech-to-speech conversion (STSC) system for individuals with speech impairment dysarthria is described. STSC system takes impaired speech having inherent disturbance as input and produces a synthesized output speech with good pronunciation and noise free utterance. The STSC system involves two stages, namely automatic speech recognition (ASR) and automatic speech synthesis. ASR transforms speech into text, while automatic speech synthesis (or text-to-speech [TTS]) performs the reverse task. At present, the recognition system is developed for a small vocabulary of 50 words and the accuracy of 94% is achieved for normal speakers and 88% for speakers with dysarthria. The output speech of TTS system has achieved a MOS value of 4.5 out of 5 as obtained by averaging the response of 20 listeners. This method of STSC would be an augmentative and alternative communication aid for speakers with dysarthria.
引用
收藏
页码:184 / 206
页数:23
相关论文
共 50 条
  • [1] The influence of speaker and listener variables on intelligibility of dysarthric speech
    Patel, Rupal
    Usher, Nicole
    Kember, Heather
    Russell, Scott
    Laures-Gore, Jacqueline
    [J]. JOURNAL OF COMMUNICATION DISORDERS, 2014, 51 : 13 - 18
  • [2] Automatic speaker independent dysarthric speech intelligibility assessment system
    Tripathi, Ayush
    Bhosale, Swapnil
    Kopparapu, Sunil Kumar
    [J]. COMPUTER SPEECH AND LANGUAGE, 2021, 69
  • [3] INTELLIGIBILITY MEASURES OF DYSARTHRIC SPEECH
    TIKOFSKY, RS
    TIKOFSKY, RP
    [J]. JOURNAL OF SPEECH AND HEARING RESEARCH, 1964, 7 (04): : 325 - 333
  • [4] A STUDY OF INTELLIGIBILITY OF DYSARTHRIC SPEECH
    TIKOFSKY, RS
    LEHISTE, I
    TIKOFSKY, RP
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1961, 33 (11): : 1677 - &
  • [5] Intelligibility of modifications to dysarthric speech
    Hosom, JP
    Kain, AB
    Mishra, T
    van Santen, JPH
    Fried-Oken, M
    Staehely, J
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 924 - 927
  • [6] Improving the intelligibility of dysarthric speech
    Kain, Alexander B.
    Hosom, John-Paul
    Niu, Xiaochuan
    van Santen, Jan P. H.
    Fried-Oken, Melanie
    Staehely, Janice
    [J]. SPEECH COMMUNICATION, 2007, 49 (09) : 743 - 759
  • [7] Speech rate effects upon intelligibility and acceptability of dysarthric speech
    Dagenais, PA
    Brown, GR
    Moore, RE
    [J]. CLINICAL LINGUISTICS & PHONETICS, 2006, 20 (2-3) : 141 - 148
  • [8] Dysarthric speech: A comparison of computerized speech recognition and listener intelligibility
    Doyle, PC
    Leeper, HA
    Kotler, AL
    ThomasStonell, N
    ONeill, C
    Dylke, MC
    Rolls, K
    [J]. JOURNAL OF REHABILITATION RESEARCH AND DEVELOPMENT, 1997, 34 (03): : 309 - 316
  • [9] The Effect of Rate Control on Speech Rate and Intelligibility of Dysarthric Speech
    Van Nuffelen, Gwen
    De Bodt, Marc
    Wuyts, Floris
    Van de Heyning, Paul
    [J]. FOLIA PHONIATRICA ET LOGOPAEDICA, 2009, 61 (02) : 69 - 75
  • [10] Speaker-Adaptive Speech Synthesis Based on Eigenvoice Conversion and Language-Dependent Prosodic Conversion in Speech-to-Speech Translation
    Hattori, Nobuhiko
    Toda, Tomoki
    Kawai, Hisashi
    Saruwatari, Hiroshi
    Shikano, Kiyohiro
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2780 - +