Development of robotic voice conversion for RIBO using text-to-speech synthesis

被引:0
|
作者
Hossain, Md. Jakir [1 ]
Al Amin, Sayed Mahmud [1 ]
Islam, Md. Saiful [1 ]
Marium-E-Jannat [1 ]
机构
[1] Shahjalal Univ Sci & Technol Sylhet, Dept Comp Sci & Engn, Sylhet, Bangladesh
关键词
TTS; RIBO; Diode; Ring modulator; VCA; Transformer; RF;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
RIBO is the first social interaction robot in Bangladesh. This robot is designed and developed by 'ROBO SUST' team of Shahjalal University of Science and Technology. RIBO is able to hands and eyes ups and downs, can walk very slowly and can speak some Bengali recorded sentences. Now the 'ROBO SUST' team is trying to develop the RIBO so that it can communicate with human. One of the parts to communicate with human is convert bengali text to bengali speech in robotic voice. In this article, we propose a method which will convert bengali text to speech in robotic voice using google text to speech system and ring modulator. There are existed some text to speech synthesizer system which can convert bengali text to bengali speech. Among these TTS synthesizer system google TTS system for bengali is better. Hence, we use google text to speech system to produce bengali speech from any bengali written text. Google TTS synthesizer system produces speech as audio object file which can be converted to .mp3 file. Then we modify this .mp3 file using the characteristics of diode and ring modulator concept to get machine voice. After changing pitch and speed of this machine voice we get our final robotic voice which will be used in RIBO as his voice.
引用
收藏
页码:422 / 425
页数:4
相关论文
共 50 条
  • [1] Spectral voice conversion for text-to-speech synthesis
    Kain, A
    Macon, MW
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 285 - 288
  • [2] EMOTIONAL VOICE CONVERSION USING MULTITASK LEARNING WITH TEXT-TO-SPEECH
    Kim, Tae-Ho
    Cho, Sungjae
    Choi, Shinkook
    Park, Sejik
    Lee, Soo-Young
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7774 - 7778
  • [3] Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion
    Paul, Dipjyoti
    Shifas, Muhammed P., V
    Pantazis, Yannis
    Stylianou, Yannis
    INTERSPEECH 2020, 2020, : 1361 - 1365
  • [4] EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion
    Miao, Chenfeng
    Zhu, Qingying
    Chen, Minchuan
    Ma, Jun
    Wang, Shaojun
    Xiao, Jing
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1650 - 1661
  • [5] Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining
    Huang, Wen-Chin
    Hayashi, Tomoki
    Wu, Yi-Chiao
    Kameoka, Hirokazu
    Toda, Tomoki
    INTERSPEECH 2020, 2020, : 4676 - 4680
  • [6] A TEXT-TO-SPEECH CONVERSION SYSTEM
    KLATT, DH
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1982, 184 (SEP): : 11 - CINF
  • [7] TEXT-TO-SPEECH CONVERSION TECHNOLOGY
    OMALLEY, MH
    COMPUTER, 1990, 23 (08) : 17 - 23
  • [8] TEXT-TO-SPEECH SYNTHESIS
    SPROAT, RW
    OLIVE, JP
    AT&T TECHNICAL JOURNAL, 1995, 74 (02): : 35 - 44
  • [9] Speech Modification for Prosody Conversion in Expressive Marathi Text-to-Speech Synthesis
    Anil, Manjare Chandraprabha
    Shirbahadurkar, S. D.
    2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2014, : 56 - 58
  • [10] Development of Assamese Text-to-Speech Synthesis System
    Sharma, Bidisha
    Adiga, Nagaraj
    Prasanna, S. R. Mahadeva
    TENCON 2015 - 2015 IEEE REGION 10 CONFERENCE, 2015,