Development of robotic voice conversion for RIBO using text-to-speech synthesis

被引：0

作者：

Hossain, Md. Jakir ^{[1
]}

Al Amin, Sayed Mahmud ^{[1
]}

Islam, Md. Saiful ^{[1
]}

Marium-E-Jannat ^{[1
]}

机构：

[1] Shahjalal Univ Sci & Technol Sylhet, Dept Comp Sci & Engn, Sylhet, Bangladesh

来源：

2018 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATION & COMMUNICATION TECHNOLOGY (ICEEICT) | 2018年

关键词：

TTS; RIBO; Diode; Ring modulator; VCA; Transformer; RF;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

RIBO is the first social interaction robot in Bangladesh. This robot is designed and developed by 'ROBO SUST' team of Shahjalal University of Science and Technology. RIBO is able to hands and eyes ups and downs, can walk very slowly and can speak some Bengali recorded sentences. Now the 'ROBO SUST' team is trying to develop the RIBO so that it can communicate with human. One of the parts to communicate with human is convert bengali text to bengali speech in robotic voice. In this article, we propose a method which will convert bengali text to speech in robotic voice using google text to speech system and ring modulator. There are existed some text to speech synthesizer system which can convert bengali text to bengali speech. Among these TTS synthesizer system google TTS system for bengali is better. Hence, we use google text to speech system to produce bengali speech from any bengali written text. Google TTS synthesizer system produces speech as audio object file which can be converted to .mp3 file. Then we modify this .mp3 file using the characteristics of diode and ring modulator concept to get machine voice. After changing pitch and speed of this machine voice we get our final robotic voice which will be used in RIBO as his voice.

引用

页码：422 / 425

页数：4

共 50 条

[1] Spectral voice conversion for text-to-speech synthesis
Kain, A
Macon, MW
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 285 - 288
[2] EMOTIONAL VOICE CONVERSION USING MULTITASK LEARNING WITH TEXT-TO-SPEECH
Kim, Tae-Ho
Cho, Sungjae
Choi, Shinkook
Park, Sejik
Lee, Soo-Young
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7774 - 7778
[3] Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion
Paul, Dipjyoti
Shifas, Muhammed P., V
Pantazis, Yannis
Stylianou, Yannis
INTERSPEECH 2020, 2020, : 1361 - 1365
[4] EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion
Miao, Chenfeng
Zhu, Qingying
Chen, Minchuan
Ma, Jun
Wang, Shaojun
Xiao, Jing
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1650 - 1661
[5] Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining
Huang, Wen-Chin
Hayashi, Tomoki
Wu, Yi-Chiao
Kameoka, Hirokazu
Toda, Tomoki
INTERSPEECH 2020, 2020, : 4676 - 4680
[6] A TEXT-TO-SPEECH CONVERSION SYSTEM
KLATT, DH
ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1982, 184 (SEP): : 11 - CINF
[7] TEXT-TO-SPEECH CONVERSION TECHNOLOGY
OMALLEY, MH
COMPUTER, 1990, 23 (08) : 17 - 23
[8] TEXT-TO-SPEECH SYNTHESIS
SPROAT, RW
OLIVE, JP
AT&T TECHNICAL JOURNAL, 1995, 74 (02): : 35 - 44
[9] Speech Modification for Prosody Conversion in Expressive Marathi Text-to-Speech Synthesis
Anil, Manjare Chandraprabha
Shirbahadurkar, S. D.
2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2014, : 56 - 58
[10] Development of Assamese Text-to-Speech Synthesis System
Sharma, Bidisha
Adiga, Nagaraj
Prasanna, S. R. Mahadeva
TENCON 2015 - 2015 IEEE REGION 10 CONFERENCE, 2015,

← 1 2 3 4 5 →