A mandarin text-to-speech technique implemented on a PIC-based microcontroller platform

被引：0

作者：

Yeh, Cheng-Yu ^{[1
]}

Chang, Chih-Hsuan ^{[1
]}

机构：

[1] Natl Chin Yi Univ Technol, Dept Elect Engn, Taichung 41170, Taiwan

来源：

IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING | 2016年 / 11卷

关键词：

text-to-speech; real-time embedded system; microcontroller; recurrent neural network; pitch-synchronous overlap-add;

D O I：

10.1002/tee.22327

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, a Mandarin text-to-speech (TTS) technique is employed to achieve the implementation of a voiced E-book on the PIC-based embedded platform. A transformation from the text of E-book to the corresponding speech can help blind users and make the reading more effortless and relaxed. Both the microcontroller with a PIC32 Ethernet Starter Kit (80 MHz, 32-bit, 128 kB SRAM, 512 kB Flash) and the Multimedia Expansion Board designed by Microchip Technology Inc. are adopted as the embedded platform. Four subsystems, namely text analysis, a recurrent neural network-based prosodic generator, a synthesis unit generator with 411 Chinese syllabic waveforms, and a pitch-synchronous overlap-add-based speech synthesizer, are made in the Mandarin TTS system and are implemented with C programming language. Experimental results find that a system requirement of 1.66 MB storage memory and less than 25.4 kB runtime memory, as well as 21.3% CPU runtime, is sufficient for real-time operation such that a natural and fluent speech with a 16-bit PCM at 8 kHz sampling rate is provided. The performance of the PIC-based Mandarin TTS system is demonstrated to be good. (c) 2016 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc.

引用

页码：S60 / S64

页数：5

共 50 条

[1] A PIC-based microcontroller design laboratory
Hamad, M.
Kassem, A.
Jabr, R. A.
Bechara, C.
Khattar, M.
6TH INTERNATIONAL WORKSHOP ON SYSTEM-ON-CHIP FOR REAL-TIME APPLICATIONS, PROCEEDINGS, 2006, : 66 - +
[2] A Mandarin text-to-speech system
Hwang, SH
Chen, SH
Wang, YR
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1421 - 1424
[3] Text normalization in mandarin Text-to-Speech system
Jia, Yuxiang
Huang, Dezhi
Liu, Wu
Dong, Yuan
Yu, Shiwen
Wang, Haila
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4693 - +
[4] A Prosodic Mandarin Text-to-Speech System Based on Tacotron
Zhang, Chuxiong
Zhang, Sheng
Zhong, Haibing
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 165 - 169
[5] Pitch models of Mandarin text-to-speech
邵艳秋
穗志方
韩纪庆
Journal of Harbin Institute of Technology, 2009, 16 (02) : 179 - 184
[6] Pitch models of Mandarin text-to-speech
邵艳秋
穗志方
韩纪庆
Journal of Harbin Institute of Technology(New series), 2009, 16 (02) : 179 - 184
[7] An HMM-based Mandarin Chinese Text-to-Speech system
Qian, Yao
Soong, Frank
Chen, Yining
Chu, Min
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 223 - +
[8] Hierarchical Stress Modeling in Mandarin Text-to-Speech
Li, Ya
Tao, Jianhua
Xu, Xiaoying
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2024 - +
[9] Prosody model in a Mandarin Text-to-Speech System based on a hierarchical approach
Pan, NH
Jen, WT
Yu, SS
Yu, MS
Huang, SY
Wu, MJ
2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 448 - 451
[10] An RNN-based prosodic information synthesizer for Mandarin text-to-speech
Chen, SH
Hwang, SH
Wang, YR
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (03): : 226 - 239

← 1 2 3 4 5 →