Phone-based speech synthesis with neural network and articulatory control

被引：0

作者：

Lo, WK

Ching, PC

机构：

来源：

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper presents a novel method for synthesizing speech signal using a phone-based concatenation approach. Neural network is employed for the generalization of the phone templates during synthesis. Simplified articulatory space input parameters based on a modified vowel diagram are used to provide flexible and effective articulatory control. It also enables the design of an articulatory control model for allophonic variations in speech signal. The network approach is chosen for its non-linear mapping of the relationship between the articulatory space parameters and the spectral information of speech signal. Sn addition, non-linear approximation for phone template transitions is facilitated. The phone templates of the synthesizer are implicitly stored as network parameters of a medium size network. The performance of this new speech synthesis technique is demonstrated with a prototype system specifically designed for Cantonese (a common Chinese dialect) and the synthetic speech quality is assessed by informal listening tests.

引用

页码：2227 / 2230

页数：4

共 50 条

[21] Improvement in speech recognition using phone-based filter and sum parameter optimization
Kouhi-Jelehkarana, Bahram
Bakhshi, Hamidreza
Razzazi, Farbod
IEICE ELECTRONICS EXPRESS, 2009, 6 (08): : 437 - 442
[22] An investigation of phone-based subword units for end-to-end speech recognition
Wang, Weiran
Wang, Guangsen
Bhatnagar, Aadyot
Zhou, Yingbo
Xiong, Caiming
Socher, Richard
INTERSPEECH 2020, 2020, : 1778 - 1782
[23] Articulatory Text-to-Speech Synthesis using the Digital Waveguide Mesh driven by a Deep Neural Network
Gully, Amelia J.
Yoshimura, Takenori
Murphy, Damian T.
Hashimoto, Kei
Nankaku, Yoshihiko
Tokuda, Keiichi
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 234 - 238
[24] Control of an Articulatory Speech Synthesizer based on Dynamic Approximation of Spatial Articulatory Targets
Birkholz, Peter
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 629 - 632
[25] Phone-based diagnostics dials in
Gefvert, Barbara
LASER FOCUS WORLD, 2015, 51 (08): : 57 - 57
[26] Evaluation of a mobile phone-based diet game for weight control
Lee, Wonbok
Chae, Young Moon
Kim, Sukil
Ho, Seung Hee
Choi, Inyoung
JOURNAL OF TELEMEDICINE AND TELECARE, 2010, 16 (05) : 270 - 275
[27] Tibetan speech synthesis based on an improved neural network
Ding, Yuntao
Cai, Rangzhuoma
Gong, Baojia
2020 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE COMMUNICATION AND NETWORK SECURITY (CSCNS2020), 2021, 336
[28] A smart phone-based personal area network for remote monitoring of biosignals
Moron, M. J.
Luque, J. R.
Botella, A. A.
Cuberos, E. J.
Casilari, E.
Diaz-Estrella, A.
4TH INTERNATIONAL WORKSHOP ON WEARABLE AND IMPLANTABLE BODY SENSOR NETWORKS (BSN 2007), 2007, 13 : 116 - +
[29] Articulatory Control of HMM-based Parametric Speech Synthesis Driven by Phonetic Knowledge
Ling, Zhen-Hua
Richmond, Korin
Yamagishi, Junichi
Wang, Ren-Hua
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 573 - +
[30] A synthesis method based on speech production and articulatory model
YU Zhenli (Dept. of Information and Electronic Engineering
ChineseJournalofAcoustics, 2000, (02) : 128 - 141

← 1 2 3 4 5 →