Phone-based speech synthesis with neural network and articulatory control

被引：0

作者：

Lo, WK

Ching, PC

机构：

来源：

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper presents a novel method for synthesizing speech signal using a phone-based concatenation approach. Neural network is employed for the generalization of the phone templates during synthesis. Simplified articulatory space input parameters based on a modified vowel diagram are used to provide flexible and effective articulatory control. It also enables the design of an articulatory control model for allophonic variations in speech signal. The network approach is chosen for its non-linear mapping of the relationship between the articulatory space parameters and the spectral information of speech signal. Sn addition, non-linear approximation for phone template transitions is facilitated. The phone templates of the synthesizer are implicitly stored as network parameters of a medium size network. The performance of this new speech synthesis technique is demonstrated with a prototype system specifically designed for Cantonese (a common Chinese dialect) and the synthetic speech quality is assessed by informal listening tests.

引用

页码：2227 / 2230

页数：4

共 50 条

[41] Articulatory speech synthesis based upon fluid dynamic principles
Huang, J
Levinson, S
Davis, D
Slimon, S
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 445 - 448
[42] Feature-Space Transform Tying in Unified Acoustic-Articulatory Modelling for Articulatory Control of HMM-based Speech Synthesis
Ling, Zhen-Hua
Richmond, Korin
Yamagishi, Junichi
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 124 - +
[43] Neural Speech Synthesis with Transformer Network
Li, Naihan
Liu, Shujie
Liu, Yanqing
Zhao, Sheng
Liu, Ming
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6706 - 6713
[44] Centerline articulatory models of the velum and epiglottis for articulatory synthesis of speech
Laprie, Yves
Elie, Benjamin
Tsukanova, Anastasiia
Vuissoz, Pierre-Andre
2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2110 - 2114
[45] An adaptive neural control scheme for articulatory synthesis of CV sequences
Huang, Guangpu
Er, Meng Joo
COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01): : 163 - 176
[46] ARTICULATORY FEATURES FOR EXPRESSIVE SPEECH SYNTHESIS
Black, Alan W.
Bunnell, H. Timothy
Dou, Ying
Muthukumar, Prasanna Kumar
Metze, Florian
Perry, Daniel
Polzehl, Tim
Prahallad, Kishore
Steidl, Stefan
Vaughn, Callie
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4005 - 4008
[47] On the Contribution of Articulatory Features to Speech Synthesis
Matura, Martin
Juzova, Marketa
Matousek, Jindrich
SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 398 - 407
[48] Cell phone-based mindfulness interventions for smoking cessation: Randominised control study
Tulucu, Fadime
CYPRUS TURKISH JOURNAL OF PSYCHIATRY AND PSYCHOLOGY, 2022, 4 (04): : 370 - 377
[49] A NEW NEURAL-NETWORK FOR ARTICULATORY SPEECH RECOGNITION AND ITS APPLICATION TO VOWEL IDENTIFICATION
ZACKS, J
THOMAS, TR
COMPUTER SPEECH AND LANGUAGE, 1994, 8 (03): : 189 - 209
[50] Yara launches phone-based nitrogen sensor
Bomgardner, Melody
CHEMICAL & ENGINEERING NEWS, 2019, 97 (11) : 13 - 13

← 1 2 3 4 5 →