Phone-based speech synthesis with neural network and articulatory control

被引:0
|
作者
Lo, WK
Ching, PC
机构
来源
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a novel method for synthesizing speech signal using a phone-based concatenation approach. Neural network is employed for the generalization of the phone templates during synthesis. Simplified articulatory space input parameters based on a modified vowel diagram are used to provide flexible and effective articulatory control. It also enables the design of an articulatory control model for allophonic variations in speech signal. The network approach is chosen for its non-linear mapping of the relationship between the articulatory space parameters and the spectral information of speech signal. Sn addition, non-linear approximation for phone template transitions is facilitated. The phone templates of the synthesizer are implicitly stored as network parameters of a medium size network. The performance of this new speech synthesis technique is demonstrated with a prototype system specifically designed for Cantonese (a common Chinese dialect) and the synthetic speech quality is assessed by informal listening tests.
引用
收藏
页码:2227 / 2230
页数:4
相关论文
共 50 条
  • [41] Articulatory speech synthesis based upon fluid dynamic principles
    Huang, J
    Levinson, S
    Davis, D
    Slimon, S
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 445 - 448
  • [42] Feature-Space Transform Tying in Unified Acoustic-Articulatory Modelling for Articulatory Control of HMM-based Speech Synthesis
    Ling, Zhen-Hua
    Richmond, Korin
    Yamagishi, Junichi
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 124 - +
  • [43] Neural Speech Synthesis with Transformer Network
    Li, Naihan
    Liu, Shujie
    Liu, Yanqing
    Zhao, Sheng
    Liu, Ming
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6706 - 6713
  • [44] Centerline articulatory models of the velum and epiglottis for articulatory synthesis of speech
    Laprie, Yves
    Elie, Benjamin
    Tsukanova, Anastasiia
    Vuissoz, Pierre-Andre
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2110 - 2114
  • [45] An adaptive neural control scheme for articulatory synthesis of CV sequences
    Huang, Guangpu
    Er, Meng Joo
    COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01): : 163 - 176
  • [46] ARTICULATORY FEATURES FOR EXPRESSIVE SPEECH SYNTHESIS
    Black, Alan W.
    Bunnell, H. Timothy
    Dou, Ying
    Muthukumar, Prasanna Kumar
    Metze, Florian
    Perry, Daniel
    Polzehl, Tim
    Prahallad, Kishore
    Steidl, Stefan
    Vaughn, Callie
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4005 - 4008
  • [47] On the Contribution of Articulatory Features to Speech Synthesis
    Matura, Martin
    Juzova, Marketa
    Matousek, Jindrich
    SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 398 - 407
  • [48] Cell phone-based mindfulness interventions for smoking cessation: Randominised control study
    Tulucu, Fadime
    CYPRUS TURKISH JOURNAL OF PSYCHIATRY AND PSYCHOLOGY, 2022, 4 (04): : 370 - 377
  • [49] A NEW NEURAL-NETWORK FOR ARTICULATORY SPEECH RECOGNITION AND ITS APPLICATION TO VOWEL IDENTIFICATION
    ZACKS, J
    THOMAS, TR
    COMPUTER SPEECH AND LANGUAGE, 1994, 8 (03): : 189 - 209
  • [50] Yara launches phone-based nitrogen sensor
    Bomgardner, Melody
    CHEMICAL & ENGINEERING NEWS, 2019, 97 (11) : 13 - 13