Phone-based speech synthesis with neural network and articulatory control

被引:0
|
作者
Lo, WK
Ching, PC
机构
来源
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a novel method for synthesizing speech signal using a phone-based concatenation approach. Neural network is employed for the generalization of the phone templates during synthesis. Simplified articulatory space input parameters based on a modified vowel diagram are used to provide flexible and effective articulatory control. It also enables the design of an articulatory control model for allophonic variations in speech signal. The network approach is chosen for its non-linear mapping of the relationship between the articulatory space parameters and the spectral information of speech signal. Sn addition, non-linear approximation for phone template transitions is facilitated. The phone templates of the synthesizer are implicitly stored as network parameters of a medium size network. The performance of this new speech synthesis technique is demonstrated with a prototype system specifically designed for Cantonese (a common Chinese dialect) and the synthetic speech quality is assessed by informal listening tests.
引用
收藏
页码:2227 / 2230
页数:4
相关论文
共 50 条
  • [21] Improvement in speech recognition using phone-based filter and sum parameter optimization
    Kouhi-Jelehkarana, Bahram
    Bakhshi, Hamidreza
    Razzazi, Farbod
    IEICE ELECTRONICS EXPRESS, 2009, 6 (08): : 437 - 442
  • [22] An investigation of phone-based subword units for end-to-end speech recognition
    Wang, Weiran
    Wang, Guangsen
    Bhatnagar, Aadyot
    Zhou, Yingbo
    Xiong, Caiming
    Socher, Richard
    INTERSPEECH 2020, 2020, : 1778 - 1782
  • [23] Articulatory Text-to-Speech Synthesis using the Digital Waveguide Mesh driven by a Deep Neural Network
    Gully, Amelia J.
    Yoshimura, Takenori
    Murphy, Damian T.
    Hashimoto, Kei
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 234 - 238
  • [24] Control of an Articulatory Speech Synthesizer based on Dynamic Approximation of Spatial Articulatory Targets
    Birkholz, Peter
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 629 - 632
  • [25] Phone-based diagnostics dials in
    Gefvert, Barbara
    LASER FOCUS WORLD, 2015, 51 (08): : 57 - 57
  • [26] Evaluation of a mobile phone-based diet game for weight control
    Lee, Wonbok
    Chae, Young Moon
    Kim, Sukil
    Ho, Seung Hee
    Choi, Inyoung
    JOURNAL OF TELEMEDICINE AND TELECARE, 2010, 16 (05) : 270 - 275
  • [27] Tibetan speech synthesis based on an improved neural network
    Ding, Yuntao
    Cai, Rangzhuoma
    Gong, Baojia
    2020 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE COMMUNICATION AND NETWORK SECURITY (CSCNS2020), 2021, 336
  • [28] A smart phone-based personal area network for remote monitoring of biosignals
    Moron, M. J.
    Luque, J. R.
    Botella, A. A.
    Cuberos, E. J.
    Casilari, E.
    Diaz-Estrella, A.
    4TH INTERNATIONAL WORKSHOP ON WEARABLE AND IMPLANTABLE BODY SENSOR NETWORKS (BSN 2007), 2007, 13 : 116 - +
  • [29] Articulatory Control of HMM-based Parametric Speech Synthesis Driven by Phonetic Knowledge
    Ling, Zhen-Hua
    Richmond, Korin
    Yamagishi, Junichi
    Wang, Ren-Hua
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 573 - +
  • [30] A synthesis method based on speech production and articulatory model
    YU Zhenli (Dept. of Information and Electronic Engineering
    ChineseJournalofAcoustics, 2000, (02) : 128 - 141