Phone-based speech synthesis with neural network and articulatory control

Authors
Lo, WK
Ching, PC
Source
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper presents a novel method for synthesizing speech signals using a phone-based concatenation approach. A neural network is employed to generalize the phone templates during synthesis. Simplified articulatory space input parameters based on a modified vowel diagram are used to provide flexible and effective articulatory control. This also enables the design of an articulatory control model for allophonic variations in the speech signal. The network approach is chosen for its non-linear mapping of the relationship between the articulatory space parameters and the spectral information of the speech signal. In addition, non-linear approximation of phone template transitions is facilitated. The phone templates of the synthesizer are implicitly stored as the network parameters of a medium-sized network. The performance of this new speech synthesis technique is demonstrated with a prototype system specifically designed for Cantonese (a common Chinese dialect), and the synthetic speech quality is assessed by informal listening tests.
Pages: 2227-2230
Page count: 4