Phone-based speech synthesis with neural network and articulatory control

被引:0
|
作者
Lo, WK
Ching, PC
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a novel method for synthesizing speech signal using a phone-based concatenation approach. Neural network is employed for the generalization of the phone templates during synthesis. Simplified articulatory space input parameters based on a modified vowel diagram are used to provide flexible and effective articulatory control. It also enables the design of an articulatory control model for allophonic variations in speech signal. The network approach is chosen for its non-linear mapping of the relationship between the articulatory space parameters and the spectral information of speech signal. Sn addition, non-linear approximation for phone template transitions is facilitated. The phone templates of the synthesizer are implicitly stored as network parameters of a medium size network. The performance of this new speech synthesis technique is demonstrated with a prototype system specifically designed for Cantonese (a common Chinese dialect) and the synthetic speech quality is assessed by informal listening tests.
引用
收藏
页码:2227 / 2230
页数:4
相关论文
共 50 条
  • [1] Continuous Articulatory-to-Acoustic Mapping using Phone-based Trajectory HMM for a Silent Speech Interface
    Hueber, Thomas
    Bailly, Gerard
    Denby, Bruce
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 722 - 725
  • [2] Automatic Detection of Phone-Based Anomalies in Dysarthric Speech
    Laaridh, Imed
    Fredouille, Corinne
    Meunier, Christine
    ACM TRANSACTIONS ON ACCESSIBLE COMPUTING, 2015, 6 (03)
  • [3] A PHONE-BASED APPROACH TO NONLINGUISTIC SPEECH FEATURE IDENTIFICATION
    LAMEL, LF
    GAUVAIN, JL
    COMPUTER SPEECH AND LANGUAGE, 1995, 9 (01): : 87 - 103
  • [4] Effect of Articulatory Δ and ΔΔ Parameters on Multilayer Neural Network based Speech Recognition
    Banik, Manoj
    Kotwal, Mohammed Rokibul Alam
    Hassan, Foyzul
    Islam, Gazi Md. Moshfiqul
    Rahman, Sharif Mohammad Musfiqur
    Hasan, Mohammad Mahedi
    Muhammad, Ghulam
    Huda, Mohammad Nurul
    PROCEEDINGS OF THE 2010 IEEE ASIA PACIFIC CONFERENCE ON CIRCUIT AND SYSTEM (APCCAS), 2010, : 624 - 627
  • [5] ON THE USE OF NEURAL NETWORKS IN ARTICULATORY SPEECH SYNTHESIS
    RAHIM, MG
    GOODYEAR, CC
    KLEIJN, WB
    SCHROETER, J
    SONDHI, MM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1993, 93 (02): : 1109 - 1121
  • [6] Neural network based autonomous control of a speech synthesis system
    Panagiotopoulos, Dimokritos
    Orovas, Christos
    Syndoukas, Dimitrios
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2022, 14
  • [7] A gesture-based concept for speech movement control in articulatory speech synthesis
    Kroeger, Berrid J.
    Birkholz, Peter
    VERBAL AND NONVERBAL COMMUNICATION BEHAVIOURS, 2007, 4775 : 174 - +
  • [8] Human Activity Recognition Using Cell Phone-Based Accelerometer and Convolutional Neural Network
    Prasad, Ashwani
    Tyagi, Amit Kumar
    Althobaiti, Maha M.
    Almulihi, Ahmed
    Mansour, Romany F.
    Mahmoud, Ayman M.
    APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [9] Evaluation of a Phone-Based Anomaly Detection Approach for Dysarthric Speech
    Laaridh, Imed
    Fredouille, Corinne
    Meunier, Christine
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 223 - 227
  • [10] Deep Neural Network Based Acoustic-to-articulatory Inversion Using Phone Sequence Information
    Xie, Xurong
    Liu, Xunying
    Wang, Lan
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1497 - 1501