Phone-based speech synthesis with neural network and articulatory control

Cited by: 0

Authors:

Lo, WK

Ching, PC

Affiliation:

Source:

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996

Keywords:

DOI:

Not available

Chinese Library Classification number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

This paper presents a novel method for synthesizing speech signals using a phone-based concatenation approach. A neural network is employed to generalize the phone templates during synthesis. Simplified articulatory space input parameters, based on a modified vowel diagram, are used to provide flexible and effective articulatory control. This also enables the design of an articulatory control model for allophonic variations in the speech signal. The network approach is chosen for its non-linear mapping of the relationship between the articulatory space parameters and the spectral information of the speech signal. In addition, non-linear approximation of phone template transitions is facilitated. The phone templates of the synthesizer are stored implicitly as the parameters of a medium-sized network. The performance of this new speech synthesis technique is demonstrated with a prototype system designed specifically for Cantonese (a common Chinese dialect), and the synthetic speech quality is assessed by informal listening tests.


Pages: 2227-2230

Number of pages: 4

50 items in total

[1] Continuous Articulatory-to-Acoustic Mapping using Phone-based Trajectory HMM for a Silent Speech Interface
Hueber, Thomas
Bailly, Gerard
Denby, Bruce
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012: 722 - 725
[2] Automatic Detection of Phone-Based Anomalies in Dysarthric Speech
Laaridh, Imed
Fredouille, Corinne
Meunier, Christine
ACM TRANSACTIONS ON ACCESSIBLE COMPUTING, 2015, 6 (03)
[3] A PHONE-BASED APPROACH TO NONLINGUISTIC SPEECH FEATURE IDENTIFICATION
LAMEL, LF
GAUVAIN, JL
COMPUTER SPEECH AND LANGUAGE, 1995, 9 (01): 87 - 103
[4] Effect of Articulatory Δ and ΔΔ Parameters on Multilayer Neural Network based Speech Recognition
Banik, Manoj
Kotwal, Mohammed Rokibul Alam
Hassan, Foyzul
Islam, Gazi Md. Moshfiqul
Rahman, Sharif Mohammad Musfiqur
Hasan, Mohammad Mahedi
Muhammad, Ghulam
Huda, Mohammad Nurul
PROCEEDINGS OF THE 2010 IEEE ASIA PACIFIC CONFERENCE ON CIRCUIT AND SYSTEM (APCCAS), 2010: 624 - 627
[5] ON THE USE OF NEURAL NETWORKS IN ARTICULATORY SPEECH SYNTHESIS
RAHIM, MG
GOODYEAR, CC
KLEIJN, WB
SCHROETER, J
SONDHI, MM
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1993, 93 (02): 1109 - 1121
[6] Neural network based autonomous control of a speech synthesis system
Panagiotopoulos, Dimokritos
Orovas, Christos
Syndoukas, Dimitrios
INTELLIGENT SYSTEMS WITH APPLICATIONS, 2022, 14
[7] A gesture-based concept for speech movement control in articulatory speech synthesis
Kroeger, Bernd J.
Birkholz, Peter
VERBAL AND NONVERBAL COMMUNICATION BEHAVIOURS, 2007, 4775 : 174 - +
[8] Human Activity Recognition Using Cell Phone-Based Accelerometer and Convolutional Neural Network
Prasad, Ashwani
Tyagi, Amit Kumar
Althobaiti, Maha M.
Almulihi, Ahmed
Mansour, Romany F.
Mahmoud, Ayman M.
APPLIED SCIENCES-BASEL, 2021, 11 (24)
[9] Evaluation of a Phone-Based Anomaly Detection Approach for Dysarthric Speech
Laaridh, Imed
Fredouille, Corinne
Meunier, Christine
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016: 223 - 227
[10] Deep Neural Network Based Acoustic-to-articulatory Inversion Using Phone Sequence Information
Xie, Xurong
Liu, Xunying
Wang, Lan
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016: 1497 - 1501
