DIPHONES EVALUATION FOR TEXT-TO-SPEECH SYNTHESIS OF ITALIAN.

被引：0

作者：

Salza, P.L. ^{[1
]}

Sandri, S. ^{[1
]}

Foti, E. ^{[1
]}

机构：

[1] CSELT, Turin, Italy, CSELT, Turin, Italy

来源：

CSELT Technical Reports | 1988年 / 16卷 / 01期

关键词：

ACOUSTICS - INFORMATION SCIENCE - Language Translation and Linguistics;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

An experiment is described for the performance evaluation of: 1) specifically defined speech units against simple 'ideal' diphones for synthesizing vowel to vowel coarticulations and sonorant consonant clusters; 2) 'allodiphones' for synthesizing stressed mid vowel allophones. These new speech units should overcome the acoustic discontinuities of traditional synthesis by segments. By concatenation of properly segmented speech units, 20 test words were synthesized and grouped in 23 pairs, to be evaluated by subjective tests according to a three level paired comparison method. Both 'trained' and 'untrained' listeners could assign preference to one of the two stimuli of each pair or give no preference. Collected score shows that in particular contexts triphones provide better fitting of complex coarticulations, while allophones of mid vowels and /r/ require proper 'allodiphones'. The results of the experiment have been accounted for the design of a dictionary of new speech units for Italian text-to-speech synthesis of good acoustic quality.

引用

页码：9 / 11

共 50 条

[1] Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
Moulines, Eric, 1600, (09): : 5 - 6
[2] Implementation of high quality text-to-speech using words and diphones
Shukla, SR
Barnwell, TP
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4020 - 4020
[3] PHONETIC TRANSCRIPTION RULES FOR TEXT-TO-SPEECH SYNTHESIS OF ITALIAN
SALZA, PL
PHONETICA, 1990, 47 (1-2) : 66 - 83
[4] Computerized speech simulation: Subjective evaluation of an Italian text-to-speech synthesizer
Roccetti, M
Salomoni, P
Collinelli, I
SIMULATION IN INDUSTRY 2001, 2001, : 364 - 368
[5] INTONATION IN TEXT-TO-SPEECH SYNTHESIS - EVALUATION OF ALGORITHMS
AKERS, G
LENNIG, M
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1985, 77 (06): : 2157 - 2165
[6] TEXT-TO-SPEECH SYNTHESIS
SPROAT, RW
OLIVE, JP
AT&T TECHNICAL JOURNAL, 1995, 74 (02): : 35 - 44
[7] PITCH-SYNCHRONOUS WAVE-FORM PROCESSING TECHNIQUES FOR TEXT-TO-SPEECH SYNTHESIS USING DIPHONES
MOULINES, E
CHARPENTIER, F
SPEECH COMMUNICATION, 1990, 9 (5-6) : 453 - 467
[8] TEXT-TO-SPEECH TRANSLATION SYSTEM FOR ITALIAN
LESMO, L
MEZZALAMA, M
TORASSO, P
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1978, 10 (05): : 569 - 591
[9] Evaluation of Prosody in Text-to-Speech Synthesis System of Bangla
Basu, Tulika
Saha, Arup
2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
[10] Implementation and evaluation of a text-to-speech synthesis system for Turkish
Salor, Özgül
Pellom, Bryan
Demirekler, Mübeccel
EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology, 2003, : 1573 - 1576

← 1 2 3 4 5 →