DIPHONES EVALUATION FOR TEXT-TO-SPEECH SYNTHESIS OF ITALIAN.

被引:0
|
作者
Salza, P.L. [1 ]
Sandri, S. [1 ]
Foti, E. [1 ]
机构
[1] CSELT, Turin, Italy, CSELT, Turin, Italy
来源
CSELT Technical Reports | 1988年 / 16卷 / 01期
关键词
ACOUSTICS - INFORMATION SCIENCE - Language Translation and Linguistics;
D O I
暂无
中图分类号
学科分类号
摘要
An experiment is described for the performance evaluation of: 1) specifically defined speech units against simple 'ideal' diphones for synthesizing vowel to vowel coarticulations and sonorant consonant clusters; 2) 'allodiphones' for synthesizing stressed mid vowel allophones. These new speech units should overcome the acoustic discontinuities of traditional synthesis by segments. By concatenation of properly segmented speech units, 20 test words were synthesized and grouped in 23 pairs, to be evaluated by subjective tests according to a three level paired comparison method. Both 'trained' and 'untrained' listeners could assign preference to one of the two stimuli of each pair or give no preference. Collected score shows that in particular contexts triphones provide better fitting of complex coarticulations, while allophones of mid vowels and /r/ require proper 'allodiphones'. The results of the experiment have been accounted for the design of a dictionary of new speech units for Italian text-to-speech synthesis of good acoustic quality.
引用
收藏
页码:9 / 11
相关论文
共 50 条
  • [2] Implementation of high quality text-to-speech using words and diphones
    Shukla, SR
    Barnwell, TP
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4020 - 4020
  • [3] PHONETIC TRANSCRIPTION RULES FOR TEXT-TO-SPEECH SYNTHESIS OF ITALIAN
    SALZA, PL
    PHONETICA, 1990, 47 (1-2) : 66 - 83
  • [4] Computerized speech simulation: Subjective evaluation of an Italian text-to-speech synthesizer
    Roccetti, M
    Salomoni, P
    Collinelli, I
    SIMULATION IN INDUSTRY 2001, 2001, : 364 - 368
  • [5] INTONATION IN TEXT-TO-SPEECH SYNTHESIS - EVALUATION OF ALGORITHMS
    AKERS, G
    LENNIG, M
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1985, 77 (06): : 2157 - 2165
  • [6] TEXT-TO-SPEECH SYNTHESIS
    SPROAT, RW
    OLIVE, JP
    AT&T TECHNICAL JOURNAL, 1995, 74 (02): : 35 - 44
  • [7] PITCH-SYNCHRONOUS WAVE-FORM PROCESSING TECHNIQUES FOR TEXT-TO-SPEECH SYNTHESIS USING DIPHONES
    MOULINES, E
    CHARPENTIER, F
    SPEECH COMMUNICATION, 1990, 9 (5-6) : 453 - 467
  • [8] TEXT-TO-SPEECH TRANSLATION SYSTEM FOR ITALIAN
    LESMO, L
    MEZZALAMA, M
    TORASSO, P
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1978, 10 (05): : 569 - 591
  • [9] Evaluation of Prosody in Text-to-Speech Synthesis System of Bangla
    Basu, Tulika
    Saha, Arup
    2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [10] Implementation and evaluation of a text-to-speech synthesis system for Turkish
    Salor, Özgül
    Pellom, Bryan
    Demirekler, Mübeccel
    EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology, 2003, : 1573 - 1576