AlpSynth - Concatenation-based speech synthesis for the Slovenian language

Authors
Gros, JZ [1]
Mihelic, A [1]
Pavesic, N [1]
Zganec, M [1]
Gruden, S [1]
Affiliations
[1] Alpineon RTD, SI-1000 Ljubljana, Slovenia
Source
Keywords
speech processing; text-to-speech synthesis;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The paper focuses on the design and collection of a speech corpus of elemental speech units for AlpSynth, a corpus-driven Slovenian TTS system. We describe the design procedures for a new speech corpus: purpose definition, content selection, definition of recording conditions and requirements, and corpus segmentation and annotation. First, we describe and comment on the results of a frequency analysis of Slovenian allophone strings, performed on a large Slovenian input text that was converted to allophones. We then present a method we designed for selecting a compact and efficient set of Slovenian sentences from a large text corpus so as to minimize the size of the final representative speech corpus. The selected sentences cover all of the desired most frequent Slovenian quadphones, triphones and, subsequently, diphones. Next, we describe the recording sessions and recording conditions, followed by the corpus annotation process. Finally, we describe the archive structure of the spoken corpus and present information on its structure, content and size.
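The abstract does not say how the sentence selection is carried out. As a rough illustration only, the sketch below uses a plain greedy set-cover heuristic, which is an assumption and not necessarily the authors' method. The names `ngrams`, `frequent_units` and `select_sentences` are hypothetical, and the toy corpus uses single letters in place of Slovenian allophones.

```python
from collections import Counter


def ngrams(units, n):
    """Return all contiguous strings of n units in one sentence."""
    return [tuple(units[i:i + n]) for i in range(len(units) - n + 1)]


def frequent_units(sentences, n, top_k):
    """Frequency analysis: the top_k most frequent n-unit strings in the corpus."""
    counts = Counter(g for s in sentences for g in ngrams(s, n))
    return {g for g, _ in counts.most_common(top_k)}


def select_sentences(sentences, targets):
    """Greedy cover (assumed heuristic): repeatedly pick the sentence that
    adds the largest number of still-uncovered target units."""
    sizes = {len(t) for t in targets}
    cover = [{g for n in sizes for g in ngrams(s, n)} & targets
             for s in sentences]
    remaining = set(targets)
    chosen = []
    while remaining:
        best = max(range(len(sentences)),
                   key=lambda i: len(cover[i] & remaining))
        gained = cover[best] & remaining
        if not gained:
            break  # the remaining targets occur in no sentence
        chosen.append(best)
        remaining -= gained
    return chosen, remaining


# Toy corpus: each sentence is a list of allophone symbols (letters here).
corpus = [list("abcd"), list("bcda"), list("abc"), list("dab")]
targets = frequent_units(corpus, n=2, top_k=4)
chosen, uncovered = select_sentences(corpus, targets)
```

Here `chosen` holds the indices of the selected sentences and `uncovered` holds any target units that no sentence contains. The same routine accepts a mixed target set, for example the union of frequent quadphones and triphones, since it derives the unit lengths from the targets themselves.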
Pages: 213-216
Number of pages: 4