AlpSynth - Concatenation-based speech synthesis for the Slovenian language

被引：0

作者：

Gros, JZ ^{[1
]}

Mihelic, A ^{[1
]}

Pavesic, N ^{[1
]}

Zganec, M ^{[1
]}

Gruden, S ^{[1
]}

机构：

[1] Alpineon RTD, SI-1000 Ljubljana, Slovenia

来源：

Proceedings ELMAR-2005 | 2005年

关键词：

speech processing; text-to-speech synthesis;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The paper focuses on the design and collection of a speech corpus of elemental speech units for AlpSynth, a corpus-driven Slovenian TTS system. We describe the design procedures for a new speech corpus: purpose definition, content selection, definition of recording conditions and requirements, corpus segmentation and annotation. First we describe and comment the results of a frequency analysis of Slovenian allophone strings performed on a large Slovenian input text that has been converted to allophones. Further we present a method we designed for selection of a compact and efficient set of Slovenian sentences out of a large text corpus so as to minimize the final representative speech corpus. The selected sentences cover all the desired most frequent Slovenian quadphones, triphones and subsequently diphones. We describe the recording sessions and recording conditions. We continue describing the corpus annotation process. Finally, we describe the archive structure of the spoken corpus and present the information on its structure, content and size.

引用

页码：213 / 216

页数：4

共 50 条

[31] RULE-SYNTHESIS OF SPEECH BY WORD CONCATENATION - FIRST STEP
OLIVE, JP
NAKATANI, LH
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (03): : 660 - 666
[32] MYANMAR SPEECH SYNTHESIS SYSTEM BY USING PHONEME CONCATENATION METHOD
Hlaing, Chaw Su
Thida, Aye
[J]. PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICSPC'17), 2017, : 399 - 404
[33] COMPUTER SYNTHESIS OF SPEECH BY CONCATENATION OF FORMANT-CODED WORDS
RABINER, LR
SCHAFER, RW
FLANAGAN, JL
[J]. BELL SYSTEM TECHNICAL JOURNAL, 1971, 50 (05): : 1541 - +
[34] Concatenation-based pre-trained convolutional neural networks using attention mechanism for environmental sound classification
Ashurov, Asadulla
Yi, Zhou
Liu, Hongqing
Yu, Zhao
Li, Manhai
[J]. APPLIED ACOUSTICS, 2024, 216
[35] Integration of rule-based formant synthesis and waveform concatenation: A hybrid approach to text-to-speech synthesis
Hertz, SR
[J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 87 - 90
[36] Context Features Based Pre-Selection and Weight Prediction in Concatenation Speech Synthesis System
Liu, Shanfeng
Wen, Zhengqi
Li, Ya
Tao, Jianghua
Liu, Bin
[J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 506 - 510
[37] Annotating discourse markers in spontaneous speech corpora on an example for the Slovenian language
Verdonik, Darinka
Rojc, Matej
Stabej, Marko
[J]. LANGUAGE RESOURCES AND EVALUATION, 2007, 41 (02) : 147 - 180
[38] Large vocabulary speech recognition of Slovenian language using morphological models
Maucec, M
Rotovnik, T
Kacic, Z
Horvat, B
[J]. IEEE REGION 8 EUROCON 2003, VOL B, PROCEEDINGS: COMPUTER AS A TOOL, 2003, : 158 - 161
[39] Annotating discourse markers in spontaneous speech corpora on an example for the Slovenian language
Darinka Verdonik
Matej Rojc
Marko Stabej
[J]. Language Resources and Evaluation, 2007, 41 : 147 - 180
[40] HMM-Based Speech Synthesis for the Greek Language
Karabetsos, Sotiris
Tsiakoulis, Pirros
Chalamandaris, Aimilios
Raptis, Spyros
[J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 349 - 356

← 1 2 3 4 5 →